LangSmith Self-Hosted Pricing - Enterprise Add-On, Cost Drivers, Architecture & Budget Control

If you’re searching for “LangSmith self-hosted pricing,” you’re usually not looking for a simple public price table. You’re trying to answer a bigger question: What does it cost to run LangSmith in my own environment and what exactly am I paying for?

This guide explains how self-hosted LangSmith fits into the LangSmith plan structure, what “self-hosted” and “hybrid” mean in practice, and how to estimate and control spend across seats, traces, retention, deployments, and Agent Builder usage. It’s written for builders and engineering leaders who need clarity before production rollout.

White background • Black text • Mobile-friendly Enterprise add-on (self-hosted) Hybrid vs full self-hosted vs standalone servers Seats + Traces + Retention + Deployments

Executive overview

LangSmith is an agent engineering platform for observability, evaluation, and deployment. In the cloud (SaaS), LangChain manages the infrastructure. In a self-hosted setup, you run the platform components (and the data stores) inside your own cloud/VPC or on-prem environment. Self-hosting is designed for security-conscious organizations that need tighter data residency and compliance controls, and it is positioned as an Enterprise add-on.

Self-hosted = Enterprise add-on Self-hosted LangSmith is offered as an Enterprise add-on and requires a license key for your environment.

Billing isn’t “model tokens” Model/token costs are billed by your model provider; LangSmith cost is driven by seats, traces, retention, and deployment usage.

Two self-hosted “shapes” You can self-host observability/evals only, or the full platform (including deployment management and Agent Server workflows).

Hybrid is a common compromise Hybrid uses a managed control plane with a self-hosted data plane; it can satisfy many governance needs without running every component yourself.

Jump to pricing model See deployment models Go to budgeting

“Last updated” notes: pricing and product terms can change. Always confirm the latest values in your account and the official pricing page before committing to production budgets.

1) What “LangSmith self-hosted” means

“Self-hosted LangSmith” means running LangSmith in your own environment rather than using the fully managed LangChain cloud offering. In other words, you manage the infrastructure, data stores, and operations inside your own boundary—typically a VPC or on-prem cluster.

Important framing: self-hosted LangSmith is positioned as an Enterprise add-on for organizations with advanced security and hosting requirements. In practice, that means you’ll typically talk to sales to obtain licensing and a trial key for your environment.

Self-hosted doesn’t necessarily mean “everything is local”

In modern “agent platforms,” self-hosting can mean multiple things depending on architecture. LangSmith supports different configurations: you can self-host the observability and evaluation platform without hosting agent deployment, or you can run the full platform. You can also use a hybrid approach where a managed control plane orchestrates deployments while your data plane runs in your environment.

LangSmith Pricing Calculator: A LangSmith Pricing Calculator estimates your monthly LangSmith cost based on key drivers like seats, trace volume, retention (base vs extended), and optional deployments/Agent Builder runs, helping you budget and avoid surprise bills.

Self-hosted: observability & evaluation only

This option is ideal if you primarily want tracing, monitoring, datasets, and evaluation workflows, but you do not want LangSmith to manage your agent deployment infrastructure. Your application sends traces to your self-hosted LangSmith instance, and your team uses the UI and APIs to debug, evaluate, and improve quality. Your actual agent runtime remains wherever you already deploy it (your Kubernetes cluster, serverless, VM fleet, or on-prem services).

Best for teams with strict data residency for traces and evaluation artifacts
Useful when you already have a mature deployment pipeline
Reduces platform scope compared to “full self-hosted”

Self-hosted: full platform (obs + eval + deployment)

This option enables deployment management in addition to observability and evaluation. In this mode, LangSmith can help manage deployments of agent applications (often packaged as containers) to run as agent servers. This introduces additional components—such as control plane and data plane elements—depending on the exact hosting model you choose.

Best for organizations standardizing agent deployments across teams
Good when you want a consistent “ship + monitor + evaluate” workflow
Requires more infrastructure and operational maturity

2) Who typically needs self-hosted LangSmith

Self-hosting is rarely about “saving money.” It’s usually about security, compliance, governance, and control. Teams choose self-hosting when they need to keep trace data and evaluation artifacts inside their own boundary or comply with strict internal policies.

Compliance & data residency

If your organization must keep data inside a specific region or controlled network segment, self-hosting can help you meet internal rules for data processing and storage. This matters when traces include user inputs, tool outputs, or other sensitive artifacts.

Security & access controls

Enterprise setups often require SSO, fine-grained RBAC, audit trails, and policy enforcement. Hosting the platform inside your environment can simplify alignment with existing security architecture and centralized monitoring.

Operational control

Some teams need full control over scaling, backups, upgrade windows, and incident response. If LangSmith becomes core infrastructure for an agent platform, owning the operational lifecycle can be a strategic choice.

When cloud is usually enough

If your team is early-stage, iterating quickly, and does not have strict data residency requirements, a managed cloud option usually reduces friction. You get fast onboarding and fewer operational burdens. You can still use strong cost-control tactics (sampling and retention policies) even in cloud.

3) How LangSmith pricing works (and where self-hosted fits)

LangSmith pricing is typically structured around plans plus usage. Even if you self-host, you should understand the standard billing units because they form the baseline mental model for budgeting: seats, traces (by retention), and optionally deployments and Agent Builder runs.

Plan anchors (cloud pricing reference)

The public plan cards are most relevant for cloud/SaaS usage, but they also clarify the usage units LangSmith cares about. Self-hosting is positioned as an Enterprise add-on, which means the exact contract is typically customized. Still, understanding the baseline units helps you estimate usage and compare options.

Plan	Who it’s for	Seat pricing (reference)	Included traces (base)	Support & hosting notes
Developer	Solo user getting started	$0 / seat per month	Up to 5k base traces / month included	Community support; “pay-as-you-go” beyond included usage
Plus	Teams building & deploying agents	$39 / seat per month	Up to 10k base traces / month included	Includes 1 dev-sized deployment; email support; up to 3 workspaces
Enterprise	Advanced hosting, security & support	Custom	Custom	Alternative hosting options including hybrid and self-hosted; SSO/RBAC; SLAs

Core usage units

Seats

A seat is a human user who can access your LangSmith organization/workspaces. Seats are usually a predictable, monthly subscription component. Self-hosted Enterprise contracts often customize seat counts and access models, but the idea is the same: how many humans need the UI/API access.

If only a few engineers need access, seat cost stays bounded.
If product, QA, and analytics teams all need access, seat count can grow quickly.

Traces

A trace represents a single execution of your application—agent run, evaluator run, or playground session. A single trace can contain many internal events (LLM calls, tool calls, steps), but the primary budgeting unit is usually how many traces you store.

Trace volume scales with product usage, test automation, and evaluation cadence.
In self-hosted mode you also pay for underlying storage/compute you operate, so trace volume affects your infra bill too.

4) Trace retention: the biggest pricing lever

Retention is often the make-or-break factor for cost predictability. LangSmith distinguishes between shorter-lived traces and long-retained traces. The longer you retain, the more you store and index, and the more valuable the data can become for continuous improvement—especially when traces contain feedback.

Trace type	Retention window	Cost reference	Best used for
Base traces	14 days	$2.50 per 1k traces	Debugging, short-lived monitoring, ad-hoc analysis
Extended traces	400 days	$5.00 per 1k traces	Feedback loops, long-term quality tracking, curated datasets
Upgrade base → extended	Convert after capture	$2.50 per 1k traces	Selectively keep only high-signal traces long-term

Why retention changes everything (even in self-hosted mode)

In SaaS, retention mainly affects your LangSmith usage bill. In self-hosted mode, retention affects both:

License/usage contract terms (customized under Enterprise) and how you structure “what you store where.”
Your infrastructure costs because you are responsible for the databases and storage systems that persist traces, indexes, embeddings (if used), and logs.

This is why most mature teams design a retention policy rather than simply “store everything forever.” The best policy keeps enough signal to debug and improve quality while aggressively discarding low-value data.

Practical rule: Store most traces as base by default, then “promote” only high-signal traces to extended: examples with user feedback, evaluator annotations, regressions, incident windows, and golden datasets used in recurring evals.

5) Deployment & Agent Builder cost drivers (relevant when you host agents)

If your self-hosted strategy includes managed deployments and no-code agent construction, you’ll want to budget for the units associated with deployments and Agent Builder. Even if you don’t use them today, this section helps you plan for future adoption.

Deployment usage units

Item	What it means	Reference cost (cloud)	Notes for self-hosted
Deployment runs	Each invocation of a deployed agent/app	$0.005 / deployment run	In self-hosted, the “run” unit still matters for capacity planning and may be part of contract terms; you also pay infra to execute those runs.
Uptime (Dev)	Time a development deployment is live	$0.0007 / minute	Even if a dev deployment is “cheap,” leaving many deployments running 24/7 creates persistent spend (and operational surface area).
Uptime (Prod)	Time a production deployment is live	$0.0036 / minute	Production deployments run continuously; in self-hosted, plan cluster capacity, autoscaling, and availability for true production workloads.

Agent Builder usage units

Item	Reference allowance	Overage reference	Important billing note
Agent Builder agents	Developer: 1 • Plus: unlimited	Enterprise: custom packages	Agent Builder is designed for rapid iteration; scale wisely if you turn it into a high-volume runtime.
Agent Builder runs	Developer: 50/mo • Plus: 500/mo	$0.05 / run	Runs are traced by default and count toward total traces; model tokens are billed separately by your model provider.

Why this matters for self-hosted planning

In self-hosted Enterprise, your exact pricing will be contract-specific, but the drivers remain the same: how many invocations you run, how long you keep deployments online, and how many traces each workflow produces. The “double counting” pattern to watch is: Agent Builder runs generate traces. If you let a no-code workflow become the primary production path, you can end up scaling both “run volume” and “trace volume” together.

6) Self-hosted models: obs/eval only vs full platform vs standalone servers

When people say “self-hosted LangSmith,” they often mean one of three setups. Understanding the differences is essential because the infrastructure scope, operational effort, and cost profile can be drastically different.

A) Self-host observability & evaluation

You run LangSmith to store traces, evaluations, datasets, and monitoring signals inside your environment. Your agents/apps may run anywhere, but all the debugging and quality workflows happen in your self-hosted UI/API.

Lowest complexity self-hosted option
Good for strict data residency for trace data
Deployment remains your responsibility

B) Full platform (with control plane + data plane)

You run the full platform including deployment management for agent applications. Teams can build Docker images and deploy via the UI into your infrastructure. This is designed for standardized “ship + run + observe” workflows.

Most powerful option
More components and operational requirements
Best for large orgs standardizing agent deployment

C) Standalone servers (deploy without control plane)

In some workflows you can deploy agents directly without a control plane. This keeps the deployment path closer to your existing infra patterns while still enabling LangSmith observability and evaluation.

Useful if you want more direct control
Can be easier to integrate with existing CI/CD
May reduce “platform surface area”

Supported compute platforms (what you should expect)

Self-hosted and hybrid setups typically assume Kubernetes as the primary compute platform for data-plane components, listeners, and agent servers. Your storage layer often includes managed or self-managed Postgres plus supporting services (such as caching/queues) depending on scale and features. The exact topology depends on whether you want only observability/evals or full deployment workflows.

Reality check: Self-hosting is not just “install a container and forget it.” It’s a platform: you’ll need capacity planning, backups, upgrades, observability of the observability layer, and incident response runbooks.

7) Hybrid vs full self-hosted: choosing the right boundary

Many organizations pick hybrid as an intermediate step. Hybrid is designed to keep the agent runtime and data-plane components in your environment while using a managed control plane for orchestration and UI workflows.

Hybrid architecture (plain English)

In a hybrid setup, your self-hosted data plane communicates with a control plane to poll for changes and enable deployment management. Traffic to the control plane is encrypted over HTTPS and authenticated using a LangSmith API key. Hybrid may require firewall allowlisting, and some private connectivity patterns may not be supported depending on your network requirements.

Dimension	Hybrid	Full self-hosted
Control plane	Managed control plane coordinates workflows	You run control plane components yourself
Data plane	Runs in your environment (Kubernetes)	Runs in your environment (Kubernetes or equivalent)
Primary motivation	Balance control + reduced operational burden	Maximum control and data boundary ownership
Operational complexity	Medium	High
Best for	Organizations needing data plane isolation and standardized deployment flows	Highly security-conscious orgs with strict internal controls and self-operated governance

Decision rubric

Choose hybrid if…

You want agent runtimes to stay in your environment but can accept a managed control plane.
You need standard deployment workflows without operating every platform component.
You want a faster path to production with fewer operational responsibilities.

Choose full self-hosted if…

Your security policy requires the entire platform boundary inside your environment.
You need maximum control over upgrades, audit rules, and data processing.
You have platform SRE capacity to operate a multi-component system reliably.

8) Estimating cost: a simple worksheet for self-hosted planning

Enterprise self-hosted pricing is custom, but you can estimate workload and build a budget model with the same units: seats, trace volume (base vs extended), deployments (runs + uptime), and Agent Builder runs. For self-hosted, you also add an “infrastructure envelope” for databases, storage, and cluster capacity.

Step 1 — estimate trace volume

Most teams underestimate trace volume because they forget that traces come from more than production traffic. You’ll likely have:

User traffic traces (end-user sessions, chat conversations, agent invocations)
Automated test traces (CI pipelines, integration tests, load tests)
Evaluation traces (offline eval batches, regression suites)
Playground/studio traces (humans experimenting and debugging)
Agent Builder traces (every Agent Builder run is traced by default)

Trace volume worksheet (copy/paste)

Inputs:
- Production agent invocations per day: ________
- Days per month: 30 (or your local month)
- Tracing sampling rate for successful runs: ________%  (example: 10%)
- Error tracing rate: 100% (recommended)
- CI / automated tests per day producing traces: ________
- Offline eval traces per month: ________
- Agent Builder runs per month: ________ (each run creates traces)

Compute:
1) Production successful traces = invocations/day * days/month * sampling_rate
2) Production error traces = invocations/day * days/month * error_rate (if estimated separately)
3) Total traces/month = success traces + error traces + CI traces + eval traces + Agent Builder traces

Split:
- Base traces/month: ________
- Extended traces/month: ________  (only high-signal promoted traces)

Step 2 — choose a retention strategy

Most organizations use a tiered approach:

Base retention for most traces: cheap, short-lived, great for debugging.
Extended retention for high-signal traces: those with feedback, evaluation labels, incidents, and golden datasets.

Policy pattern: “Default to base; promote to extended only if one of these is true: (1) user left feedback, (2) evaluator labeled it, (3) incident window, (4) regression test case, (5) golden dataset.”

Step 3 - deployment and Agent Builder load

If you plan to use deployment workflows, model:

How many deployments will be kept live 24/7 (production)
How many dev deployments are “always on” vs only used during working hours
Average deployment runs per day
Peak concurrency requirements (influence cluster sizing)
Whether Agent Builder is used for prototyping only or becomes operational tooling

Step 4 - infrastructure envelope (self-hosted only)

Self-hosting adds a second budget line: your infrastructure costs. You should estimate:

Database costs (Postgres storage, read/write IOPS, backups, replicas)
Object storage costs (if you store larger artifacts externally)
Cluster compute costs (Kubernetes nodes, autoscaling, headroom for incident spikes)
Operational tooling (monitoring, logging, secret management, CI/CD)

A common approach is to treat infra as a “fixed + variable” model: fixed for baseline platform availability, variable for trace/deployment growth. If you don’t do this, you can win the licensing discussion and still get surprised by the cloud bill.

9) How to keep self-hosted LangSmith costs predictable

“Predictable” doesn’t mean “cheap.” It means you can explain the bill, anticipate changes, and prevent runaway spikes. The best cost controls are not financial tricks—they are engineering policies and operational guardrails.

A) Control trace volume with sampling

In production, tracing 100% of traffic can be expensive and noisy. The mature pattern is:

Trace 100% of errors (and optionally slow outliers).
Sample successful runs at a rate that gives you enough visibility (often 5–20%).
Increase sampling temporarily during launches, incidents, or model changes—then reduce.

B) Default to base retention; promote selectively

Retention is the biggest lever. Store most traces as base and promote only high-value traces to extended. If your team needs long-term visibility for quality and compliance, make the promotion criteria explicit.

C) Separate environments and workspaces

A common mistake is dumping dev, staging, and production traces into the same place with no boundaries. Instead:

Use separate workspaces/environments for dev vs staging vs production.
Apply stricter sampling in noisy environments.
Use short retention for dev workspaces by default.

D) Reduce payload bloat

Even if the unit cost is “per trace,” huge trace payloads can slow down debugging and increase storage pressure. Trim or externalize:

Massive tool outputs (store externally, keep a reference ID in trace metadata)
Large documents (store in object storage and link)
Repetitive prompt templates (store as versions in a prompt hub, not embedded in every trace)

E) Treat deployments like infrastructure, not toys

If you use deployment features, keep an eye on “always on” resources. Dev deployments are for testing; production deployments are for customer-facing reliability. Avoid leaving many unused deployments running.

F) Create a “bill explanation” dashboard

In a healthy organization, the bill is never a mystery. You should be able to answer:

Which services produce the most traces?
Which environments are the noisiest?
How many traces are promoted to extended and why?
Which deployments run 24/7 and what business value do they serve?
How many Agent Builder runs occurred and who triggered them?

Tip: Put ownership on the engineering team. If there is no owner for trace policy and retention rules, cost tends to drift upward over time.

10) Implementation checklist (self-hosted readiness)

Use this checklist to move from “we want self-hosting” to “we can run it reliably with a predictable budget.” You can treat it as a migration plan, or as a readiness review for security and platform teams.

Licensing & program decisions

Confirm whether you need hybrid or full self-hosted boundary.
Define required security posture: SSO, RBAC, auditing, retention rules.
Engage sales for Enterprise + self-hosted add-on details and a trial license key.
Decide the number of workspaces/environments (dev/staging/prod).

Architecture & operations

Pick compute platform (commonly Kubernetes) and size baseline capacity.
Plan storage: Postgres sizing, backups, replica strategy, restore testing.
Set up secrets management, TLS, and network policies.
Establish upgrade strategy: maintenance windows, rollback plan.
Monitor the platform: metrics, logs, alerts for ingestion/backlog/storage pressure.

Trace policy & privacy

Define what to redact or avoid logging in traces (PII, secrets, tokens).
Set default retention: base vs extended promotion rules.
Define sampling rates per environment.
Implement payload trimming and external artifact storage patterns.

Rollout & governance

Pilot with one team and one service first; measure trace volume and storage growth.
Document “bill explanation” and cost drivers for leadership.
Train teams on tracing best practices and retention policy.
Review monthly: sampling, retention promotions, and deployment uptime.

11) FAQ - LangSmith self-hosted pricing

These answers focus on the self-hosted context: licensing, boundary decisions, and the cost model that matters once you run the platform yourself.

1) Is LangSmith self-hosted free?

LangSmith is proprietary software. There is a free, self-hosted version with access to basic features, but the cloud and paid self-hosted options are offered as paid services. For advanced self-hosted setups, self-hosting is positioned as an Enterprise add-on that requires contacting sales for licensing.

2) Is “self-hosted LangSmith” an Enterprise plan or an add-on?

Self-hosted LangSmith is positioned as an add-on to the Enterprise plan, typically designed for large, security-conscious customers. You generally obtain a license key to run LangSmith in your environment through a sales process.

3) What do I pay for in self-hosted LangSmith?

You should think in two layers: (1) Enterprise licensing/contract terms (custom, includes self-hosted add-on), and (2) your infrastructure costs (databases, storage, compute, backups, monitoring). Even when licensing is negotiated, the same usage drivers still matter: seats, trace volume (base vs extended retention), deployments (runs + uptime), and Agent Builder usage.

4) Are model/token costs included in LangSmith self-hosted pricing?

No. Model usage is billed separately by your model provider. LangSmith costs are about the platform: observability, evaluation, and deployment workflows. If you use Agent Builder or deployments, you still provide your model provider keys and pay your provider for tokens.

5) What’s the difference between hybrid and full self-hosted?

In hybrid, your data plane runs in your environment while a managed control plane coordinates deployments and workflows. In full self-hosted, you run the platform boundary yourself. Hybrid can reduce operational burden while keeping core runtime/data-plane components in your environment.

6) Can I self-host only observability and evaluation (without deployment)?

Yes. LangSmith supports self-hosted configurations for observability and evaluation only, and separate configurations for the full platform including deployment management. This is often the simplest self-hosted entry point.

7) What are traces and why do they matter for pricing?

A trace represents a single execution of your application (agent run, evaluator run, or playground session). Traces are the core unit for observability and evaluation. Trace volume and retention policy directly drive usage costs in SaaS and drive infrastructure costs in self-hosted setups.

8) What’s the difference between base and extended traces?

Base traces have shorter retention (14 days) and are priced for high-volume debugging. Extended traces have longer retention (400 days) and are often used for feedback loops and long-term improvement workflows. A common strategy is to capture most traces as base and selectively upgrade only high-signal traces.

9) How do I prevent surprise bills in self-hosted mode?

You prevent surprises by managing the drivers: sampling (trace fewer successful runs), retention (promote to extended selectively), environment separation (dev vs prod), and deployment hygiene (don’t keep unnecessary deployments online). Also measure storage growth and database load early in a pilot.

10) Does Agent Builder affect trace usage?

Yes. Agent Builder runs are traced by default and count toward total traces. If Agent Builder becomes a high-volume workflow, it can increase both run volume and trace volume. Many teams use it for prototyping and then migrate hot paths into code-based deployments.