LangSmith Self-Hosted Pricing - Enterprise Add-On, Cost Drivers, Architecture & Budget Control
If you’re searching for “LangSmith self-hosted pricing,” you’re usually not looking for a simple public price table. You’re trying to answer a bigger question: What does it cost to run LangSmith in my own environment and what exactly am I paying for?
This guide explains how self-hosted LangSmith fits into the LangSmith plan structure, what “self-hosted” and “hybrid” mean in practice, and how to estimate and control spend across seats, traces, retention, deployments, and Agent Builder usage. It’s written for builders and engineering leaders who need clarity before production rollout.
Executive overview
LangSmith is an agent engineering platform for observability, evaluation, and deployment. In the cloud (SaaS), LangChain manages the infrastructure. In a self-hosted setup, you run the platform components (and the data stores) inside your own cloud/VPC or on-prem environment. Self-hosting is designed for security-conscious organizations that need tighter data residency and compliance controls, and it is positioned as an Enterprise add-on.
“Last updated” notes: pricing and product terms can change. Always confirm the latest values in your account and the official pricing page before committing to production budgets.
1) What “LangSmith self-hosted” means
“Self-hosted LangSmith” means running LangSmith in your own environment rather than using the fully managed LangChain cloud offering. In other words, you manage the infrastructure, data stores, and operations inside your own boundary—typically a VPC or on-prem cluster.
Self-hosted doesn’t necessarily mean “everything is local”
In modern “agent platforms,” self-hosting can mean multiple things depending on architecture. LangSmith supports different configurations: you can self-host the observability and evaluation platform without hosting agent deployment, or you can run the full platform. You can also use a hybrid approach where a managed control plane orchestrates deployments while your data plane runs in your environment.
Self-hosted: observability & evaluation only
This option is ideal if you primarily want tracing, monitoring, datasets, and evaluation workflows, but you do not want LangSmith to manage your agent deployment infrastructure. Your application sends traces to your self-hosted LangSmith instance, and your team uses the UI and APIs to debug, evaluate, and improve quality. Your actual agent runtime remains wherever you already deploy it (your Kubernetes cluster, serverless, VM fleet, or on-prem services).
- Best for teams with strict data residency for traces and evaluation artifacts
- Useful when you already have a mature deployment pipeline
- Reduces platform scope compared to “full self-hosted”
Self-hosted: full platform (obs + eval + deployment)
This option enables deployment management in addition to observability and evaluation. In this mode, LangSmith can help manage deployments of agent applications (often packaged as containers) to run as agent servers. This introduces additional components—such as control plane and data plane elements—depending on the exact hosting model you choose.
- Best for organizations standardizing agent deployments across teams
- Good when you want a consistent “ship + monitor + evaluate” workflow
- Requires more infrastructure and operational maturity
2) Who typically needs self-hosted LangSmith
Self-hosting is rarely about “saving money.” It’s usually about security, compliance, governance, and control. Teams choose self-hosting when they need to keep trace data and evaluation artifacts inside their own boundary or comply with strict internal policies.
Compliance & data residency
If your organization must keep data inside a specific region or controlled network segment, self-hosting can help you meet internal rules for data processing and storage. This matters when traces include user inputs, tool outputs, or other sensitive artifacts.
Security & access controls
Enterprise setups often require SSO, fine-grained RBAC, audit trails, and policy enforcement. Hosting the platform inside your environment can simplify alignment with existing security architecture and centralized monitoring.
Operational control
Some teams need full control over scaling, backups, upgrade windows, and incident response. If LangSmith becomes core infrastructure for an agent platform, owning the operational lifecycle can be a strategic choice.
When cloud is usually enough
If your team is early-stage, iterating quickly, and does not have strict data residency requirements, a managed cloud option usually reduces friction. You get fast onboarding and fewer operational burdens. You can still use strong cost-control tactics (sampling and retention policies) even in cloud.
3) How LangSmith pricing works (and where self-hosted fits)
LangSmith pricing is typically structured around plans plus usage. Even if you self-host, you should understand the standard billing units because they form the baseline mental model for budgeting: seats, traces (by retention), and optionally deployments and Agent Builder runs.
Plan anchors (cloud pricing reference)
The public plan cards are most relevant for cloud/SaaS usage, but they also clarify the usage units LangSmith cares about. Self-hosting is positioned as an Enterprise add-on, which means the exact contract is typically customized. Still, understanding the baseline units helps you estimate usage and compare options.
| Plan | Who it’s for | Seat pricing (reference) | Included traces (base) | Support & hosting notes |
|---|---|---|---|---|
| Developer | Solo user getting started | $0 / seat per month | Up to 5k base traces / month included | Community support; “pay-as-you-go” beyond included usage |
| Plus | Teams building & deploying agents | $39 / seat per month | Up to 10k base traces / month included | Includes 1 dev-sized deployment; email support; up to 3 workspaces |
| Enterprise | Advanced hosting, security & support | Custom | Custom | Alternative hosting options including hybrid and self-hosted; SSO/RBAC; SLAs |
Core usage units
Seats
A seat is a human user who can access your LangSmith organization/workspaces. Seats are usually a predictable, monthly subscription component. Self-hosted Enterprise contracts often customize seat counts and access models, but the idea is the same: how many humans need the UI/API access.
- If only a few engineers need access, seat cost stays bounded.
- If product, QA, and analytics teams all need access, seat count can grow quickly.
Traces
A trace represents a single execution of your application—agent run, evaluator run, or playground session. A single trace can contain many internal events (LLM calls, tool calls, steps), but the primary budgeting unit is usually how many traces you store.
- Trace volume scales with product usage, test automation, and evaluation cadence.
- In self-hosted mode you also pay for underlying storage/compute you operate, so trace volume affects your infra bill too.
4) Trace retention: the biggest pricing lever
Retention is often the make-or-break factor for cost predictability. LangSmith distinguishes between shorter-lived traces and long-retained traces. The longer you retain, the more you store and index, and the more valuable the data can become for continuous improvement—especially when traces contain feedback.
| Trace type | Retention window | Cost reference | Best used for |
|---|---|---|---|
| Base traces | 14 days | $2.50 per 1k traces | Debugging, short-lived monitoring, ad-hoc analysis |
| Extended traces | 400 days | $5.00 per 1k traces | Feedback loops, long-term quality tracking, curated datasets |
| Upgrade base → extended | Convert after capture | $2.50 per 1k traces | Selectively keep only high-signal traces long-term |
Why retention changes everything (even in self-hosted mode)
In SaaS, retention mainly affects your LangSmith usage bill. In self-hosted mode, retention affects both:
- License/usage contract terms (customized under Enterprise) and how you structure “what you store where.”
- Your infrastructure costs because you are responsible for the databases and storage systems that persist traces, indexes, embeddings (if used), and logs.
This is why most mature teams design a retention policy rather than simply “store everything forever.” The best policy keeps enough signal to debug and improve quality while aggressively discarding low-value data.
5) Deployment & Agent Builder cost drivers (relevant when you host agents)
If your self-hosted strategy includes managed deployments and no-code agent construction, you’ll want to budget for the units associated with deployments and Agent Builder. Even if you don’t use them today, this section helps you plan for future adoption.
Deployment usage units
| Item | What it means | Reference cost (cloud) | Notes for self-hosted |
|---|---|---|---|
| Deployment runs | Each invocation of a deployed agent/app | $0.005 / deployment run | In self-hosted, the “run” unit still matters for capacity planning and may be part of contract terms; you also pay infra to execute those runs. |
| Uptime (Dev) | Time a development deployment is live | $0.0007 / minute | Even if a dev deployment is “cheap,” leaving many deployments running 24/7 creates persistent spend (and operational surface area). |
| Uptime (Prod) | Time a production deployment is live | $0.0036 / minute | Production deployments run continuously; in self-hosted, plan cluster capacity, autoscaling, and availability for true production workloads. |
Agent Builder usage units
| Item | Reference allowance | Overage reference | Important billing note |
|---|---|---|---|
| Agent Builder agents | Developer: 1 • Plus: unlimited | Enterprise: custom packages | Agent Builder is designed for rapid iteration; scale wisely if you turn it into a high-volume runtime. |
| Agent Builder runs | Developer: 50/mo • Plus: 500/mo | $0.05 / run | Runs are traced by default and count toward total traces; model tokens are billed separately by your model provider. |
Why this matters for self-hosted planning
In self-hosted Enterprise, your exact pricing will be contract-specific, but the drivers remain the same: how many invocations you run, how long you keep deployments online, and how many traces each workflow produces. The “double counting” pattern to watch is: Agent Builder runs generate traces. If you let a no-code workflow become the primary production path, you can end up scaling both “run volume” and “trace volume” together.
6) Self-hosted models: obs/eval only vs full platform vs standalone servers
When people say “self-hosted LangSmith,” they often mean one of three setups. Understanding the differences is essential because the infrastructure scope, operational effort, and cost profile can be drastically different.
A) Self-host observability & evaluation
You run LangSmith to store traces, evaluations, datasets, and monitoring signals inside your environment. Your agents/apps may run anywhere, but all the debugging and quality workflows happen in your self-hosted UI/API.
- Lowest complexity self-hosted option
- Good for strict data residency for trace data
- Deployment remains your responsibility
B) Full platform (with control plane + data plane)
You run the full platform including deployment management for agent applications. Teams can build Docker images and deploy via the UI into your infrastructure. This is designed for standardized “ship + run + observe” workflows.
- Most powerful option
- More components and operational requirements
- Best for large orgs standardizing agent deployment
C) Standalone servers (deploy without control plane)
In some workflows you can deploy agents directly without a control plane. This keeps the deployment path closer to your existing infra patterns while still enabling LangSmith observability and evaluation.
- Useful if you want more direct control
- Can be easier to integrate with existing CI/CD
- May reduce “platform surface area”
Supported compute platforms (what you should expect)
Self-hosted and hybrid setups typically assume Kubernetes as the primary compute platform for data-plane components, listeners, and agent servers. Your storage layer often includes managed or self-managed Postgres plus supporting services (such as caching/queues) depending on scale and features. The exact topology depends on whether you want only observability/evals or full deployment workflows.
7) Hybrid vs full self-hosted: choosing the right boundary
Many organizations pick hybrid as an intermediate step. Hybrid is designed to keep the agent runtime and data-plane components in your environment while using a managed control plane for orchestration and UI workflows.
Hybrid architecture (plain English)
In a hybrid setup, your self-hosted data plane communicates with a control plane to poll for changes and enable deployment management. Traffic to the control plane is encrypted over HTTPS and authenticated using a LangSmith API key. Hybrid may require firewall allowlisting, and some private connectivity patterns may not be supported depending on your network requirements.
| Dimension | Hybrid | Full self-hosted |
|---|---|---|
| Control plane | Managed control plane coordinates workflows | You run control plane components yourself |
| Data plane | Runs in your environment (Kubernetes) | Runs in your environment (Kubernetes or equivalent) |
| Primary motivation | Balance control + reduced operational burden | Maximum control and data boundary ownership |
| Operational complexity | Medium | High |
| Best for | Organizations needing data plane isolation and standardized deployment flows | Highly security-conscious orgs with strict internal controls and self-operated governance |
Decision rubric
Choose hybrid if…
- You want agent runtimes to stay in your environment but can accept a managed control plane.
- You need standard deployment workflows without operating every platform component.
- You want a faster path to production with fewer operational responsibilities.
Choose full self-hosted if…
- Your security policy requires the entire platform boundary inside your environment.
- You need maximum control over upgrades, audit rules, and data processing.
- You have platform SRE capacity to operate a multi-component system reliably.
8) Estimating cost: a simple worksheet for self-hosted planning
Enterprise self-hosted pricing is custom, but you can estimate workload and build a budget model with the same units: seats, trace volume (base vs extended), deployments (runs + uptime), and Agent Builder runs. For self-hosted, you also add an “infrastructure envelope” for databases, storage, and cluster capacity.
Step 1 — estimate trace volume
Most teams underestimate trace volume because they forget that traces come from more than production traffic. You’ll likely have:
- User traffic traces (end-user sessions, chat conversations, agent invocations)
- Automated test traces (CI pipelines, integration tests, load tests)
- Evaluation traces (offline eval batches, regression suites)
- Playground/studio traces (humans experimenting and debugging)
- Agent Builder traces (every Agent Builder run is traced by default)
Trace volume worksheet (copy/paste)
Inputs:
- Production agent invocations per day: ________
- Days per month: 30 (or your local month)
- Tracing sampling rate for successful runs: ________% (example: 10%)
- Error tracing rate: 100% (recommended)
- CI / automated tests per day producing traces: ________
- Offline eval traces per month: ________
- Agent Builder runs per month: ________ (each run creates traces)
Compute:
1) Production successful traces = invocations/day * days/month * sampling_rate
2) Production error traces = invocations/day * days/month * error_rate (if estimated separately)
3) Total traces/month = success traces + error traces + CI traces + eval traces + Agent Builder traces
Split:
- Base traces/month: ________
- Extended traces/month: ________ (only high-signal promoted traces)
Step 2 — choose a retention strategy
Most organizations use a tiered approach:
- Base retention for most traces: cheap, short-lived, great for debugging.
- Extended retention for high-signal traces: those with feedback, evaluation labels, incidents, and golden datasets.
Step 3 - deployment and Agent Builder load
If you plan to use deployment workflows, model:
- How many deployments will be kept live 24/7 (production)
- How many dev deployments are “always on” vs only used during working hours
- Average deployment runs per day
- Peak concurrency requirements (influence cluster sizing)
- Whether Agent Builder is used for prototyping only or becomes operational tooling
Step 4 - infrastructure envelope (self-hosted only)
Self-hosting adds a second budget line: your infrastructure costs. You should estimate:
- Database costs (Postgres storage, read/write IOPS, backups, replicas)
- Object storage costs (if you store larger artifacts externally)
- Cluster compute costs (Kubernetes nodes, autoscaling, headroom for incident spikes)
- Operational tooling (monitoring, logging, secret management, CI/CD)
A common approach is to treat infra as a “fixed + variable” model: fixed for baseline platform availability, variable for trace/deployment growth. If you don’t do this, you can win the licensing discussion and still get surprised by the cloud bill.
9) How to keep self-hosted LangSmith costs predictable
“Predictable” doesn’t mean “cheap.” It means you can explain the bill, anticipate changes, and prevent runaway spikes. The best cost controls are not financial tricks—they are engineering policies and operational guardrails.
A) Control trace volume with sampling
In production, tracing 100% of traffic can be expensive and noisy. The mature pattern is:
- Trace 100% of errors (and optionally slow outliers).
- Sample successful runs at a rate that gives you enough visibility (often 5–20%).
- Increase sampling temporarily during launches, incidents, or model changes—then reduce.
B) Default to base retention; promote selectively
Retention is the biggest lever. Store most traces as base and promote only high-value traces to extended. If your team needs long-term visibility for quality and compliance, make the promotion criteria explicit.
C) Separate environments and workspaces
A common mistake is dumping dev, staging, and production traces into the same place with no boundaries. Instead:
- Use separate workspaces/environments for dev vs staging vs production.
- Apply stricter sampling in noisy environments.
- Use short retention for dev workspaces by default.
D) Reduce payload bloat
Even if the unit cost is “per trace,” huge trace payloads can slow down debugging and increase storage pressure. Trim or externalize:
- Massive tool outputs (store externally, keep a reference ID in trace metadata)
- Large documents (store in object storage and link)
- Repetitive prompt templates (store as versions in a prompt hub, not embedded in every trace)
E) Treat deployments like infrastructure, not toys
If you use deployment features, keep an eye on “always on” resources. Dev deployments are for testing; production deployments are for customer-facing reliability. Avoid leaving many unused deployments running.
F) Create a “bill explanation” dashboard
In a healthy organization, the bill is never a mystery. You should be able to answer:
- Which services produce the most traces?
- Which environments are the noisiest?
- How many traces are promoted to extended and why?
- Which deployments run 24/7 and what business value do they serve?
- How many Agent Builder runs occurred and who triggered them?
10) Implementation checklist (self-hosted readiness)
Use this checklist to move from “we want self-hosting” to “we can run it reliably with a predictable budget.” You can treat it as a migration plan, or as a readiness review for security and platform teams.
Licensing & program decisions
- Confirm whether you need hybrid or full self-hosted boundary.
- Define required security posture: SSO, RBAC, auditing, retention rules.
- Engage sales for Enterprise + self-hosted add-on details and a trial license key.
- Decide the number of workspaces/environments (dev/staging/prod).
Architecture & operations
- Pick compute platform (commonly Kubernetes) and size baseline capacity.
- Plan storage: Postgres sizing, backups, replica strategy, restore testing.
- Set up secrets management, TLS, and network policies.
- Establish upgrade strategy: maintenance windows, rollback plan.
- Monitor the platform: metrics, logs, alerts for ingestion/backlog/storage pressure.
Trace policy & privacy
- Define what to redact or avoid logging in traces (PII, secrets, tokens).
- Set default retention: base vs extended promotion rules.
- Define sampling rates per environment.
- Implement payload trimming and external artifact storage patterns.
Rollout & governance
- Pilot with one team and one service first; measure trace volume and storage growth.
- Document “bill explanation” and cost drivers for leadership.
- Train teams on tracing best practices and retention policy.
- Review monthly: sampling, retention promotions, and deployment uptime.
11) FAQ - LangSmith self-hosted pricing
These answers focus on the self-hosted context: licensing, boundary decisions, and the cost model that matters once you run the platform yourself.
1) Is LangSmith self-hosted free?
LangSmith is proprietary software. There is a free, self-hosted version with access to basic features, but the cloud and paid self-hosted options are offered as paid services. For advanced self-hosted setups, self-hosting is positioned as an Enterprise add-on that requires contacting sales for licensing.
2) Is “self-hosted LangSmith” an Enterprise plan or an add-on?
Self-hosted LangSmith is positioned as an add-on to the Enterprise plan, typically designed for large, security-conscious customers. You generally obtain a license key to run LangSmith in your environment through a sales process.
3) What do I pay for in self-hosted LangSmith?
You should think in two layers: (1) Enterprise licensing/contract terms (custom, includes self-hosted add-on), and (2) your infrastructure costs (databases, storage, compute, backups, monitoring). Even when licensing is negotiated, the same usage drivers still matter: seats, trace volume (base vs extended retention), deployments (runs + uptime), and Agent Builder usage.
4) Are model/token costs included in LangSmith self-hosted pricing?
No. Model usage is billed separately by your model provider. LangSmith costs are about the platform: observability, evaluation, and deployment workflows. If you use Agent Builder or deployments, you still provide your model provider keys and pay your provider for tokens.
5) What’s the difference between hybrid and full self-hosted?
In hybrid, your data plane runs in your environment while a managed control plane coordinates deployments and workflows. In full self-hosted, you run the platform boundary yourself. Hybrid can reduce operational burden while keeping core runtime/data-plane components in your environment.
6) Can I self-host only observability and evaluation (without deployment)?
Yes. LangSmith supports self-hosted configurations for observability and evaluation only, and separate configurations for the full platform including deployment management. This is often the simplest self-hosted entry point.
7) What are traces and why do they matter for pricing?
A trace represents a single execution of your application (agent run, evaluator run, or playground session). Traces are the core unit for observability and evaluation. Trace volume and retention policy directly drive usage costs in SaaS and drive infrastructure costs in self-hosted setups.
8) What’s the difference between base and extended traces?
Base traces have shorter retention (14 days) and are priced for high-volume debugging. Extended traces have longer retention (400 days) and are often used for feedback loops and long-term improvement workflows. A common strategy is to capture most traces as base and selectively upgrade only high-signal traces.
9) How do I prevent surprise bills in self-hosted mode?
You prevent surprises by managing the drivers: sampling (trace fewer successful runs), retention (promote to extended selectively), environment separation (dev vs prod), and deployment hygiene (don’t keep unnecessary deployments online). Also measure storage growth and database load early in a pilot.
10) Does Agent Builder affect trace usage?
Yes. Agent Builder runs are traced by default and count toward total traces. If Agent Builder becomes a high-volume workflow, it can increase both run volume and trace volume. Many teams use it for prototyping and then migrate hot paths into code-based deployments.
© Agents API Hub