feat: add native LangGraph runtime to gateway, replace langgraph-cli by greatmengqi · Pull Request #1574 · bytedance/deer-flow

greatmengqi · 2026-03-29T16:51:15Z

Summary

Add LangGraph Platform API endpoints (threads, runs/stream, assistants) to the existing gateway behind DEERFLOW_NATIVE_RUNTIME feature flag
When enabled (default), gateway serves everything on port 8001 — no langgraph-cli needed, eliminating the RUN_LOCK serialization bottleneck
DeerFlowClient stream_mode="dual" provides independent token-level message streaming and state snapshots
nginx: one-line change per config (langgraph upstream points to gateway)

Architecture

DEERFLOW_NATIVE_RUNTIME=true (default):
  nginx(2026) → gateway(8001) ← all APIs (original + LangGraph)
  Processes: 3 (gateway + frontend + nginx)

DEERFLOW_NATIVE_RUNTIME=false (legacy fallback):
  nginx needs langgraph upstream reverted to :2024
  nginx(2026) → langgraph-cli(2024) + gateway(8001)
  Processes: 4

Files Changed

File	Change
`gateway/app.py`	Feature flag + DeerFlowClient init in lifespan
`gateway/deps.py`	New — singleton store + client
`gateway/store.py`	New — ThreadStore with bounded eviction
`gateway/routers/runtime_threads.py`	New — thread CRUD + state
`gateway/routers/runs.py`	New — SSE streaming with disconnect handling
`gateway/routers/assistants.py`	New — assistant search stub
`deerflow/client.py`	`stream_mode="dual"` + metadata field
`nginx.local.conf`	langgraph upstream `127.0.0.1:2024` → `:8001`
`nginx.conf` (Docker)	langgraph upstream `langgraph:2024` → `gateway:8001`
`serve.sh`	Conditional langgraph-cli startup
`start-daemon.sh`	Same
`Makefile`	Stop target updated

Test plan

34 new unit tests (ThreadStore, endpoints, SSE format)
1105+ existing tests pass (0 regressions)
End-to-end scenario testing (create→stream→rename→search→delete)
13-round Codex adversarial review
2x /simplify (6 parallel review agents)
Concurrency benchmark: 3.7x parallelism (vs 3.2x with langgraph-cli)

Closes #1595

Consolidate the LangGraph CLI server (port 2024) and Gateway API (port 8001) into a single FastAPI service on port 2024. This eliminates the RUN_LOCK serialization bottleneck in langgraph-cli's inmem runtime and reduces the backend from 2 processes to 1. New server implements LangGraph Platform API endpoints (threads, runs/stream, assistants) using DeerFlowClient, while mounting all existing Gateway routers (models, skills, mcp, memory, uploads, artifacts, etc.) on the same app. Key changes: - Add DeerFlowClient stream_mode="dual" for independent messages + values streams - Filter middleware-internal messages (TitleMiddleware etc.) at server layer - Bounded ThreadStore with LRU eviction (max 10k threads) - Client disconnect handling via threading.Event cancel signal - 33 new tests, 1105 existing tests pass

Simplify models, skills, mcp, memory routers by delegating to the shared DeerFlowClient singleton instead of importing from low-level harness modules directly.

- Remove null created_at from empty thread state response - Add SSE end event at stream completion (normal + error paths) - Fix concurrent status race with atomic busy/idle counter - Format with ruff

- Add SSE metadata event at stream start (run_id + thread_id) - Sanitize error messages in SSE (no internal exception leakage) - Fix title update tracking to detect title-clear transitions - Return shallow copies from ThreadStore.to_dict() to prevent mutation

- Add server-layer router implementations for models, skills, mcp, memory (thin wrappers around DeerFlowClient) - Revert gateway routers to original upstream implementations - Feature flag DEERFLOW_USE_GATEWAY_ROUTERS switches between the two - Default: server-layer routers (no gateway dependency for these 4) - Eliminates bidirectional dependency between server and gateway packages

Default (split): server:2024 + gateway:8001 + nginx routes between them. Server handles LangGraph API + models/mcp/skills/memory (via DeerFlowClient). Gateway handles artifacts/uploads/agents/suggestions/channels. DEERFLOW_STANDALONE=true: server:2024 only, mounts all routes. No gateway process needed. - Add nginx.split.conf (original routing, langgraph→server rename only) - Add nginx.standalone.conf (all routes → server) - serve.sh conditionally starts gateway based on DEERFLOW_STANDALONE - Revert gateway routers to original upstream implementations

- server: single process on port 2024, all APIs (LangGraph + REST) - gateway: legacy mode — langgraph-cli(2024) + gateway(8001) - nginx picks matching config (standalone vs split) - No feature flags or standalone/split complexity — just pick a runtime

Instead of a separate server process, add LangGraph Platform API routes directly to the existing gateway behind DEERFLOW_NATIVE_RUNTIME flag. - Flag ON (default): gateway serves everything, no langgraph-cli needed - Flag OFF: legacy mode — langgraph-cli(2024) + gateway(8001) - nginx standalone config points to gateway:8001 - One process, one app, one flag

- Delete app/server/app.py (standalone entry point) - Delete redundant server-layer routers (models, mcp, skills, memory) - Keep threads/runs/assistants routers + store + deps as library - Gateway imports them behind DEERFLOW_NATIVE_RUNTIME flag - Remove make server target

Move deps.py, store.py, and LangGraph routers (threads, runs, assistants) into app/gateway/. Delete app/server/ entirely. All code now lives under one package: app/gateway/.

Instead of two nginx configs and port juggling, just change the langgraph upstream from 127.0.0.1:2024 to 127.0.0.1:8001. Gateway on 8001 serves everything. Nginx routes unchanged.

- start-daemon.sh: replace deleted app.server.app with app.gateway.app - serve.sh: fix log label (langgraph.log → gateway.log) - serve.sh: add "yes" to flag check matching Python's accepted values - serve.sh/start-daemon.sh: remove stale app.server.app pkill refs

greatmengqi added 2 commits March 30, 2026 01:05

refactor: gateway routers delegate to DeerFlowClient

dd3a2b0

Simplify models, skills, mcp, memory routers by delegating to the shared DeerFlowClient singleton instead of importing from low-level harness modules directly.

greatmengqi force-pushed the feat/unified-server branch from ccf4fa4 to dd3a2b0 Compare March 29, 2026 17:07

greatmengqi added 9 commits March 30, 2026 01:14

fix: address adversarial review findings

88937c8

- Remove null created_at from empty thread state response - Add SSE end event at stream completion (normal + error paths) - Fix concurrent status race with atomic busy/idle counter - Format with ruff

refactor: merge server into gateway — eliminate app/server package

f463452

Move deps.py, store.py, and LangGraph routers (threads, runs, assistants) into app/gateway/. Delete app/server/ entirely. All code now lives under one package: app/gateway/.

simplify: reuse DeerFlowClient._extract_text, trim unused params

8a8d7ef

greatmengqi force-pushed the feat/unified-server branch from 30ceb3d to 9747bc2 Compare March 29, 2026 17:57

greatmengqi changed the title ~~feat: replace langgraph-cli + gateway with unified FastAPI server~~ feat: add native LangGraph runtime to gateway, replace langgraph-cli Mar 29, 2026

greatmengqi force-pushed the feat/unified-server branch from 9747bc2 to 0fc4950 Compare March 29, 2026 18:02

simplify: one nginx config, one port — delete split/standalone configs

6bc8d9c

Instead of two nginx configs and port juggling, just change the langgraph upstream from 127.0.0.1:2024 to 127.0.0.1:8001. Gateway on 8001 serves everything. Nginx routes unchanged.

greatmengqi force-pushed the feat/unified-server branch from 0fc4950 to 6bc8d9c Compare March 29, 2026 18:03

greatmengqi added 2 commits March 30, 2026 02:04

fix: remove duplicated shell error redirection in pkill commands

f1accb9

greatmengqi mentioned this pull request Mar 30, 2026

feat: native LangGraph runtime — eliminate langgraph-cli dependency #1595

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add native LangGraph runtime to gateway, replace langgraph-cli#1574

feat: add native LangGraph runtime to gateway, replace langgraph-cli#1574
greatmengqi wants to merge 14 commits intobytedance:mainfrom
greatmengqi:feat/unified-server

greatmengqi commented Mar 29, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

greatmengqi commented Mar 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Architecture

Files Changed

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

greatmengqi commented Mar 29, 2026 •

edited

Loading