built as a graph

Tangyi (Jerry) Qian

I build agent systems end-to-end —
from GNN/RL training to full-stack product.

[Berkeley, CA][Founding AI Engineer @ PiPlan.ai][LLM model-editing research — paper in progress]

nodes/ — things I've built

FOUNDERSHIPPEDFULL-STACK

PiPlan.ai

An agentic planning system — goals become typed graphs, an agent harness governs every change, and the plan re-plans itself as reality drifts.

AgentsInference & ServingML PlatformFull-stack

Details

Problem

Every planner dies the moment reality diverges from the plan. Static to-do lists can't answer the only question that matters: "X just slipped — what should I do now?"

What I built

Built the entire system solo. Full-stack core: Python · FastAPI · SQLAlchemy · NetworkX backend (100+ REST endpoints over a 39-table versioned schema) with a React 18 · TypeScript · React Flow workspace. Agent harness: proposal-first review (draft → diff → human commit), a simulation sandbox for candidate plans, and a full event log doubling as training-data trajectories. Serving: a self-hosted multi-GPU vLLM node with a dynamic routing layer — routine agent steps run locally, complex planning escalates to frontier APIs. Adaptive re-planning re-derives the critical path as reality changes; the conversational planner–executor agent is in progress on top of the same guardrails.

piplan.ai ↗

OPEN SOURCEHACKATHON BUILD

Engram

A continual-learning personal agent that writes your beliefs into model weights.

Model EditingAgentsContinual Learning

Details

Problem

RAG remembers facts, but it can't change what a model believes. Preferences and beliefs shouldn't live in a vector store bolted onto the side of a frozen model.

What I built

An agent with a fact-vs-belief router: facts go to retrieval, beliefs get written directly into the model's weights via model editing. Includes an attribution demo — the agent recalls injected beliefs with RAG fully disabled, proving the knowledge lives in the weights.

GitHub ↗

REPRODUCE → VERIFY → BENCHMARK

Paper Reproductions

I re-implement papers end-to-end and benchmark them against reported numbers.

PyTorchBenchmarking

Details

What I built

DCN-V2 (Google, WWW '21) — implemented and benchmarked on Criteo. ESMM (Alibaba, SIGIR '18) — reproduced the entire-space multi-task framework. ColBERT (Stanford, SIGIR '20) — implemented the MaxSim operator in PyTorch. In progress: post-training, model-architecture, and inference-acceleration papers.

GitHub ↗

OPEN SOURCE

ClawConclave

A multi-agent Discord framework — LLM agents with distinct roles coordinating in shared channels.

Multi-agentLLM Systems

GitHub ↗

path/ — the trajectory so far

Apr 2026 — Present

PiPlan.ai · Founding AI Engineer

Building an agentic planning system end-to-end as the sole engineer — product, agent harness, inference stack, and data platform.

Read details

Architected and shipped the full-stack core (Python · FastAPI · SQLAlchemy 2.0 · NetworkX; React 18 · TypeScript · React Flow — 100+ REST endpoints over a 39-table versioned schema) modeling goals as typed directed multigraphs with live completion-state tracking and versioned snapshots.

Built the agent harness governing all mutations: a proposal-first review protocol (draft → diff preview → human commit/reject), a simulation sandbox for candidate plans, and a full event log capturing every action for auditability and downstream training data.

Built adaptive re-planning services that re-derive the critical path as reality changes (constraint-solver scheduling with Monte-Carlo feasibility checks); now building the conversational planner–executor agent: multi-step tool-call loops over CLI-wrapped system capabilities, bounded retries, and human-in-the-loop batch review.

Deployed a self-hosted 6-GPU inference node (vLLM serving GLM / MiniMax-class open models) for latency-critical agent steps: tuned tensor-parallel vs multi-replica layout and continuous batching, and exploited prefix caching over shared system-prompt and graph-state context across multi-turn sessions.

Built a dynamic model-routing layer trading off latency, cost, and capacity: routine agent steps (tool-argument formatting, summarization) run on the local node, complex planning steps escalate to frontier APIs, with load-aware spillover and per-step cost telemetry.

Engineered stateful agent-session serving: async long-running loops with streaming (WebSocket/SSE), checkpointed and resumable runs, idempotent tool mutations, per-user concurrency caps, and per-step tracing (model, tokens, latency) for production debuggability.

Migrated the platform to multi-tenant Postgres (39-table schema + full versioning system) with per-tenant isolation, graph-hydration caching keyed on (user, graph-version), and monthly event-log archival designed around training-data extraction.

Moved solver workloads off the request path onto a job queue with per-tenant quotas and incremental local-repair scheduling, keeping API latency flat under concurrent re-planning load.

Built the training-data flywheel: trajectory and preference records (proposal → accept / reject → actual outcome) extracted from the event log into parquet datasets, powering evaluation suites and a bandit layer for per-user proposal-style personalization.

AgentsInference & ServingML Platform

Feb 2026 — Jun 2026

GoldenMeadow Investments LLC · Software Developer Intern

Built an automated investment-research agent pipeline that turns raw filings and market data into analyst-ready research briefs for the firm's internal workflow.

Read details

Architected the end-to-end auto-research pipeline: ingestion and parsing of earnings releases, SEC filings, and market news (10K+ document corpus) into structured signals (guidance changes, event tags, sentiment deltas), with a gradient-boosted relevance scorer gating what enters the research queue.

Built the research agent core: given a ticker or event, the agent plans an evidence-gathering workflow (filing sections, price/volume context, peer comparisons, news timeline) compiled into a parallel tool-call DAG (LLMCompiler-style), producing citation-grounded research briefs.

Built a FrugalGPT / RouteLLM-style cascade router — a lightweight classifier over document type, relevance score, and small-model confidence routes routine coverage to a compact model and escalates only complex filings (M&A, restatements, non-standard disclosures) to a frontier model.

AgentsLLM SystemsFinance

Sep — Dec 2025

CMU Heinz XR Lab · Machine Learning Engineer (Capstone)

LLM + knowledge-graph recommendation engine for VR coursework, serving 200+ students. Dual-stage matching over a 4,700-node curriculum KG; implicit-feedback loop lifted offline Recall@5 by 30% (NDCG@10 0.82).

LLM SystemsKnowledge GraphsRetrieval

May 2025 — Feb 2026

XY Investments · AI Implementation Engineer

Three subsystems of an LLM research platform: hybrid retrieval (BGE + BM25 + RRF, Recall@10 62% → 85%); an embedding + LLM intent router for a multi-agent platform (routing accuracy 34% → 91%); multimodal PDF extraction at 95%+ accuracy.

AgentsRAGLLM Systems

Jun — Jul 2024

Century Frontier Asset Management · Quant ML Intern

Pairwise ranking (RankNet-style) for high-frequency cross-sectional selection; VAE over tick-level order-book data for denoised dense features.

MLRanking

Jul — Aug 2023

Global AI · Data Engineering & ML Intern

Heterogeneous-information-network embeddings over a Wikidata-derived company graph; meta-path constrained random walks; query-expansion layer for search Recall@10.

Graph LearningSearch

logs/ — build notes, no thought leadership

2026-06-27

Engram: writing user beliefs into model weights

Build note on routing personal memory between RAG and model-weight edits, then proving attribution with retrieval disabled.

takeaway: memory systems need provenance, not just recall.

2026-05-28

Training a Safe-Routes skill: notes from the loop

Build note on a function + agent road-trip planner where agents verify the world but never mutate the route.

takeaway: keep agents as sensors; keep route decisions in deterministic code.

state/ — who I am

I'm Jerry — an AI engineer in Berkeley, CA. I like owning the whole stack: data pipelines, training loops, solvers, agents, and the product around them.

In April 2026 I founded PiPlan.ai to build a planner that behaves like an engineer: model the goal as a graph, optimize the schedule, and re-plan automatically when reality drifts. Building a company end-to-end confirmed what I enjoy most — deep technical work close to the model and the system.

Before that: Carnegie Mellon (M.S., Dec 2025), Emory (B.S. Quantitative Sciences), and ML work across retrieval, ranking, and graph learning.

Education

2024 — 2025Carnegie Mellon University · M.S. Information Systems Management

2021 — 2024Emory University · B.S. Quantitative Sciences: Informatics

Stack

Agents & LLM planner–executor runtimes · tool use · model editing · RAG & hybrid retrieval · vLLM

ML PyTorch · GNNs (PyG) · ranking · imitation learning · bandits

Systems FastAPI · SQLAlchemy/Postgres · Redis · Docker/K8s · OR-Tools

Frontend React 18 · TypeScript · React Flow

GitHub: github.com/xesws · LinkedIn: linkedin.com/in/tangyi-qian-jerry · Email: jq@piplan.ai