Zanwen Fu
Summer 2026 · Machine Learning Engineer Intern @ Robinhood

Software that thinks, ships,and scales.

Engineer & founder building production-grade autonomous agents, grounded in strong software engineering to solve real-world problems.

I build agentic AI systems that actually work in the real world — reliable, observable, and designed to survive production, not just demos.

About

From NUS to Duke — shipping production systems at every stop.

I'm a Software Engineer and MS Computer Science (AI/ML) student at Duke University, focused on building production-grade autonomous systems — from multi-agent orchestration and LLM tooling to the distributed backends that run them reliably at scale.

I'm the sole founder and engineer of VYNN AI, an agentic financial analyst platform built end-to-end and deployed to ~500 pilot users.

Previously, I designed core components of AutoCodeRover, an autonomous code repair system acquired by Sonar, integrating agentic reasoning directly into JetBrains IDEs. In parallel, I’ve led research on multi-agent LLM frameworks for large-scale medical text mining, with work currently under review at NEJM AI.

This summer, I’ll be joining Robinhood in Menlo Park as a Machine Learning Engineer on the Agentic AI team, continuing my focus on building autonomous systems that operate reliably at real-world scale.

What drives me

Systems that are reliable, observable, and production-ready — not just demo-ready. I care deeply about turning ideas into robust software that solves real problems and serves real users.

Duke University

Duke University

M.S. Computer Science (AI/ML)

2025 – 2027 · GPA: 3.77

National University of Singapore

National University of Singapore

B.Comp. in Computer Science

2021 – 2025 · Distinction

Distinction in Software Engineering

View verified credential

Exchange Semester at HKU (Fall 2023)

Experience

Startups, big tech, research, and teaching.

Robinhood logo

Machine Learning Engineer (Agentic)

UPCOMING
Robinhood·Agentic ML Team
May 2026 – Aug 2026

Building ML-powered agentic systems for Robinhood's financial products this summer.

VYNN AI logo

Founder & Software Engineer (Agentic)

Jul 2025 – Dec 2025

Built and deployed an end-to-end agentic financial analysis platform as sole engineer, serving ~500 pilot users. Multi-agent orchestration, real-time data pipelines, and autonomous LLM-driven analysis — all in production.

Duke University logo

Graduate Teaching Assistant

CURRENT
Aug 2025 – Present

Teaching 3 CS courses — covering software architecture, DevOps, AI agents, and LLM-oriented programming. Mentoring student teams shipping production software for real clients.

AutoCodeRover (now Sonar) logo

Research Software Engineer

Aug 2024 – May 2025

Architected the JetBrains IDE plugin for autonomous code repair with AST-level patch alignment via GumTree. Enhanced the agentic repair algorithm with LLM-as-a-Judge self-improvement and continuous user feedback loops. Achieved 46% on SWE-bench Verified. Core technology acquired by Sonar.

Binance logo

Software Engineer

Binance·Web3 Wallet Team
Jul 2025 – Oct 2025

End-to-end test automation for Binance's Boosters campaign across ~500K simulated transactions. Achieved 85% automation coverage and cut cross-team bug resolution time by 40%.

NUS Undergraduate Research logo

AI Researcher

Jan 2024 – Jul 2025

Primary contributor to manuscript under review at NEJM AI. Built a multi-agent LLM framework for citation screening across ~150K abstracts, achieving 99.5% sensitivity.

Earlier Experience

Full-Stack Software EngineerST Engineering
May 2023 – Aug 2023
Quantum Software EngineerCentre for Quantum Technologies (Singapore)
May 2024 – Dec 2024
Web DeveloperNUS Computing
Feb 2024 – Nov 2024
Data EngineerNUS Research Institute
Mar 2024 – Oct 2024

Selected Work

FOUNDER · SOLE ENGINEER~500 PILOT USERSFULL-STACK · PRODUCTION

VYNN AI

A full-stack financial intelligence platform combining conversational AI analysis, real-time market data streaming, portfolio management, and automated reporting — built end-to-end as sole engineer and deployed to ~500 pilot users in production.

System Architecture

React / TypeScript + ViteAI Chat (SSE) · Market Dashboard · Portfolio Mgmt · News FeedCaddyFastAPI BackendAuth (OAuth+Email)Chat / SSE / JobsWS (News + Prices)Daily ReportsPortfolio (CRUD)LangGraph OrchestratorData AgentDCF AgentNews AgentReport Gen(HTML/PDF/XLSX)RedisQueues+CacheMongoDBDocumentsDockerContainerizedNginxSPA ServeWSS

< 7 min

Full equity analysis — data scraping, DCF modeling, news intel, and PDF report generation

72%

Latency reduction via parallel agent execution and result caching

Real-Time

Dual WebSocket streams for live prices and news with auto-reconnect and health checks

~500

Pilot users on production Hetzner Cloud infrastructure with zero-downtime deployments

🤖

AI Chat Engine

SSE streaming with log batching, multi-conversation management, downloadable XLSX + PDF reports

📊

Market Dashboard

Live prices, stock charts, news aggregation

💼

Portfolio Mgmt

Multi-portfolio, real-time P&L, holdings CRUD

📈

Analytics

6 interactive chart types (area, bar, pie, radar, scatter, treemap), one-click PNG export

📄

Daily Reports

Company, sector, and global market reports with batch generation and smart polling

🔐

Auth & Security

OAuth, passwordless login, HTTP-only cookies, cross-tab sync, user-scoped storage

LangGraphMCPFastAPIRedisMongoDBReactTypeScriptViteTailwind CSSshadcn/uiRechartsDockerNginxCaddySSEWebSocketOAuthPython
SONAR · SOFTWARE ENGINEERACQUIRED BY SONARISSTA 2024 + arXiv

AutoCodeRover — IDE Plugin + Repair Agent

Brought autonomous code repair from research to a production developer tool. AutoCodeRover is a multi-agent system that resolves real GitHub issues end-to-end — reproducing bugs, searching codebases across 7 languages via tree-sitter, generating patches with iterative refinement, and self-correcting through an LLM-as-a-Judge reviewer. I built the JetBrains IDE plugin end-to-end in Kotlin: a conversational agent UI with real-time SSE streaming, GumTree-based three-way AST merge for conflict-free patch application, embedded SonarLint static analysis, and a feedback loop where developers can critique any reasoning step to trigger guided re-runs. On the backend, I designed the self-fix agent that diagnoses inapplicable patches and autonomously replays the pipeline from the most suspicious stage — lifting SWE-bench Verified to 46%. The core technology was acquired by Sonar.

Repair Pipeline Architecture

JetBrains IDE (IntelliJ / PyCharm)PSI Traversal · Build/Test Listeners · Git4Idea · Editor APIACR Plugin (Kotlin)Chat UI + SSEReal-time streaming with typewriter animationSonarLint EngineEmbedded Java/Python static analysisGumTree 3-Way MergeBaseline → Modified → Patched AST alignmentContext EnrichmentPSI refs + cursor history + open filesUser Feedback per Reasoning Step→ Guided Re-runcritique any agent stepAutoCodeRover Backend (Python · Docker)Meta-Agent OrchestratorHardcoded or LLM-drivenReproducer AgentContext Retrieval7-language tree-sitterPatching AgentReviewer AgentSelection AgentBest-of-N + regressionSelf-Fix AgentLLM-as-a-JudgeSWE-bench 46% VerifiedPatched Code + EvidenceSpecs + Reproducer + ReviewREST API (OkHttp)SSE Stream (real-time logs)replay loopFeedbackContext / FeedbackPatch

46%

SWE-bench Verified

State-of-the-art across 2,294 real GitHub issues — highest among open-source agents

50%

Patch Precision

1.8× higher than next best (Agentless at 27%) — reviewer agent reduces noise for developers

3-Way

AST Merge (GumTree)

Conflict-free patch application when local code has diverged from agent's baseline

7

Languages Supported

Tree-sitter search across Python, Java, JS, TS, C/C++, Go, PHP

🐛

Autonomous Repair

Describe a bug → ACR localizes, patches, and validates autonomously

🔍

SonarLint Integration

Embedded static analysis for Java/Python with one-click ACR fixes

🌳

3-Way AST Merge

GumTree conflict resolution across baseline/modified/patched

💬

Interactive Feedback

Critique any agent reasoning step — triggers guided pipeline re-run

🔄

Self-Fix Agent

LLM-as-a-Judge diagnoses inapplicable patches and replays from failure point

🔨

Build/Test Capture

Auto-captures IDE build and test failures with one-click ACR submission

KotlinPythonJetBrains PSIGumTreetree-sitterSonarLintOkHttpSSEREST APIsJGitDockerLLM-as-a-JudgeClaude 3.5 SonnetGPT-4o
FIRST AUTHOR · AI RESEARCH

LUMINA — Multi-Agent Citation Screening for Systematic Reviews

Designed and built LUMINA, a four-agent framework that automates citation screening for medical systematic reviews and meta-analyses. A classifier agent triages citations, a detailed screening agent applies PICOS-guided Chain-of-Thought evaluation, a reviewer agent audits each decision via LLM-as-a-Judge, and an improvement agent self-corrects when disagreements arise — mirroring the human peer-review process. Evaluated on 15 SRMAs across ~90K citations from BMJ, JAMA, and Lancet journals: achieved 98.2% mean sensitivity (10 of 15 at perfect 100%) with a 1.8% false negative rate, dramatically outperforming published baselines by Li et al. (37% sensitivity) and Strachan (58%).

Multi-AgentChain-of-ThoughtPICOSLLM-as-a-JudgeGPT-4o-miniGPT-o3-miniPython

98.2%

Sensitivity

10 of 15 reviews at perfect 100% — near-zero missed studies

1.8%

False Negative Rate

vs. 63% (Li et al.) and 42% (Strachan) — 35× reduction

15

Systematic Reviews

~90K citations from BMJ, JAMA, Lancet — $0.07 per 10 articles

Teaching

Graduate TA at Duke — 3 CS courses.

Teaching software engineering, DevOps, and agentic AI systems to undergraduate and graduate students at Duke.