Leon Liu
📗 CS at UCLA building ML systems.
🏢 Shipped recommendation infrastructure at Jingo (AI startup) serving 5M products with RAG and vector search.
🔬 Fine-tuning diffusion models on GPU clusters for autonomous driving research at UCLA Zhou Lab (ICLR 2026 under review).
🌱 Side projects:
- 🐍 RL agents for Atari Surround
- ⌨️ Transformer EMG→QWERTY decoder for keyboardless typing from biosignals
- 📝 SaaS resume builder accessible editor → LaTeX/ATS-friendly PDF
- ... and more!
Experience
Machine Learning Engineer Intern
Jingo (AI-Native E-commerce Startup) • Machine Learning Engineer Intern
- Architected and built the core LLM-powered product search + recommendation service infra with FastAPI, Redis, and PostgreSQL (SQLAlchemy ORM), integrating RAG for user memory, vector search across a 5M+ product catalog
- Cut search initial p95 latency by ~50% by optimizing async orchestration with FastAPI workers and Redis caching, enabling real-time infinite scroll
- Productionized contextual rec engine combining graph-based user memory with context signals (geolocation, weather, Pinterest profile), plus an LLM-based query preprocessor that improved product recall and rec relevance in A/B tests
- Shipped web/deep research feature with look-aside cache using Perplexity Sonar API powering review/spec product comparisons across products while minimizing latency and amortizing API cost
- Designed an LLM-driven ETL pipeline that transformed messy product data into enriched embeddings and structured metadata, boosting product recall and recommendation relevance
No preview available
🚧No full page available yet!🚧
Undergraduate Researcher
UCLA Bolei Zhou Lab • Undergraduate Researcher
- Engineered diffusion-based simulator-to-realistic scene generation models grounded in driving simulator physics
- Trained Flux and SD3 models across 8 L40S GPUs using PyTorch w/ DeepSpeed and SLURM, improving training throughput (2x batch sizes)
- Trained adapter layers for simulator-conditioned diffusion models to improve realism in driving scene generation (ICLR 2026 under review)
No preview available
🚧No full page available yet!🚧
Software Engineer Intern
UCLA Bionics Lab • Software Engineer Intern
- Trained LSTM achieving >90% accuracy for automating stroke rehabilitation assessments
- Developed data pipelines integrating MediaPipe-based 3D triangulation, doubling clinical data processing throughput
No preview available
🚧No full page available yet!🚧
Projects
Prolio — Resume Versioning App
- Designed and deployed a full-stack SaaS resume platform with modular editing, multiple versions, Gemini LLM-powered reviews, and PWA client (Next.js, React, Tailwind, shadcn)
- Engineered microservice backend (Flask, PostgreSQL, Redis, Celery, GCP) with async LaTeX/PDF compilation and secure OAuth authentication
- Productionized infra with Docker, GitHub Actions, and GCP Cloud Build, enabling CI/CD and scaling
No preview available
🚧No full page available yet!🚧
emg2qwerty — sEMG-to-QWERTY AI Text Decoding
- Trained Transformer sequence-to-sequence models to decode wrist-worn EMG signals into typed QWERTY text, enabling hands-free typing from biosignals
- Outperformed Meta's published ConvNet baseline by ~10% lower character error rate through Transformer and RNN fine-tuning
- Developed full training pipeline with preprocessing (log spectrograms, augmentation), CTCLoss optimization, and benchmark evaluation across GRU, LSTM, and Transformer architectures
- Increased training throughput by 2.5x via spectrogram down-sampling (125Hz→50Hz) with negligible accuracy loss, enabling faster experimentation
No preview available
🚧No full page available yet!🚧
Lingsim — AI-Powered Language Learning App
- Built multilingual AI tutoring full-stack app integrating speech recognition (Whisper) + GPT-4 dialogue + ElevenLabs TTS to enable real-time conversational practice
- Demonstrated sub-second (~500 ms) speech→text→response→speech loop across ~10 languages, showcasing feasibility of real-time interactive AI tutors
- Designed end-to-end system with Next.js, React, Firebase, and REST APIs to support secure authentication and optimized low-latency chat
No preview available
🚧No full page available yet!🚧
Reliable Transport Layer (TCP over UDP)
- Built a custom transport protocol in C providing reliable, ordered delivery on top of UDP by adding packet retransmission, acknowledgments, and error detection
- Validated correctness with stress tests simulating dropped/out-of-order packets, demonstrating consistent end-to-end data integrity
No preview available
🚧No full page available yet!🚧
Basilisk — Multi-Agent Social Deduction Simulator
- Developed framework to simulate and manage social deduction games with both human players and LLM-based agents
- Integrated LangChain and Ollama/Gemini APIs for multi-agent orchestration, dialogue, and strategy execution
- Implemented structured input/response validation with Pydantic, ensuring reliable agent interactions and reproducibility
- Built orchestration layer for game-state tracking, rule enforcement, and cross-agent coordination in Python
No preview available
🚧No full page available yet!🚧
Finduo — Full-Stack Gaming Social Media Web App
- Designed and implemented full-stack social media platform for users to find gaming partners
- Implemented user profiles, dynamic feed, search capabilities, and viewing friend invites with React
- Implemented RESTful API for auth, user profiles, ratings, and friends list in the backend with MongoDB, Express, Node.js
No preview available
🚧No full page available yet!🚧
Dynamic Analysis of Cryptographic API Misuse in Node.js
- Paper accepted at IEEE MIT URTC
- Implemented multipart system to perform dynamic analysis on Node.js codebases to detect vulnerabilities from API misuse
- Implemented custom module loader that instrumented code to log API usage
- Developed rule enforcer to detect API misuse defined by a modular set of declarative rules
No preview available
🚧No full page available yet!🚧
Unity Real-Time Strategy PVP Game
- Engineered robust game systems and mechanics, leveraging data structures to optimize performance
- Architected and implemented modular scripts and game systems with Object-Oriented Programming
- Utilized Unity Profiler for game performance tuning and software testing
- Enhanced user experience by identifying and resolving performance bottlenecks
No preview available
🚧No full page available yet!🚧
Brewin Interpreter
- Implemented an interpreter in Python for a custom language "Brewin"
- Designed interpreter for several variations of the language with common features
- Handled function calls, recursion, scoping, variables, control flow (conditionals and loops)
- Built systems for dynamic and statically-typed variations, structs, error handling, lazy evaluation
No preview available
🚧No full page available yet!🚧
Reverse Proxy for ACM Cyber
- Implemented a reverse proxy with HTTP request parsing, authentication (bcrypt), and structured logging to securely route and monitor backend traffic
- Designed and tested routing, auth, and logging subsystems to improve reliability and observability in multi-service environments
No preview available
🚧No full page available yet!🚧
Atari Reinforcement Learning
- Implemented policy gradient methods and deep Q learning for a reinforcement learning agent in Atari
- Trained agent to solve and beat the Atari Surround environment
- Spearheaded and taught a quarter-long workshop for ACM AI for 60+ members
No preview available
🚧No full page available yet!🚧