Leon Liu

AI Researcher & Developer • Prev MLE @ Jingo

📗 CS at UCLA building ML systems.

🏢 Shipped recommendation infrastructure at Jingo (AI startup) serving 5M products with RAG and vector search.

🔬 Fine-tuning diffusion models on GPU clusters for autonomous driving research at UCLA Zhou Lab (ICLR 2026 under review).

🌱 Side projects:

🐍 RL agents for Atari Surround
⌨️ Transformer EMG→QWERTY decoder for keyboardless typing from biosignals
📝 SaaS resume builder accessible editor → LaTeX/ATS-friendly PDF
... and more!

Experience

Machine Learning Engineer Intern

Jingo (AI-Native E-commerce Startup) • Machine Learning Engineer Intern

May 2025 — Sep 2025

Architected and built the core LLM-powered product search + recommendation service infra with FastAPI, Redis, and PostgreSQL (SQLAlchemy ORM), integrating RAG for user memory, vector search across a 5M+ product catalog
Cut search initial p95 latency by ~50% by optimizing async orchestration with FastAPI workers and Redis caching, enabling real-time infinite scroll
Productionized contextual rec engine combining graph-based user memory with context signals (geolocation, weather, Pinterest profile), plus an LLM-based query preprocessor that improved product recall and rec relevance in A/B tests
Shipped web/deep research feature with look-aside cache using Perplexity Sonar API powering review/spec product comparisons across products while minimizing latency and amortizing API cost
Designed an LLM-driven ETL pipeline that transformed messy product data into enriched embeddings and structured metadata, boosting product recall and recommendation relevance

Machine Learning

Backend Engineering

LLM

Vector Search

FastAPI

No preview available

🚧No full page available yet!🚧

Undergraduate Researcher

UCLA Bolei Zhou Lab • Undergraduate Researcher

Jan 2025 — May 2025

Engineered diffusion-based simulator-to-realistic scene generation models grounded in driving simulator physics
Trained Flux and SD3 models across 8 L40S GPUs using PyTorch w/ DeepSpeed and SLURM, improving training throughput (2x batch sizes)
Trained adapter layers for simulator-conditioned diffusion models to improve realism in driving scene generation (ICLR 2026 under review)

Gen AI

Diffusion

Simulation

PyTorch

DeepSpeed

No preview available

🚧No full page available yet!🚧

Software Engineer Intern

UCLA Bionics Lab • Software Engineer Intern

Jun 2024 — Oct 2024

Trained LSTM achieving >90% accuracy for automating stroke rehabilitation assessments
Developed data pipelines integrating MediaPipe-based 3D triangulation, doubling clinical data processing throughput

Machine Learning

Computer Vision

Healthcare

PyTorch

No preview available

🚧No full page available yet!🚧

Projects

Prolio — Resume Versioning App

Sep 2024 — Jan 2025

Designed and deployed a full-stack SaaS resume platform with modular editing, multiple versions, Gemini LLM-powered reviews, and PWA client (Next.js, React, Tailwind, shadcn)
Engineered microservice backend (Flask, PostgreSQL, Redis, Celery, GCP) with async LaTeX/PDF compilation and secure OAuth authentication
Productionized infra with Docker, GitHub Actions, and GCP Cloud Build, enabling CI/CD and scaling

Next.js

React

Flask

PostgreSQL

Redis

GCP

Docker

OAuth

Source

Full Stack

SaaS

CI/CD

LaTeX

No preview available

🚧No full page available yet!🚧

emg2qwerty — sEMG-to-QWERTY AI Text Decoding

Jun 2024 — Sep 2024

Trained Transformer sequence-to-sequence models to decode wrist-worn EMG signals into typed QWERTY text, enabling hands-free typing from biosignals
Outperformed Meta's published ConvNet baseline by ~10% lower character error rate through Transformer and RNN fine-tuning
Developed full training pipeline with preprocessing (log spectrograms, augmentation), CTCLoss optimization, and benchmark evaluation across GRU, LSTM, and Transformer architectures
Increased training throughput by 2.5x via spectrogram down-sampling (125Hz→50Hz) with negligible accuracy loss, enabling faster experimentation

PyTorch

PyTorch Lightning

Transformers

Python

Source

Machine Learning

Signal Processing

Transformers

PyTorch

No preview available

🚧No full page available yet!🚧

Lingsim — AI-Powered Language Learning App

Jan 2024 — Mar 2024

Built multilingual AI tutoring full-stack app integrating speech recognition (Whisper) + GPT-4 dialogue + ElevenLabs TTS to enable real-time conversational practice
Demonstrated sub-second (~500 ms) speech→text→response→speech loop across ~10 languages, showcasing feasibility of real-time interactive AI tutors
Designed end-to-end system with Next.js, React, Firebase, and REST APIs to support secure authentication and optimized low-latency chat

Next.js

React

Firebase

TypeScript

GPT-4

Whisper

Demo

Full Stack

AI/ML

EdTech

Chat

Speech Recognition

TTS

No preview available

🚧No full page available yet!🚧

Reliable Transport Layer (TCP over UDP)

Nov 2024 — Dec 2024

Built a custom transport protocol in C providing reliable, ordered delivery on top of UDP by adding packet retransmission, acknowledgments, and error detection
Validated correctness with stress tests simulating dropped/out-of-order packets, demonstrating consistent end-to-end data integrity

Networking

Systems Programming

No preview available

🚧No full page available yet!🚧

Basilisk — Multi-Agent Social Deduction Simulator

Aug 2024 — Oct 2024

Developed framework to simulate and manage social deduction games with both human players and LLM-based agents
Integrated LangChain and Ollama/Gemini APIs for multi-agent orchestration, dialogue, and strategy execution
Implemented structured input/response validation with Pydantic, ensuring reliable agent interactions and reproducibility
Built orchestration layer for game-state tracking, rule enforcement, and cross-agent coordination in Python

Python

LangChain

Pydantic

Gemini

Ollama

Source

LLM

Multi-Agent Systems

Game AI

No preview available

🚧No full page available yet!🚧

Finduo — Full-Stack Gaming Social Media Web App

Jan 2023 — Mar 2023

Designed and implemented full-stack social media platform for users to find gaming partners
Implemented user profiles, dynamic feed, search capabilities, and viewing friend invites with React
Implemented RESTful API for auth, user profiles, ratings, and friends list in the backend with MongoDB, Express, Node.js

MongoDB

Express

React

Node.js

TypeScript

Source

Full Stack

Social Media

Gaming

No preview available

🚧No full page available yet!🚧

Dynamic Analysis of Cryptographic API Misuse in Node.js

Jun 2023 — Aug 2023

Paper accepted at IEEE MIT URTC
Implemented multipart system to perform dynamic analysis on Node.js codebases to detect vulnerabilities from API misuse
Implemented custom module loader that instrumented code to log API usage
Developed rule enforcer to detect API misuse defined by a modular set of declarative rules

Node.js

TypeScript

Bash

Demo Source

Security

Dynamic Analysis

Research

No preview available

🚧No full page available yet!🚧

Unity Real-Time Strategy PVP Game

Sep 2023 — Dec 2023

Engineered robust game systems and mechanics, leveraging data structures to optimize performance
Architected and implemented modular scripts and game systems with Object-Oriented Programming
Utilized Unity Profiler for game performance tuning and software testing
Enhanced user experience by identifying and resolving performance bottlenecks

Unity

Demo Source

Game Development

Real-time Systems

Object-Oriented Programming

No preview available

🚧No full page available yet!🚧

Brewin Interpreter

Jan 2024 — Mar 2024

Implemented an interpreter in Python for a custom language "Brewin"
Designed interpreter for several variations of the language with common features
Handled function calls, recursion, scoping, variables, control flow (conditionals and loops)
Built systems for dynamic and statically-typed variations, structs, error handling, lazy evaluation

Python

Source

Programming Languages

Interpreters

Compilers

No preview available

🚧No full page available yet!🚧

Reverse Proxy for ACM Cyber

Oct 2023 — Dec 2023

Implemented a reverse proxy with HTTP request parsing, authentication (bcrypt), and structured logging to securely route and monitor backend traffic
Designed and tested routing, auth, and logging subsystems to improve reliability and observability in multi-service environments

TypeScript

Deno

Networking

Backend

Security

No preview available

🚧No full page available yet!🚧

Atari Reinforcement Learning

Aug 2023 — Nov 2023

Implemented policy gradient methods and deep Q learning for a reinforcement learning agent in Atari
Trained agent to solve and beat the Atari Surround environment
Spearheaded and taught a quarter-long workshop for ACM AI for 60+ members

Python

PyTorch

PyTorch Lightning

Source

Reinforcement Learning

PyTorch

Python

No preview available

🚧No full page available yet!🚧