Software Engineer — Google Cloud
Oct 2019 – Present
Google · Mountain View, CA
- Engineered synthetic data generation pipelines utilising Gemini, producing high-fidelity instruction-tuning pairs to improve foundational model performance.
- Architected a low-latency LLMOps platform (DQaaS) using gRPC/Protobufs, enabling enterprise-scale prompt versioning, testing, and retrieval for GenAI and RAG workflows.
- Led backend storage and data integrity for CrowdCompute, Google's massive-scale data engine critical for generating RLHF and high-quality SFT datasets for foundation models.
- Provided technical leadership to the Crowd Data Platform team, driving the evolution of AI data-generation tools for GenAI/LLM use cases.
- Worked cross-functionally with various teams to streamline the collection of high-quality data.
- Received 21 awards including 5 Spot Bonuses