Joseph Thomas

Joseph Thomas

Software Engineer  ·  Data Scientist  ·  AI/ML Researcher

Saratoga, CA 95070 (607) 216-7730 Google Scholar GitHub LinkedIn
Software Engineer — Google Cloud Oct 2019 – Present
Google  ·  Mountain View, CA
  • Led backend storage & data quality systems within CrowdCompute, a human-computation platform for ML and data-integrity tasks.
  • Provided technical leadership to the Crowd Data Platform team, driving AI data-generation tools for GenAI / LLM use cases.
  • Improved GenAI model training data quality through quantitative pilot studies and standardised best practices.
  • Created a new metrics dashboard that accelerated data-issue identification across the full collection lifecycle.
  • Received 21 awards including 5 Spot Bonuses.
Software Development Engineer — Big Data Technologies Feb 2017 – Sep 2019
Amazon  ·  Palo Alto, CA
  • Architected and launched DataCraft, a centralised ingestion platform processing > 10 billion events daily.
  • Built resilient, fault-tolerant pipelines using Kinesis, Lambda, and S3, ensuring data integrity into the data lake.
Data Scientist Feb 2015 – Jan 2017
Datanyze  ·  San Mateo, CA
  • Owned strategy & roadmap for a CRM-integrated product (Salesforce) that analysed clients' most successful customers.
  • Spidered millions of websites daily, identifying technology stacks as "technographic" predictive lead-scoring signals.
  • Beta tests demonstrated measurable increases in qualified opportunities and average deal sizes.
Energy Analytics Software Developer Jun 2013 – Dec 2014
Ascend Analytics  ·  Oakland, CA
  • Migrated the core energy analytics codebase from SAS to WPS, saving an estimated $250,000 in annual licensing costs and improving platform stability.
Senior Analyst — R&D Jun 2011 – Jul 2012
Global Analytics  ·  Chennai, India
  • Led a 4-member team applying social-media signals to online-lending risk modelling, generating $1.2M in additional revenue.
  • Designed key modules in the Automated Modeling Platform (R, Python, MySQL).
Associate — R&D Jan 2010 – Jun 2011
Idea Research and Development  ·  Pune, India
  • Developed SCION, an evolutionary computational algorithm for maximising supersonic missile intake performance (Matlab, Python, Qt).
CS
Master of Engineering, Computer Science
Cornell University  ·  New York, USA
Machine Learning · AI · NLP · Algorithm Design · Databases
2012 – 2013 GPA 3.5 / 4.0
AE
MSc (Eng), Aerospace Engineering
Indian Institute of Science  ·  Bangalore, India
Thesis: Odor Source Localization using Swarm Robotics
2006 – 2008 GPA 7.0 / 8.0
EC
BTech, Electronics & Communication Engineering
Government Engineering College  ·  Trichur, India
Thesis: Blind Source Separation using Independent Component Analysis
2002 – 2006
View Google Scholar Profile
Conference Paper 12 citations
Strategies for Locating Multiple Odor Sources using Glowworm Swarm Optimization ↗
J. Thomas, D. Ghose  ·  IICAI, pp. 842–861  ·  2009
Patent 11 citations
Detection of Nuclear Spills Using Swarm Optimization Algorithms ↗
D. Ghose, J. Thomas, K.N. Krishnanand  ·  US Patent 8,838,271  ·  2014
Thesis 3 citations
Odor Source Localization using Swarm Robotics ↗
J. Thomas  ·  Master's thesis, Indian Institute of Science, Bangalore  ·  2008
Book Chapter 1 citation
A GSO-Based Swarm Algorithm for Odor Source Localization in Turbulent Environments ↗
J. Thomas, D. Ghose  ·  Handbook of Approximation Algorithms and Metaheuristics, 2nd Ed., pp. 711–737 (CRC Press, 2018)
Invited Talk
Industry Talk: Data Science Road
University of New Brunswick  ·  March 18, 2022
Invited Talk
Odor Source Localization using Swarm Robotics
IDSIA (Istituto Dalle Molle di Studi sull'Intelligenza Artificiale), Switzerland  ·  Nov 20, 2009