# Rayaan Anil Khatau > Software Engineer at Formation Bio Location: Queens County, New York, United States Profile: https://flows.cv/rayaan I'm interested in all the ways technology can capture, quantify and copy humans behavior. I build features that keep Respondent's research marketplace going and occasionally ones that bring it down :D ## Work Experience ### Software Engineer @ Formation Bio Jan 2025 – Present ### Software Engineer @ Benchling Jan 2022 – Jan 2025 | Brooklyn, New York, United States At Benchling, I played a key role in enhancing our lab automation tools, empowering R&D teams in biology to manage and analyze experimental data at scale. My focus was on designing and optimizing systems for the ingestion, transformation, and processing of complex lab instrument data, enabling more meaningful data capture to support experimentation, collaboration, and compliance. Key Responsibilities: - Developed and maintained tools for bulk ingestion and transformation of tabular data from various lab instruments, ensuring seamless integration with Benchling's laboratory systems of record. - Implemented and supported data pipelines to import and normalize data from customer-hosted sources, including laboratory computers and cloud-based systems. - Scrum leader responsible for sprint rituals including standups, backlog grooming, and sprint planning. Responsible for triaging internal and customer escalations, and managed alignment product roadmap and daily team execution Technologies & Tools: Python, JavaScript, AWS IoT Greengrass, S3, SQS, DynamoDB, Postgres, SQLAlchemy ### Software Engineer @ Respondent Inc Jan 2017 – Jan 2022 | Brooklyn, New York • Maintain and expand Respondent's back-end databases - MongoDB and Neo4J - of platform users • Responsible for the matching algorithm infrastructure that delivers highly targeted paid research studies to our users • Enriching data on existing user profiles by leveraging their online social data, activity on the platform, public data for eg. information on employers etc, as well as shared patterns/behaviors with other users. • Infrastructure for data collection and capturing of key marketplace/performance metrics for ~500 live projects, feeding charts/graphs to showcase time series data ### Software Engineer @ Thread Genius Jan 2017 – Jan 2017 | Greater New York City Area • Designed and built failure-safe end-to-end service to log and monitor API usage, and restrict customer access by pricing tier • Integrated Redis, Redshift and S3 to create a scalable, low-latency solution for capturing and querying usage statistics • Created and sanitized image databases from different sources - Shopstyle, Bloglovin, Instagram, • Developed robust API layer for Thread Genius’s service that detects bounding boxes in clothing images from URLs and an in-app camera ### Workshop Lead @ Columbia University Emerging Scholars Program Jan 2017 – Jan 2017 | Greater New York City Area • Teaching two weekly seminars covering introductions to specialized topics in Computer Science such as Encryption, Biometrics, Network Theory to freshmen with little or no experience • Designing curriculum as well as coursework and presentation materials on specific problems within these disciplines ### Data Science Intern @ Envestnet | Yodlee Jan 2016 – Jan 2016 | Redwood Shores, CA • Developed project to enrich financial transaction records with phone numbers string extracted from the unstructured text descriptions of hundreds of millions of credit card and bank transactions • Built classifier that uses natural language processing, pattern matching, and learning from crowd behavior to recognize a phone number as belonging to either a business or an individual account holder (with 99.07% precision) • Researched the applications of Robust Principal Component Analysis (RPCA) to the detection of outliers in Yodlee’s financial data • Researched alternate time series representations of financial data such as SAX (Symbolic Agreement Approximation) to speed up data processing and motif discovery ### Software Development Intern - Machine Learning @ Yodlee Jan 2015 – Jan 2015 | Redwood Shores, CA • Developed multiple models for hyperparameter optimization for CNN classification of unstructured text– customization of stochastic gradient descent, implementation of basinhopping algorithms from scikit-learn • Built Jenkins Continuous Integration server and wrote battery of tests to maintain quality of the build – server executed unittests, generated code coverage metrics, and reported violations in code committed to project repository ### Software Development Intern @ Parity Cube Pvt. Ltd. Jan 2014 – Jan 2014 | Mumbai Area, India • Worked closely with senior developer and CTO to revamp automated process of placing native advertisements in text segments of online editorial content such as blogs, forums, articles • Built system for linguistic genre classification for online content, implementing Natural Language ToolKit (NLTK) ## Education ### Bachelor of Arts (B.A.) in Computer Science Columbia University Jan 2013 – Jan 2017 ### ISC - Indian School Certificate Cathedral and John Connon School Jan 2011 – Jan 2013 ### ICSE - Indian Certificate of Secondary Education in Science Stream Cathedral and John Connon School Jan 2009 – Jan 2011 ## Contact & Social - LinkedIn: https://linkedin.com/in/rayaan-anil-khatau-01a189b3 --- Source: https://flows.cv/rayaan JSON Resume: https://flows.cv/rayaan/resume.json Last updated: 2026-04-01