Working to put our long-term future on a better track. I've worked as a software engineer, as a researcher, and as the first employee of a startup getting it off the ground. I currently work to make Elicit's (elicit.com) answers more accurate and vettable.

Elicit, Machine Learning and Evaluation Product Manager

2021 — Now

San Francisco Bay Area

I product-manage the team that makes Elicit give accurate and vettable answers to users' research questions

Set the roadmap for ML features and evals. I talk to key customers to learn how Elicit's answers need to improve, then translate what I learn into specs for the ML team and into eval designs

Started and lead the 2-person eval team at Elicit. This team creates evals for all Elicit features that let an ML engineer run a quick command to learn whether the change they just made was an improvement, and to have justified confidence in that answer

Led our work on data extraction from its inception -- worked very closely with key users to build a minimum viable product that fit their needs, then product-managed extending the MVP. Data extraction is now the core feature of Elicit and we make constant improvements, e.g. https://email.elicit.com/deliveries/dgTd9ggDANJo0WgBjbfL0NY-iZqxBk4O86BA

Designed the tasks and evals for our paper on answering questions about papers using iterated decomposition: https://arxiv.org/pdf/2301.01751

Designed and created the dataset for a high-fidelity approach to evaluate Elicit's search which allowed us to improve precision by around 78% (see https://email.elicit.com/deliveries/dgTd9ggDAKHze6DzewGQc_h8U7zEMbsdsvUZ6pw=)

(When Ought spun off Elicit, I moved along with the entire team from Ought to Elicit. That happened in summer 2023, but my role transitioned earlier, so I've used the role transition date here)

Ought, Technical Product Manager

2020 — 2021

San Francisco Bay Area

Ought pivoted to building a forecasting product. I worked as a technical product manager, data scientist, and software engineer on the new product

Led the team that won Metaculus's El Paso COVID forecasting competition: https://pandemic.metaculus.com/questions/4161/el-paso-series-supporting-covid-19-response-planning-in-a-mid-sized-city/

Launched the first prototype of literature review in Elicit, which led us to pivot to literature review

Ought, Experimenter

2019 — 2020

San Francisco Bay Area

Led our research on Factored Cognition/Iterated Distillation and Amplification. This research was Ought's main focus as an organization.

Our main report on this research: https://ought.org/updates/2020-01-11-arguments

Set and carried out our research plan. Created a roughly weekly loop of running an experiment, analyzing the results, and then running a tweaked experiment

Managed a team of ~15 contractors pretending to be AIs in our experiments (pre-GPT-3!). Created onboarding curriculum to teach this large part-time team to do some really odd work (to trick each other as part of our experiments)

We got negative results from these experiments, so we abandoned them

Ought, 1st employee

2018 — 2019

San Francisco Bay Area

Helped get Elicit off the ground. Mostly ops and hiring.

Put fundamental policies and procedures in place (e.g. Gusto for payroll and HR)

Created our first ops handbook

Sovereign Finance, Co-Founder

2018 — 2018

Explored using satellite imagery for economic development: https://medium.com/alan-do/empowering-emerging-economies-ai-satellite-imagery-and-economic-analysis-311e168465cf

Education

Swarthmore College

B.A. with High Honors

Catalina Foothills High School

Hack Reactor

Software Engineering