# Joseph Caltabiano > Software Engineer | React, TypeScript, Node, AI/ML | 4 YOE building data-driven web apps Location: San Francisco, California, United States Profile: https://flows.cv/josephcaltabiano Versatile software engineer with expertise in data science, AI, and full-stack web development. Specializing in creating data-driven applications and visualizations, with a focus on genomic research, environmental conservation, and scientific data analysis. Technologies and expertise: React, AWS (S3, Athena, Glue, CloudFormation), R&D web development stack, machine learning, prompt-engineering, generative AI, large language models, Python, SQL, Docker, Terraform, GitHub Actions, natural language processing, multi-agent architecture, prompt engineering, OpenAI API, Vercel, Svelte, Google Earth Engine, Streamlit, Material UI, Plotly, D3.js, Gosling, Boto3, Pydantic, CRISPResso2 ## Work Experience ### Software Engineer @ Earth Genome Jan 2024 – Jan 2024 Overview: • Designed, developed and deployed natural language-based applications to make Conservation International’s research more accessible and engaging for policymakers, journalists, and researchers. Made key recommendations to advance Conservation International’s digital strategy, including an actionable roadmap for creating production-ready AI applications. Key Contributions: • User-focused chat experience to enable querying of over 100 research articles with nuanced, citation-backed insights tailored to various expertise levels. Customizations for varying response characteristics were provided. • A pipeline to generate interactive, natural language-based geospatial visualizations to simplify access to environmental data by non-technical users. Used natural language processing (NLP) and multi-agent architecture to generate Google Earth Engine (geemap) code. • Generative AI prototypes using OpenAI API and open-source LLMs (Ollama/LLaMA), incorporating semantic search, ontology-based retrieval, and retrieval-augmented generation (RAG) to ensure accurate, document-grounded responses. • Prompt engineering and multi-agent architecture for dynamic and accurate outputs. Advanced assistants with features such as file search and code interpreter to improve interactivity and accessibility of conservation data. Research-based conversational interfaces using Svelte and Vercel. Technologies Used: • Large language models, natural language processing, multi-agent architecture, prompt engineering, OpenAI API, Vercel, Svelte, Google Earth Engine ### Software Engineer @ Mammoth Biosciences Jan 2021 – Jan 2023 Overview: • Architected and implemented a novel end-to-end data pipeline and visualization platform to analyze off-target gene editing effects of proprietary Cas proteins. Key Contributions: • ETL workflows on AWS services to process mutations from CRISPResso2 into clean, actionable data. Established comprehensive data infrastructure using AWS + Terraform. • Web-based analytics platform using Streamlit, integrating interactive Plotly visualizations and a customized genome browser (Gosling). Used by scientists to analyze off-target effects and editing efficiency metrics, providing insights on amplicons from genomic scale down to base-pair resolution. • Key support to scientists, including Jupyter workflows and D3 visualizations to track experimental sample metadata. • Automated deployment workflows using GitHub Actions, ensuring reliable, scalable updates to data processing infrastructure and to maintain code quality. Technologies Used: • React/Python/SQL, AWS (S3, Athena, Glue, CloudFormation), Docker, Terraform, GitHub Actions, Streamlit, Material UI, Plotly, D3.js, Gosling, Boto3, Pydantic, CRISPResso2, batch processing, step functions ### Software Engineer Intern @ HCL Technologies Jan 2020 – Jan 2020 | Massachusetts, United States • Designed and created reusable React components for a progressive web app email platform • Collaborated with designers and engineers to design components for improved customer experience • Implemented enhancements, performed product testing and bug fixing ### Systems Design Intern @ Draper Jan 2019 – Jan 2019 | Cambridge, Massachusetts, United States • Designed and created reusable React components for a custom data visualization tool • Solved errors and bugs in the backend ERP and created React components for its web frontend ### Quality Assurance Intern @ Draper Jan 2018 – Jan 2018 | Cambridge, Massachusetts, United States • Created and expanded a streamlined database to store and analyze supplier audit data • Wrote queries commonly used by clients and automated the generation of plots from the results • Built a user interface for the database using Access forms ### Quality Assurance Intern @ Draper Jan 2017 – Jan 2017 | Cambridge, Massachusetts, United States • Created and expanded a streamlined database to store and analyze supplier audit data • Wrote queries commonly used by clients and automated the generation of plots from the results • Built a user interface for the database using Access forms ## Education ### Master of Science - MS in Data Science Worcester Polytechnic Institute ### Bachelor of Science - BS in Computer Science Worcester Polytechnic Institute ## Contact & Social - LinkedIn: https://linkedin.com/in/josephcaltabiano --- Source: https://flows.cv/josephcaltabiano JSON Resume: https://flows.cv/josephcaltabiano/resume.json Last updated: 2026-03-29