Supervisor: Professor. Reihaneh Rabbany
Intern research assistant for the complex data lab.
• Engineered prompts for gpt3.5 that was used to generate a human trafficking NER dataset (HTGen)
• Exploited LLMs to generated synthetic annotations for a cost effective way to get labels on niche domains as well as domains where the data is highly sensitive and thus not perfered to be labeled by human annotators.
• Explored rule base gazetteers and knowledge bases of human names.
• Continual learning to expand existing knowledge bases
• Surveyed the ethics (bias and toxic lanugage) in GPT generated synthetic human trafficking domain data.
Papers:
1. SWEET - Weakly Supervised Person Name Extraction for Fighting Human Trafficking
Conference: EMNLP Findings 23
DOI:10.18653/v1/2023.findings-emnlp.219
2. T-NET: Weakly Supervised Graph Learning for Combatting Human Trafficking
Conferences: AAAI-24: AI for Social Impact (AISI)
DOI: https://doi.org/10.1609/aaai.v38i20.30233
3. SWEET - Poster Presented
ACL2023 Student Research Workshop
Link: https://virtual2023.aclweb.org/paper_S122.html
Volunteer positions:
Student Assembly Note Taker
Montreal AI Symposium Reviewer and volunteer