
I'm a Research Scientist at Meta Superintelligence Labs Trust ( formerly GenAI Trust, formerly Responsible AI) interested in safety alignment and interpretably of LLMs and AI for social good ( healthcare/public policy/general knowledge ) and have had my research published in ACL, CoNLL, ECML, EMNLP, ICLR, ICML, IJCAI and NeurIPS amongst other venues. My research interests broadly revolve around (i) safety alignment and evaluation/mitigation of PII, IP, Biometric, Hallucicnation and Memorization risks of pre and post trained LLMs and (ii) interpreting and steering Machine Learning model decisions on natural language and multi-modal data, and learning representations of entities for downstream tasks. My PhD defended in July 2022 with my advisor Joydeep Ghosh and committee memebers Alex Dimakis, Harris Vikalo, Atlas Wang & Byron Wallace is titled "In-process Diagnostic methods for Entity Representation Learning on Sequential data at Scale" and focuses on methods that allow neural networks to be more transparent, explainable, and diagnosable during the process of learning and inference as opposed to in a post-hoc analysis fashion. My research has dealt with hallucination reduction with contrastive activation steering (NeurIPS 25 Mech Interp Workshop), measuring AI slop in text (under submission), memorization safety evaluation of Llama 3 OSS models, clustering influence-based embeddings for improved error analysis (NeuIRPS 23), open source tooling for explaining LLMs (EMNLP 23), intermediate entity-based sparse interpretable representation learning (EMNLP 22), efficient entity based knowledge injection for VQA (WWW 222),interpretable biomedical text representations (ACL 21), dense entity retrieval using dual encoders (CoNLL 19), prototypical learning of time series data (ICML 19(short) IJCAI 19 (long), efficient entity-based knowledge injection for KBVQA amongst other things. key words: safety alignment, contrastive steering, influence functions, label quality, data valuation, memorization in LLMs, interpretable entity representations, knowledge injection for multimodal VQA, dense retrieval, in-network prototype learning, dual encoders, feature importance methods, counterfactual explanations recent news
| |
A selection of publications and projects, academic, professional and personal. | |
| The Llama 3 Herd of Models -- Paper || Project/Code || Llama 3 405B online |
|
| Error Discovery By Clustering Influence Embeddings (NeurIPS2023) -- Paper || Code || Slides |
|
| Using Captum to Explain Generative Language Models (EMNLP 23) -- Paper || Code |
|
| Intermediate Entity-based Sparse Interpretable Representation Learning (EMNLP 22) -- Paper || Code || Poster |
|
| Improving and Diagnosing Knowledge-Based
Visual Question Answering via Entity Enhanced Knowledge Injection (WWW 22) -- Paper || Code |
|
| Biomedical Interpretable Entity Representations (ACL 21) -- Paper || Code || Slides |
|
| Learning Dense Representations for Entity Retrieval (CONLL 19) -- Paper || Code |
|
| Explaining Deep Classification of Time-Series Data with Learned Prototypes (ICML 19 Timeseries Workshop 4pgs) -- Paper || Code IJCAI 19 Knowledge and Health Discovery Long paper -- Paper || Code |
|
| Applying Machine Learning Methods to Enhance the Distribution of Social Services in Mexico (ARXIV 2017) |
![]() |
![]() | |
| Automated construction and analysis of political networks via open government & media sources (ECML 16) |
|
|
|
| Link Detection in Political Networks (NLP Class Project 2018) |
|
| Predicting a Politician's Party Affiliation from a Photo using Deep Learning Methods ( Deep Learning Class project 2017) |
|
| |
| |
| Predicting when a Yearbook Photo was Taken using Convolutional Neural Networks |
![]() |
| Pitchfork: Are music festival lineups getting worse? |
![]() |
| Glasstire 15th Year Anniversary Texas Art Events | |
| Assessment of Similarity in Central and State Climate Change Programs of Mexico (Simultec Special Session on Applications of Modeling and Simulation to Climatic Change and Environmental Sciences. 2015) |
|
| Glasstire 15th Year Contributors | |
| Glasstire 15th Year Texas Artists | |
| Personal Music Visualizations and Interactive Lists |
|
| Turning Album of the Year Lists into a Music Discovery Tool | |
| Looking at US Presidential Election County Changes from 2012 to 2016 |
|
| Blue Islands Project Identify Blue Counties in America that are surrounded by Red ones, and Predict if a county is a blue island based on just socio-economic and public health data. |
|
| Google Results By Country |
|
| Every Foreign and Best Picture Film Ever Nominated for the Oscars |
|
| My Favorite Painter's Colors |
|
List of Spotify's Clarify Data Stories series articles written by Rob Mitchum for which I contributed data mining and data visualization, 2016. | |
| Groove Is In The Heart |
![]() |
| Songs of Summer Jobs |
'![]() |
| Immigration Songs: How Music Crosses American Borders |
![]() |
| There Are Three Types of Gun Songs |
![]() |
| The Persistent Glass Ceiling of Music |
![]() |
| From a Benzo to Student Loans: Debt Anxiety in Today’s Pop Music |
![]() |
| Hot Time, Summer in the City |
![]() |
Visualizations related to my masters thesis project in Barcelona about Texas Politics. Click here for Thesis Presentation Slides | |
| WhoYouElect.com |
|
| Politician Networks |
|
| Extended Politician Networks |
|
| Topics and Table of Contents |
|
| Media Coverage Maps |
|
| More Media Coverage Results |
|