Projects

Selected work

Selected work in AI infrastructure, NLP, and applied research.

I — In production
N° 01

Molcajete

AI-powered transcription and analysis pipeline for political focus group research in Mexican Spanish.

A full audio-to-insight pipeline replacing error-prone transcription and weeks of manual coding. Speaker diarization, transcription, theme classification, and integrated reporting — all surfaced through a tooling layer that researchers actually use.

1,300+
hours of audio processed
<60 min
turnaround per project
N° 02

Adapta

Data preprocessing pipeline and LLM fine-tuning infrastructure for Mexican Spanish political analysis.

QLoRA fine-tuning workflow with reproducible training and evaluation, tracked in a registry. Built for empirical comparison of base models, prompts, and adapters across a curated benchmark of domain-specific tasks.

40+
evaluation metrics
100+
training runs
N° 03

Nopalero

Automated participant screening system for qualitative recruitment.

Automated intake pipeline that replaces hours of manual data entry per project. Combines OCR, fraud detection, and socioeconomic classification — so analysts focus on the decisions, not the paperwork.

48
validation checks
0
manual data entry
II — Open source

Streamlined Python library for scraping case data from the Brazilian Supreme Court (STF).

Have a problem that doesn't fit a template?

Most of the work above started as someone saying exactly that.

Start a conversation