Rosetta is an ML experimentation framework for Named Entity Recognition on job postings. Built to benchmark transformer models on the SkillSpan dataset for extracting skills and knowledge entities from job descriptions.
Technical Highlights:
Performance Comparison:

Key Findings:
Overfitting Analysis:

Both models show minimal overfitting (gap < 0.02)
Error Analysis:

Top confusion pairs for distilbert-base-uncased:
Production Pipeline:
Beyond benchmarking, the project includes a deployment pipeline (taxonomy.py) that runs the trained model against real job posting data (jacob-hugging-face/job-descriptions) with batched inference, chunking logic, and JSON output—a full loop from fine-tuning to production inference.
Built with Python, PyTorch, Transformers, and Hugging Face datasets.