Lead Data Scientist
Job Description
Opportunity for a Senior Data Explorer (LLM/NLP Focus)
Do you thrive at the intersection of logic and learning systems? A confidential team is seeking a seasoned builder of smart systems: someone who speaks fluent Python and thinks in embeddings.
This is a hands-on role for someone who’s been deep in the weeds of GenAI, LLMs, and NLP — and wants to push boundaries in applied research with real-world impact.
What You’ll Be Doing
- Crafting custom models for stream and batch pipelines — think GenAI, LLMs, NLP, and ML.
- Working across ingestion, retrieval, RAG, fine-tuning, and prompt design.
- Partnering with internal teams to make sure your models don’t just work — they work well.
Who You’ll Be Working With
- Product thinkers, engineers, and ML folks who care about performance and precision.
- A collaborative crew that values experimentation and iteration.
What You Bring
- Advanced degree in a technical field (CompSci, Stats, Linguistics, etc.).
- 8+ years wrangling structured/unstructured data for insights and automation.
- 3+ years hands-on with Python and libraries like Hugging Face, PyTorch, and TensorFlow.
- Deep experience with transformer-based NLP models and semantic search.
- Familiarity with tuning LLMs and staying current with open-source trends.
Logistics & Perks
- Hybrid setup — you’ll need to be able to drop into a workspace in the Northeast US (NY/NJ area).
- Compensation: $150k–$210k base + bonus + benefits + sign-on