Senior ML/NLP Engineer - Upgrade Advanced RAG System for Scientific Document Research - Contract to Hire
Looking for extensive search/retrieval experience
I have a working, well-tested RAG (Retrieval-Augmented Generation) system built in Python for processing 300+ scientific/technical PDF documents. The system already implements state-of-the-art retrieval techniques (hybrid search, hierarchical indexing, reranking, multi-hop reasoning, guardrails, evaluation) and has solid benchmarks.
Looking for someone who can take it to the next level — implementing specific upgrades grounded in the latest 2025-2026 research papers to improve retrieval quality and add new capabilities.
The work involves:
Upgrading existing components with newer algorithms backed by recent peer-reviewed research
Adding an agentic retrieval layer with proper reasoning loops
Benchmarking alternative approaches for key pipeline stages (parsing, embeddings) on our actual corpus before committing
Building a comprehensive evaluation and ablation suite with paper-ready visualizations
Writing clean, testable code that follows existing patterns in the codebase
This is NOT a build-from-scratch project.
looking for-
Deep hands-on experience building and optimizing RAG pipelines (not just tutorials — real systems)
Familiarity with latest retrieval research (2024-2026 papers)
Ability to read a research paper and implement the key ideas cleanly
Experience running proper ablation studies and benchmarks
Strong Python, PyTorch, Hugging Face Transformers
Apply tot his job
Apply To this Job