I design and deploy end-to-end Generative AI, LLM, and RAG pipelines. Specialized in productionizing scalable AI platforms and robust evaluation systems. Co-authored 2 AI safety publications (CVPR, EMNLP).
Led a team of 4-5 engineers to productionize scalable AI platforms. Built automated evaluation pipelines and ran rapid experiments to drive product maturity.
Proposed an inference-time defense framework leveraging safe reward models to defend against jailbreak attacks in Multi-modal LLMs.
A novel framework for enhancing reward model generalization in low-resource settings using few-shot examples.