10+ years of ML experiences in search, natural language processing/understanding. Conversational AI. Proven experience for LLM post training, including but not limited to SFT, RLHF, RLAIF, Reward Modeling, Chain-of-thought, agentic...
10+ years of ML experiences in search, natural language processing/understanding. Conversational AI. Proven experience for LLM post training, including but not limited to SFT, RLHF, RLAIF, Reward Modeling, Chain-of-thought, agentic...