undefined

Brandon Cui

Research Scientist at MosaicML and Databricks, leading RLHF post-training efforts. Expert in AI model optimization, reward models, and RLHF.