News
The AI Seminar is a weekly meeting at the University of Alberta where researchers interested in artificial intelligence (AI) can share their research. Presenters include both local speakers from the University of Alberta and visitors from other institutions. Topics can be related in any way to artificial intelligence, from foundational theoretical work to innovative applications of AI techniques to new fields and problems.
On September 1, Stephen Montes Casper, a PhD candidate at MIT, presented “Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback” at the AI Seminar.
Abstract:
Reinforcement Learning from Human Feedback (RLHF) has emerged as the central alignment technique used to finetune state-of-the-art AI systems such as GPT-4, Claude, Bard, and Llama-2. Given RLHF's status as the default industry alignment technique, there is a need to carefully study how we got here and what challenges persist in today's state of the art. We review open challenges and fundamental limitations with RLHF with a focus on applications in large language models. Technical progress in some respects is tractable, and this should be seen as a cause for concerted work and optimism. However, other problems with RLHF cannot fully be solved and instead must be avoided or compensated for with non-RLHF approaches.
Watch the full presentation below:
Want to learn how you can kick-start your AI career? Find out more about Amii's Career Accelerator.