Information

You are on the new improved site. You can view the old site in view-only mode here until June 27, 2026

Sage Journals HomeSage Journals Home
loading
Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences