Boyuan Chen

@boyuan-chen

Assistant Professor at Duke University. Robotics and AI. http://boyuanchen.com/

1,481 followers, 545 following, 12 posts. Joined 14.11.2024.

Latest posts by Boyuan Chen @boyuan-chen

GUIDE is built upon our recent platform CREW. Please check it out as well. Many have started using CREW in their research!

generalroboticslab.com/CREW/

03.12.2024 18:23 👍 7 🔁 0 💬 0 📌 0

Website with video, code, and paper: generalroboticslab.com/GUIDE
Duke's press release: pratt.duke.edu/news/trainin...

03.12.2024 18:23 👍 7 🔁 0 💬 1 📌 0

Read the Full Paper: Dive into GUIDE's technical details and our exciting findings. Explore the future of human-in-the-loop learning! Kudos to our team Lingyu Zhang, Zhengran Ji, and Nicholas Waytowich!

03.12.2024 18:23 👍 7 🔁 0 💬 1 📌 0

Human Cognitive Tests: We didn't just study AI - we looked at humans too! Cognitive tests revealed how individual differences impact training outcomes, helping us understand how diverse skills translate to better AI guidance. #Neuroscience #HumanFactors

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

How Does It Perform? Our experiments show GUIDE delivers 30% higher success rates than baseline RL models in complex tasks like navigation and multi-agent hide-and-seek. And with just 10 minutes of human feedback, it surpasses previous methods by up to 40%.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

Mimicking Human Feedback: After initial training, GUIDE keeps improving! While the human is providing feedback, we train a feedback model that replicates human guidance, allowing the agent to continue learning independently. This reduces human effort and ensures robust, ongoing performance gains.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

Continuous Human Guidance: GUIDE's interface allows continuous feedback - no more simple "yes/no" or "good/bad" labels. Trainers can provide nuanced guidance at every decision step, making learning more natural and expressive for both AI and trainers.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

Largest Human Study Ever: Most studies on human-guided AI have involved fewer than 10 participants - often the authors themselves! GUIDE stands apart with the largest human subject study to date in this field, involving 50 participants to rigorously validate our approach.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

Why GUIDE? Real-time decision-making is a tough nut to crack for AI, especially in high-stakes tasks. GUIDE leverages human feedback in real time, grounding it into _dense rewards_ to accelerate AI learning, even in challenging environments with sparse feedback signals.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

🚀 We're thrilled to introduce GUIDE - our framework for real-time human-guided reinforcement learning, enabling continuous human feedback to teach AI agents faster and better. Accepted to #NeurIPS2024! Here's what makes it special:

03.12.2024 18:23 👍 13 🔁 2 💬 1 📌 0

That's amazing! Congratulations!!!

27.11.2024 01:51 👍 0 🔁 0 💬 0 📌 0

Just joined the platform. Would love to join the list if possible!

16.11.2024 04:56 👍 1 🔁 0 💬 0 📌 0