Boyuan Chen (@boyuan-chen)

GUIDE is built on our recent platform, CREW. Please check it out as well. Many have started using CREW in their research!

generalroboticslab.com/CREW/

03.12.2024 18:23 👍 7 🔁 0 💬 0 📌 0

Website with video, code, and paper: generalroboticslab.com/GUIDE
Duke's press release: pratt.duke.edu/news/trainin...

03.12.2024 18:23 👍 7 🔁 0 💬 1 📌 0

Read the Full Paper: Dive into GUIDE’s technical details and our exciting findings. Explore the future of human-in-the-loop learning! Kudos to our team: Lingyu Zhang, Zhengran Ji, and Nicholas Waytowich!

03.12.2024 18:23 👍 7 🔁 0 💬 1 📌 0

Human Cognitive Tests: We didn’t just study AI - we looked at humans too! Cognitive tests revealed how individual differences impact training outcomes, helping us understand how diverse skills translate to better AI guidance. #Neuroscience #HumanFactors

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

How Does It Perform? Our experiments show GUIDE delivers 30% higher success rates than baseline RL models in complex tasks like navigation and multi-agent hide-and-seek. And with just 10 minutes of human feedback, it surpasses previous methods by up to 40%.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

Mimicking Human Feedback: After initial training, GUIDE keeps improving! While the human is providing feedback, we train a feedback model that replicates human guidance, allowing the agent to continue learning independently. This reduces human effort and ensures robust, ongoing performance gains.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

Continuous Human Guidance: GUIDE’s interface allows continuous feedback - no more simple “yes/no” or “good/bad” labels. Trainers can provide nuanced guidance at every decision step, making learning more natural and expressive for both AI and trainers.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

Largest Human Study Ever: Most studies on human-guided AI have involved fewer than 10 participants - often the authors themselves! GUIDE stands apart with the largest human subject study to date in this field, involving 50 participants to rigorously validate our approach.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

Why GUIDE? Real-time decision-making is a tough nut to crack for AI, especially in high-stakes tasks. GUIDE leverages human feedback in real time, grounding it into _dense rewards_ to accelerate AI learning, even in challenging environments with sparse feedback signals.

03.12.2024 18:23 👍 1 🔁 0 💬 1 📌 0

🚀 We’re thrilled to introduce GUIDE - our framework for real-time human-guided reinforcement learning, enabling continuous human feedback to teach AI agents faster and better. Accepted to #NeurIPS2024! Here’s what makes it special:

03.12.2024 18:23 👍 13 🔁 2 💬 1 📌 0

That’s amazing! Congratulations!!!

27.11.2024 01:51 👍 0 🔁 0 💬 0 📌 0

Just joined the platform. Would love to join the list if possible!

16.11.2024 04:56 👍 1 🔁 0 💬 0 📌 0
