Bo Dai (@daibond_alpha) / X

Bo Dai

226 posts

Bo Dai

@daibond_alpha

Assistant Professor at @gtcse, Research Scientist at @GoogleDeepMind | ex @googlebrain

California, USA

Joined October 2012

Pinned
Bo Dai
@daibond_alpha
Dec 12, 2024
RL is so back, as David Silver predicted. m.youtube.com/watch?v=pkpJMN…
33K
Bo Dai
@daibond_alpha
Jun 27, 2020
Tired of Softmax? You may try our neural K-NN. Our SOFT top-k operator enables an efficient end2end training! “Differentiable Top-k Operator with Optimal Transport” arxiv.org/abs/2002.06504 with @Xiexieyujia, @hanjundai, @MinshuoC, @tourzhao, Hongyuan Zha, Wei Wei, Tomas Pfister
Bo Dai
@daibond_alpha
Oct 4, 2024
I did not even have 10 submissions…. There are two different “Bo Dai”.
Peter Richtarik
@peter_richtarik
Oct 4, 2024
Source: papercopilot.com/paper-list/neu…
35K
Bo Dai
@daibond_alpha
Apr 4, 2020
Here is a paper with full version of the relationship between probability metrics arxiv.org/abs/math/02090… by @AlisonLGibbs and @mathyawp.
Arthur Gretton
@ArthurGretton
Mar 21, 2020
🥬
Bo Dai
@daibond_alpha
Apr 11, 2023
This is the first time I submitted a paper and got reviews with both 1 (very strong reject) and 8 (strong accept). I alway believe 1 is the highest praise for a paper!
17K
Bo Dai
@daibond_alpha
Dec 21, 2024
RL is sparkling again.
Bo Dai
@daibond_alpha
Dec 12, 2024
RL is so back, as David Silver predicted. m.youtube.com/watch?v=pkpJMN…
6.1K
Bo Dai
@daibond_alpha
Dec 9, 2022
This is the time to read “the bitter lesson”, again, with ChatGPT as another example. incompleteideas.net/IncIdeas/Bitte…
Bo Dai
@daibond_alpha
Apr 13, 2022
Our team is hiring! We have been pushing the frontier of operation research by machine learning. @hanjundai and Dale are not only excellent researchers but also great mentors. Welcome to apply the position :)
Hanjun Dai
@hanjundai
Apr 13, 2022
Our team at Google Brain (w/ Dale Schuurmans, @daibond_alpha, @rl_agent, @mengjiao_yang and many others) is hiring a SWE to work on representation learning for reasoning, search & decision making. Apply below if you are interested! careers.google.com/jobs/results/8…
Bo Dai
@daibond_alpha
May 29, 2023
We are exciting to present AdaPlanner. We show that the long-term decision making ability of a LLM-agent can be significantly improved by iterative closed-loop planning with code-style prompt and skill discovery.
Haotian Sun
@haotiansun014
May 29, 2023
Excited to introduce AdaPlanner, our LLM agent for solving embodied tasks via closed-loop planning. Key features: 1) Adaptively refines LLM-generated plan from environment feedback, with both in-plan and out-of-plan refining strategies 2) A code-style LLM prompt structure to
00:00
8K
Bo Dai
@daibond_alpha
Oct 23, 2020
Thank my great collaborators @ofirnachum , Yinlam, @LihongLi20 , @CsabaSzepesvari , and Dale! Stay tuned. More DICEs are on the way!
Ofir Nachum
@ofirnachum
Oct 23, 2020
A new and beautiful (and practical!) technique for computing confidence intervals of policy value in RL! arxiv.org/abs/2010.11652 This is a problem that I & collaborators have been thinking about for ~1 year. At the beginning, I didn't think such a nice result was possible... 1/
Bo Dai
@daibond_alpha
Jul 8, 2020
We propose a unified view to summarize the existing (DICE, MWL/MQL, LSTDQ, etc) and design new OPE estimators, under which a comprehensive examination in terms of the tradeoff between statistical and optimization has been conducted.
Bo Dai
@daibond_alpha
Dec 5, 2019
The schedule and accepted papers are released: optrl2019.github.io. Congratulations to all the recipients of the travel awards. We thank all the invited speakers, panelists and authors. Thanks to our sponsors @GoogleAI and @DeepMindAI. See you in Vancouver next week.
Bo Dai
@daibond_alpha
Jul 25, 2023
Aloha! Just arrive Honolulu for #ICML23.
4.5K
Bo Dai
@daibond_alpha
Aug 6, 2020
The code for DICE family is online now:
GitHub - google-research/dice_rl
From github.com