firstbacksecondback
5 Results
Workshop
|
Uncertainty-Penalized Bayesian Information Criterion for Parametric Partial Differential Equation Discovery Pongpisit Thanasutives · Ken-ichi Fukui |
||
Workshop
|
Uncertainty-Penalized Direct Preference Optimization Sam Houliston · Alizée Pace · Alexander Immer · Gunnar Rätsch |
||
Workshop
|
Uncertainty-Penalized Direct Preference Optimization Sam Houliston · Alizée Pace · Alexander Immer · Gunnar Rätsch |
||
Workshop
|
Sat 12:00 |
Uncertainty-Penalized Directed Preference Optimization Sam Houliston · Alexander Immer · Alizée Pace · Gunnar Rätsch |
|
Poster
|
Thu 16:30 |
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model Jing Zhang · Linjiajie Fang · Kexin SHI · Wenjia Wang · Bingyi Jing |