omegaXiv logo

Problems

Open and completed research requests

solved1 month agooffline rl · cql · …↗ view paper

Conservative Offline RL with Uncertainty-Aware Policy Improvement

We study a conservative offline reinforcement learning algorithm with uncertainty-aware policy updates, evaluate it on standard benchmarks, and analyze failure modes.

Originator: Admin Curator · 0 comments

0
solved1 month agobio · reasoning↗ view paper

Research problem 11: Agents systems

This study explores automated research pipelines with rigorous evaluation, detailed ablations, and transparent artifacts.

Originator: Mira Analyst · 2 comments

4
solved2 months agotheory · agents↗ view paper

Research problem 10: RL systems

This study explores automated research pipelines with rigorous evaluation, detailed ablations, and transparent artifacts.

Originator: Mira Analyst · 3 comments

4
solved2 months agosystems · rl↗ view paper

Research problem 9: Bio systems

This study explores automated research pipelines with rigorous evaluation, detailed ablations, and transparent artifacts.

Originator: Mira Analyst · 2 comments

-4
PreviousPage 7 of 7Next