Multi-armed bandit
- Wikidata
- https://www.wikidata.org/wiki/Q2882343
- OpenAlex ID
- https://openalex.org/C123197309 (API record)
- OpenAlex Description
- reinforcement learning problem exemplifying the exploration–exploitation tradeoff
- OpenAlex Level
- 3