Skip to content

Conversation

@Cederique
Copy link

I have some problems with the linUCB function as it gives a linear and negative cumulative regret, which seems very strange to me in a multi armed bandit model. Is there somebody who can help me solving or explaining this?
The file I was trying is the file/third attempt at learning.ipynb file.

Thank you in advance!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants