ab-test-RL

Using Reinforcement Learning for AB test

In this repo, I implemented a code to solve a simple environment of AB test, similar to the Multi-armed Bandit Problem.

The figure shows the result over time of two "workspaces", A and B. The blue dots is the reward from the actions chosen by the agent. The curves, on the other side, refers to the reward curve from each workspace through time.

I used policy gradients implemented in tensorflow. It was my first time coding RL stuff :D

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Dockerfile		Dockerfile
README.md		README.md
ab_example.py		ab_example.py
ab_model.py		ab_model.py
agent.py		agent.py
config.py		config.py
environment.py		environment.py
logger.py		logger.py
requirements.txt		requirements.txt
teste_ab.png		teste_ab.png
utils.py		utils.py
web_server.py		web_server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ab-test-RL

About

Releases

Packages

Languages

luckeciano/ab-test-RL

Folders and files

Latest commit

History

Repository files navigation

ab-test-RL

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages