Apply modern RL methods to practical problems of chatbots, robotics, discrete optimization, web automation, and more