{"version":"1.0","type":"rich","provider_name":"gaks.ai AI Glossary","provider_url":"https://gaks.ai/glossary","title":"Reinforcement Learning — AI Glossary","author_name":"Glenn Katrud Solheim","author_url":"https://gaks.ai","width":600,"height":200,"html":"<div style=\"font-family:sans-serif;border:1px solid #e0e0e0;border-radius:8px;padding:16px;max-width:600px;background:#ffffff;color:#111111;\"><p style=\"margin:0 0 4px;font-size:11px;color:#666;\">AI Glossary — gaks.ai</p><h3 style=\"margin:0 0 8px;font-size:16px;\">Reinforcement Learning (RL)</h3><p style=\"margin:0 0 12px;font-size:14px;line-height:1.6;\">A training paradigm where an agent learns by interacting with an environment, taking actions, and receiving rewards or penalties based on the outcomes. Rather than learning from labeled examples, the agent discovers through trial and error what behaviors lead to the best cumulative reward, the approach behind game-playing AI systems like AlphaGo and robotic control systems.</p><a href=\"https://gaks.ai/glossary/reinforcement-learning\" style=\"font-size:12px;color:#0077aa;\">Source: gaks.ai/glossary/reinforcement-learning →</a></div>"}