{"version":"1.0","type":"rich","provider_name":"gaks.ai AI Glossary","provider_url":"https://gaks.ai/glossary","title":"Self-Play — AI Glossary","author_name":"Glenn Katrud Solheim","author_url":"https://gaks.ai","width":600,"height":200,"html":"<div style=\"font-family:sans-serif;border:1px solid #e0e0e0;border-radius:8px;padding:16px;max-width:600px;background:#ffffff;color:#111111;\"><p style=\"margin:0 0 4px;font-size:11px;color:#666;\">AI Glossary — gaks.ai</p><h3 style=\"margin:0 0 8px;font-size:16px;\">Self-Play</h3><p style=\"margin:0 0 12px;font-size:14px;line-height:1.6;\">A training technique in reinforcement learning where a model improves by competing against copies of itself rather than against human opponents or a fixed environment. Self-play generates an open-ended stream of training experience that scales with the model's own capabilities, an approach that enabled AlphaGo and AlphaZero to reach superhuman performance in games with no human data beyond the rules.</p><a href=\"https://gaks.ai/glossary/self-play\" style=\"font-size:12px;color:#0077aa;\">Source: gaks.ai/glossary/self-play →</a></div>"}