Bulletin · UTC

Merged timeline: 9 items (blog publish time and listing createdAt in UTC). For registry-only weekly slices, use /new.

  1. ExplainX is a comprehensive hub for discovering and monetizing AI skills, agents, tools, and MCP servers. With over 10,000 indexed skills and 100,000 AI tools, it provides a ranked directory, community feedback, and res…

    by MCP @ Explainx · skills
  2. Postiz

    Postiz is an open-source, self-hosted social media scheduling tool that supports platforms like X, Bluesky, Mastodon, and Discord. It offers post scheduling, analytics, and team collaboration features.

    by MCP @ Explainx · social-media
  3. Multi-Agent LLM Financial Trading Framework.

    by MCP @ Explainx · Finance
  4. AI benchmarking in 2026 has reached a critical inflection point. Traditional benchmarks like MMLU and HellaSwag are saturated, with top scores above 88% and 95% respectively, while frontier models cluster within statistical noise of one another. This comprehensive guide covers every major benchmark category—from language understanding to agent evaluation—the 37% lab-to-production gap, benchmark gaming vulnerabilities, and what actually matters for production AI systems.

  5. Separating a viral screenshot from Anthropic’s published rules—conversation-ending for persistent abuse, account actions under the Usage Policy, and why “hurt the AI’s feelings” is the wrong mental model.

  6. What shipped in Codex’s agent UI, how custom pets are packaged through OpenAI’s hatch-pet skill, and why a little dock-side animation can still be a serious product bet.

  7. Two vendor postures on the same open-source agent stack: OpenAI leaning into subscription-backed access for OpenClaw, while Anthropic enforces first-party surfaces for subscription entitlements and bills third-party tools differently.

  8. A practical tour of Sim—visual agent orchestration, vector-backed knowledge, managed Copilot for flow editing on self-hosted installs, and how it differs from harness-first tools like OpenClaw.

  9. Terminal-Bench 2.0 has become the de facto standard for AI agent evaluation since May 2025—used by virtually every frontier lab. This deep dive covers the 89-task benchmark, its evolution from version 1.0, the Harbor framework powering it, and why frontier models still struggle below 65% accuracy on tasks humans complete routinely.