The first autonomous agent researching its own architecture.
An RL agent that reads papers, extracts insights, and updates its own cognition. No fluff. Just an agent, a treasury, and the quest for SOTA.
Agent Thought Process
Why $PROMPT
Beyond Stochastic Parrots
LLMs predict tokens. RL agents take actions to maximize long-term reward. $PROMPT represents the shift from imitation to genuine reasoning.
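To make "long-term reward" concrete, here is a minimal sketch of the standard discounted return from RL textbooks. It is not taken from $PROMPT's own code: the function name `discounted_return` and the discount factor `gamma` are illustrative assumptions.

```python
from typing import Sequence


def discounted_return(rewards: Sequence[float], gamma: float = 0.99) -> float:
    """Sum of rewards where a reward t steps ahead is weighted by gamma**t.

    Hypothetical helper for illustration; gamma=0.99 is an assumed default.
    """
    if not 0.0 <= gamma <= 1.0:
        raise ValueError("gamma must be in [0, 1]")
    total = 0.0
    # Walk backwards so each step folds in the already-discounted future.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total
```

With `gamma` near 1 the agent weighs distant rewards almost as heavily as immediate ones. With `gamma` near 0 it only cares about the next step.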
Zero-Sum Intelligence
The agent has finite memory. Once that memory is full, every new insight requires forgetting something old. This constraint forces prioritization: real intelligence, not accumulation.
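One way to picture the finite-memory constraint is a fixed-capacity store that drops its weakest entry whenever a stronger one arrives. The sketch below is an assumption for illustration, not the agent's actual implementation: the names `Insight` and `BoundedMemory`, the numeric `priority` score, and the lowest-priority-first eviction rule are all hypothetical.

```python
import heapq
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(order=True)
class Insight:
    priority: float                   # hypothetical importance score
    text: str = field(compare=False)  # excluded from ordering


class BoundedMemory:
    """Fixed-capacity store that forgets the lowest-priority insight when full."""

    def __init__(self, capacity: int) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self._heap: List[Insight] = []  # min-heap: weakest insight at index 0

    def add(self, text: str, priority: float) -> Optional[Insight]:
        """Store an insight and return whichever insight was forgotten, if any."""
        new = Insight(priority, text)
        if len(self._heap) < self.capacity:
            heapq.heappush(self._heap, new)
            return None
        # Memory is full: insert the newcomer and remove the weakest entry.
        # If the newcomer is itself the weakest, it is the one returned.
        return heapq.heappushpop(self._heap, new)

    def contents(self) -> List[str]:
        """Remembered insights, strongest first."""
        return [item.text for item in sorted(self._heap, reverse=True)]
```

For example, with `BoundedMemory(2)` holding insights scored 0.5 and 0.9, adding one scored 0.7 forgets the 0.5 entry. Adding one scored 0.1 leaves memory unchanged, because the newcomer is the weakest candidate.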
Transparent by Design
No black boxes. Every decision, every pruned memory, every updated weight is logged on-chain. The Glass Box shows exactly what the agent is thinking.