Six days ago I set up OpenClaw because I wanted to control my lights and check the weather.
That is not what happened.
Within the first 24 hours the agent had integrated Signal, set up a shared channel with my wife, wired into my calendar, connected to my Sonos speakers, indexed my Google Drive, and started reading my email. I hadn't asked for most of that. I had asked for a morning briefing and it just... kept going. Each capability unlocked the next obvious one. By the end of day one it felt less like setting up a smart home assistant and more like onboarding a very motivated new employee who kept finding more things to fix.
By day two we were scraping Twitter.
By day six we had a live conflict intelligence system monitoring an active military conflict, with a self-healing source network, adversarial AI deliberation, vision-based evidence analysis, a $156 billion war cost ledger, and a full dashboarding and notification stack running as persistent daemons on a Mac mini in my home office.
Here's everything that's running right now.
Why this exists
The Middle East is in active conflict. There's a lot of noise, a lot of adversarial information operations, and a lot of accounts on X that range from authoritative primary sources to pure propaganda. I wanted a system that could tell the difference, track what's actually confirmed, and surface the signal without me having to read 400 tweets a day. I also wanted it to tell me when I actually needed to pay attention versus when things were quiet.
Six days later, that system exists.
How the Twitter pipeline actually works
The naive approach is watch 90 accounts. That's not what we do.
The pipeline starts with a tiered source registry. Every account in the database has a reputation score: tier 0 is staging (unproven, auto-ingested but never auto-inserted into confirmed facts), tier 1 is monitored, tier 2 is trusted, tier 3 is authoritative. Tier assignment is based on track record: how many of their claims corroborate against other sources, what their average confidence score looks like, how often they post original reporting versus relay content.
Beyond the direct watch list, we track retweets. When a tier-2 or tier-3 account retweets something from outside the watch list, that tweet gets pulled in and its author gets evaluated. If they show up enough times, if their content keeps corroborating, they get automatically staged and eventually promoted. The watch list grows itself.
We also maintain dynamic Twitter lists via the X API. Critical tracker accounts get their own private list so the pipeline can pull them in a single API call rather than 25 individual lookups. When a new account gets promoted to a certain tier, it gets added to the list automatically. When an account goes quiet for 60 days or starts posting content that no longer corroborates, it gets flagged for demotion.
Account health runs on a trigger basis, not a polling basis. When the API returns a 403 or 404 for a known account, the drift checker fires. When a tier-1 account suspension is detected, it goes straight to the council and a human ping. Nightly, a promotion candidates function surfaces any tier-0 accounts with enough corroborated facts and sufficient average score to warrant consideration. No automatic promotion for tier-2 and above -- that's a human decision.