Show HN: StackScope – I crawled over 40k indie launches to see what they ship
Posted by datafreak_ 4 days ago
Hey all, I built StackScope, a crawler/catalogue that looks at new product launches and shows what they were built with.
It watches launches from Product Hunt, Show HN, and PeerPush, then crawls the public site behind each one. The goal is to show what people actually launched with: hosting, frameworks, analytics, DNS, security headers, legal pages, AI-builder signals, and other public clues.
I started building it because most stack-detection sites look at the web as a whole. I was more interested in the current indie launch scene: what people are choosing right now, at the point they first put something in public.
A few implementation details: it runs on .NET, uses Playwright for rendered pages, and has a first-party fingerprint catalogue rather than one copied from Wappalyzer/etc. robots.txt is honoured, and the bot identifies itself.
Frustratingly, I am still waiting for verified bot status from Cloudflare and currently that knocks out about 10% of all sites.
There is also a private readiness check: paste a URL, get the same style of report, fix things, and recrawl. No account or email needed.
I'd be interested in feedback on the usefulness of this, the methodology, and any obvious false positives.
Jonathan.
Comments
Comment by pixel_popping 4 days ago
GET https://stackscope.dev/css/stackscope.css?v=qkmEKPTNqImS7aeD... NS_ERROR_CORRUPTED_CONTENT
GET https://stackscope.dev/css/utilities.css?v=l7--cPr2cfIn6iDRB... NS_ERROR_CORRUPTED_CONTENT
GET https://stackscope.dev/css/page-extras.css?v=m-VKj9bQg7fMpFK... NS_ERROR_CORRUPTED_CONTENT
GET https://stackscope.dev/js/site.js?v=SxUp8zU6gnAArRwBXr22wIBS... NS_ERROR_CORRUPTED_CONTENT
Comment by datafreak_ 4 days ago
Edit: loosened a cf rule, should be good now.
Comment by addedlovely 4 days ago
I've been doing the same bit wider scope, for the whole Crux list, pruned to apex domains, and looking for CMS signals - how's your throughput?
I'm not doing any headless browser stuff, or many requests, so hyper optimised for speed.
I do grab robots.txt - didn't really see much in llms.txt or humans.txt in the wild, does yours?
Comment by addedlovely 4 days ago
I'm seeing about 6.6% block rate, but that does climb over time.
Comment by datafreak_ 4 days ago
Comment by schamack 3 days ago
I will have a thorough look on the insights later, as there might either be some small false positives, or my projects still needs some grunt work :).
So from my perspective, being part of the check, I find it extremely useful.
Comment by datafreak_ 3 days ago
Comment by schamack 1 day ago
My app is Tekyous (didn't go through this option to claim the launch): https://stackscope.dev/launch/xsa4cjwn/tekyous
And turns out that actually there were almost no false-positives here at all. Some of the flagged things were rather minor, like missing files that are not really necessary in my case, or too long meta descriptions, but technically all is flagged correctly even if severity is low.
But on the other hand, it has also correctly flagged my tracking settings or overly strict rules for AI crawlers, so this were genuinely useful for me to find out now.
And the only real false-positive is actually a recognised AI builder signature for Bolt.new - not sure where that one comes from, as Bolt.new was not used here. I've only used Claude Code, mostly with Anthropic + GLM by Z.AI models.
Hope that feedback helps you, as I have defnitely found it helpful on my end. Thanks!
Comment by Gabriel_Valente 4 days ago
Overall this looks solid
Comment by datafreak_ 4 days ago
Comment by Gabriel_Valente 4 days ago
Comment by jrhizor 4 days ago
I'm a little surprised you can't see the analysis for StackScope itself!
Also, it'd be interesting to see the trend of vibe score over time.
Comment by datafreak_ 4 days ago
I will include a vibe score trend in the next blog post.
Comment by idid 4 days ago
Fab project otherwise!
Comment by datafreak_ 4 days ago
Comment by 8organicbits 4 days ago
For HSTS, don't forget to check the preload list. Domains under .dev are all preloaded, for example, so they don't need to set the header for HSTS to apply.
Comment by vdev123 21 hours ago
Comment by thetwopct 4 days ago
Comment by datafreak_ 3 days ago
Comment by cpursley 4 days ago
Comment by sandeepkd 4 days ago
Comment by bji9jhff 4 days ago
Comment by tamnv 2 days ago
Comment by lucasgodt 4 days ago
Comment by ourdailycalc 2 days ago