Where the startup data comes from
We aggregate financial metrics from trusted marketplaces and data providers. Every revenue number, growth rate, and customer count is verified through payment processors \u2014 not self-reported.
TrustMRR
10,500+ startupsVerified monthly recurring revenue, growth rates, customer counts, and subscription metrics pulled directly from payment processors. No self-reported guesswork.
Acquire.com
Live marketplaceReal acquisition listings with verified financials. We track asking prices, revenue multiples, and profitability data from startups actively being bought and sold.
Start Story
Founder insightsCurated founder narratives and startup journeys. We extract origin stories, pivots, and growth strategies to understand the human context behind the numbers.
Where the market signals come from
We mine millions of real conversations where users express frustration, request features, compare products, and share workarounds. This is unfiltered market demand.
We monitor thousands of subreddits where users vent frustrations, request features, and share workarounds. This is where unfiltered user pain lives.
Google Search
Search impressions and ranking data reveal what users are actively looking for. High-intent queries like "best alternative to X" signal real demand gaps.
YouTube
Video comments and reviews contain some of the most emotional, detailed user feedback available. Users describe their workflows, pain points, and wishes in depth.
Google Play Store
App reviews are gold mines of feature requests, complaints, and competitive intelligence. Users rate and describe exactly what they need improved.
Forums & Communities
Niche communities, Hacker News, Indie Hackers, and specialized forums where builders and power users discuss tools and their shortcomings.
Web & Blogs
Product comparison articles, review sites, and industry blogs that surface emerging trends and shifts in user preference at scale.
How we turn raw data into your blueprint
From raw crawl to finished report, every step is designed to filter noise and surface only high-conviction opportunities backed by evidence.
Collect
Continuous crawlers and API integrations pull fresh data from all sources. Startup financials are synced daily; market signals are processed continuously.
Clean & Classify
Raw data is deduplicated, normalized, and classified by evidence type: complaint, feature request, comparison, migration, and more.
Vectorize
Every signal and startup is embedded into a multi-dimensional vector space using AI. This enables semantic search — finding related signals by meaning, not just keywords.
Cross-Reference
Emotional signals are matched against verified financial data. A complaint cluster only becomes an opportunity when the market backs it up with real revenue.
Score & Rank
Each opportunity is scored on pain intensity, buying intent, frequency, source diversity, and solution gaps. Only high-conviction signals make the cut.
Generate Blueprint
Your report is built on this foundation: 8 in-depth sections, each grounded in data — not AI hallucinations. Every recommendation traces back to real evidence.
Why you can trust this data
Most AI tools generate ideas from thin air. ShipForge is fundamentally different.
Verified, not self-reported
Revenue and customer data comes directly from payment processors (Stripe, Paddle, etc.) via TrustMRR and Acquire.com. Founders can’t inflate their numbers.
Multi-source cross-validation
A signal only becomes an opportunity when it appears across multiple independent sources. One Reddit post isn’t enough — we need a pattern.
Continuously refreshed
Our crawlers run daily. The startup database syncs every 24 hours. Signals are processed continuously. Your report uses the freshest data available.
Transparent evidence chain
Every recommendation in your report can be traced back to specific signals, sources, and financial data points. We show our work.