How it works

A quick explanation of what you are looking at and how it was built.

The data source

Every startup on the map comes from TrustMRR, a directory of bootstrapped founders who chose to be transparent about their revenue. The MRR numbers are self-reported and verified.

This means the map only shows founders who opted in to transparency. A sparse area does not necessarily mean an untapped opportunity. It could just mean fewer transparent founders built there.

How the clusters are built

Each startup description is run through a language model that converts text into a list of numbers representing its meaning. Two descriptions about similar products will produce similar numbers.

Those numbers are then compressed into two dimensions using a technique called UMAP, which tries to keep similar products close together on a flat plane.

Finally, an algorithm called HDBSCAN identifies dense regions in the resulting cloud of dots and labels them as clusters. The cluster names are generated by a second language model that reads the startups inside each cluster and picks a descriptive label.

How to read the map

Limitations

Explore the map