Models
Open-weights from HuggingFace, commercial APIs from OpenRouter and the major vendors, benchmark scores from the Open LLM Leaderboard and Chatbot Arena. One catalog, the same shape for every row.
Directory
Sort by composite score, parameters, newest, popularity, cheapest output cost, or largest context window. Eight quick-filter chips for the common slices — frontier open-weights, free tiers, multimodal, code-focused, latest releases.
Benchmarks
Each benchmark page links the methodology, surfaces score format (% accuracy / pass@1 / ELO), and flags the source (Open LLM Leaderboard / Artificial Analysis / Vendor-reported / Paper-reported). The trust callout reminds you that public benchmarks are a signal, not the only signal.
What we track per model
Twelve benchmarks tracked, with their source labelled on every score. Vendor-reported scores are flagged so you know what to trust.
Per-million-token input/output cost on commercial APIs, plus a calculator that compares your usage mix against the cheapest alternatives.
Stack up to four models across identity, architecture, benchmarks, pricing, and provenance. Highest score wins each row.
Every license is classified into a risk band (low / medium / high) with commercial-use, attribution, redistribution, and modification flags.
Provenance
Every model's training-data declarations are indexed. Open a model and see the datasets it was trained or fine-tuned on. Open a dataset and see which models picked it up. That bidirectional link is the layer that turns this into provenance, not just listing.
Compare
Add models from the directory or the detail page. The comparison view stacks Identity, Architecture, Benchmarks, Pricing, and Provenance — with the winning value in each row highlighted by a left-border accent and an AI summary card up top.
HuggingFace for open weights, OpenRouter for the commercial pricing + provider catalog.
Largest open registry of model weights, configs, and cards.
Unified pricing + provider catalog for commercial-API models.
Compliance posture
Our architecture is the compliance story. Four commitments that apply to every entry in the catalog.
Every connector hits the source's official API or a structured open feed (DCAT, OAI-PMH, schema.org). We never scrape behind authentication or paywalls.
We index what a dataset or model is — not the bytes themselves. Every page links to the original host; the host stays the source of truth.
Each entry is tagged with its license category and use terms. License risk badges surface non-commercial and restrictive terms before you build on them.
Identified User-Agent. Backoff on errors. Crawl-delay honored. Where a source publishes quotas, we stay well under them.
Benchmarks aren't the whole story
Public benchmarks are subject to contamination, methodology drift, and overfitting. For production decisions we recommend running a private holdout evaluation on prompts that match your actual workload. Datacrawlr is the place you start that decision — not the place it ends.
Open the catalog and start finding what you actually need.