- TITLE_MISSING
- TITLE_TOO_LONG
- TITLE_TOO_SHORT
- +19 more
Audit the web
that AI reads.
Scannica is a desktop auditor with 130+ checks across SEO, AI-readiness, accessibility, security, performance, GDPR, and best practices. The crawl runs on your machine. Nothing is uploaded.
- Categories
- 7
- Rules
- 130+
- Uploaded
- 0B
- Warning
GEO_NO_LLMS_TXTsite-wide - Warning
GEO_NO_AUTHOR_ENTITY47 pages - Warning
GEO_LOW_ENTITY_DENSITY38 pages - Notice
GEO_NO_FAQ_SCHEMA12 pages
- Error
SEC_CSP_UNSAFE_EVAL/checkout - Error
SEC_COOKIE_NO_HTTPONLYsession.id - Warning
SEC_HSTS_MISSINGsite-wide
Seven categories. 81+ rules. One of them, your auditor isn't running.
Every finding carries a stable rule code, a severity, a description, and where
applicable a CWE or WCAG reference. The codes match what the app exports, ships in
tickets, and writes into .scannica bundles.
- PERF_LARGE_PAGE
- PERF_HUGE_PAGE
- PERF_DOM_SIZE
- +6 more
- IMG_ALT_MISSING
- HEADING_SKIP
- FORM_LABEL_MISSING
- +6 more
- BP_NO_DOCTYPE
- BP_NO_CHARSET
- BP_CONSOLE_LOG
- +8 more
- SEC_NOT_HTTPS
- SEC_MIXED_CONTENT_ACTIVE
- SEC_MIXED_CONTENT_PASSIVE
- +12 more
- GDPR_NO_CONSENT_BANNER
- GDPR_TRACKING_WITHOUT_CONSENT
- GDPR_THIRD_PARTY_FONTS
- +3 more
- GEO_LOW_ENTITY_DENSITY
- GEO_NO_ARTICLE_SCHEMA
- GEO_NO_AUTHOR_ENTITY
- GEO_NO_FAQ_SCHEMA
- GEO_NO_LLMS_TXT
- GEO_SPEAKABLE_MISSING
- GEO_NO_CITE_ELEMENT
- GEO_NO_CITE_SIGNALS
- GEO_NO_EXTERNAL_CITATIONS
The new SEO is the AI search era. Scannica audits how well your pages are read by Google's AI Overviews, ChatGPT, Perplexity — entity density, citation markup, llms.txt, FAQ + Article schema. No incumbent ships these.
9 rulesYour crawls aren't anyone else's product.
Cloud auditors built a business on uploading your clients' HTML and renting it back to you by the URL. Scannica isn't an SDK or a thin wrapper around a remote API — the crawler is the binary. Results sit in a local SQLite database. Nothing is phoned home. There's no Scannica server to phone home to.
- Where the data livesTheir database. They retain the right to look at it.A .scannica file on your disk. ZIP of SQLite + config. You can email it.
- Outbound trafficEvery URL, every HTML response, every header — uploaded.Only the HTTP requests your crawl makes to the site you're auditing.
- Crawl ceilingPer-URL credit, per-month credit, per-seat credit — pick the tax.Free 500 URLs · Pro 100K · Enterprise unlimited. Capped on what your hardware can hold.
- SpeedBottlenecked on shared crawl workers and your upload link.Rust workers + SQLite WAL. Saturates your bandwidth, not their queue.
- Pause and resumeCrawl runs in their pipeline. You watch a progress bar.Pause mid-crawl, close the laptop, resume on the train. Session state persists.
- OfflineDoesn't work. Even reading old reports needs their server.Open last quarter's .scannica file on a plane. Works fine.
- TelemetryApp analytics, error reporting, session replay — opt-out at best.None. There is no Scannica server to phone home to.
Dense. Keyboard-first. Built like a tool, not a funnel.
Built for SEOs who already know what they're looking for. Nothing stands between you and the finding. Every screen is filterable, sortable, exportable. One click ships a finding to Jira, GitHub, ClickUp, or Monday.
Audit for the retrieval-augmented web.
The retrieval-side surface — Google AI Overviews, ChatGPT browsing, Perplexity,
Copilot, Claude search — reads HTML differently from a 2015 spider. Entity density,
author markup, citation signals, schema breadth, llms.txt.
Things no incumbent auditor checks. Scannica does.
- 01
llms.txt missing
GEO_NO_LLMS_TXTOpenAI, Anthropic, Perplexity look here for crawl preferences. Without it, AI crawlers default to whatever robots.txt says — which usually says yes.
- 02
No author entity markup
GEO_NO_AUTHOR_ENTITYAI Overviews lean on byline + Schema.org Person markup to attribute and trust a passage. A page with no author entity is harder to cite.
- 03
Low entity density
GEO_LOW_ENTITY_DENSITYPages thick with named entities — products, people, places, dates — are easier for retrieval-augmented systems to index against a query.
- 04
FAQ schema absent
GEO_NO_FAQ_SCHEMAFAQ + Article structured data are the cleanest way for an LLM to extract a Q&A pair verbatim. Without it, the model has to guess.
- 05
Article schema absent
GEO_NO_ARTICLE_SCHEMAArticle markup gives the engine date, author, body — the metadata that ends up in the citation card next to your answer.
- 06
No <cite> element
GEO_NO_CITE_ELEMENTPages that already cite their own sources signal credibility. Engines that cite *you* prefer pages that *cite others*.
- 07
No outbound citations
GEO_NO_EXTERNAL_CITATIONSAn island page with zero outbound links to authoritative sources reads, to a model, like an unverifiable claim.
- 08
No citation context
GEO_NO_CITE_SIGNALSEven with outbound links, lack of in-text citation cues ("per", "according to", "source:") makes machine extraction noisier.
- 09
Speakable missing
GEO_SPEAKABLE_MISSINGFor voice + assistant surfaces, the SpeakableSpecification schema marks the passage that should be read aloud.
Five jobs. One install.
The same crawl + catalog + report shape adapts to whatever role you're running it under. No separate "dev" SKU vs "marketing" SKU.
- 01 In-house SEOsSEOPerformance
"Run a weekly crawl, catch regressions before the VP of Marketing does."
Smart Report leads with deltas since last crawl. Filter by category, severity, or affected URL. Open last quarter's audit from the .scannica file on your disk.
See the catalog - 02 SEO agenciesSEOGEO Readiness
"Audit five client sites without touching anyone's billing dashboard."
Each project is its own .scannica file. Share with the client, archive with the engagement. No per-seat tax, no upload, no "sorry, we don't have credits left this cycle".
See the catalog - 03 Migration leadsSEOBest Practices
"Ship a replatform without torching organic traffic."
Pre-migration baseline. Post-launch crawl. Sitemap diff highlights pages that fell out of the index. Redirect chain analysis surfaces loops on day one.
See the catalog - 04 Dev teamsSecurityBest Practices
"Get actionable tickets, not a 400-page PDF."
Every finding ships with a stable rule code, severity, description, and (where relevant) a CWE or WCAG reference. One click ships the issue to Jira, GitHub, ClickUp, or Monday.
See the catalog - 05 A11y & compliance auditorsAccessibilityGDPR
"Prove to the client that a site doesn't fail WCAG and doesn't leak PII."
Full Accessibility category (alt text, headings, contrast, ARIA, landmarks). GDPR category checks consent banners, third-party fonts, fingerprinting risk, privacy notices on forms.
See the catalog
Honest caps. No surprise meter.
Pricing isn't finalized — Scannica is pre-1.0. The shape of the tiers, however, is. Free runs every category check on a small site. Pro unlocks GEO, JS rendering, sitemap diff, exports, and integrations. Enterprise removes the caps.
Free
Try the crawl loop end-to-end.
- Crawl cap
- 500 URLs
- Audits
- 10 pages
- Saving
- —
Pro
· primaryThe full kit for working SEO and dev teams.
- Crawl cap
- 100K URLs
- Audits
- 1K pages
- Saving
- .scannica file
Enterprise
Volume crawls and license management.
- Crawl cap
- Unlimited
- Audits
- Unlimited
- Saving
- Unlimited
* Pricing TBA. Pre-1.0 invite-only — request access via the download section below.
Findings, where the work already lives.
The audit isn't done when the report is generated — it's done when somebody fixes the page. One click pushes a finding (with rule code, severity, description, the offending URL) into the tracker the team already opens every morning.
- J Pro+JiraCreate issues from findings
- G Pro+GitHubOpen tickets in any repo
- C Pro+ClickUpPush to project tasks
- M Pro+MondayDrop into the right board
- S Pro+Search ConsolePair crawl data with GSC
Pre-1.0. Invite list, not a download button.
Scannica is in a closed pre-release. Builds for macOS, Windows, and Linux exist; we're shipping access in batches as the rule catalog stabilises and the licensing layer hardens. Drop your email and we'll send a build, the install instructions, and a line about what's still in flux.