Skip to content
Free Tool Arena

How-To & Life · Guide · Writing & Content

How to analyze keyword density

What keyword density is (and isn't) worth, 1-2% myths, LSI and related terms, catching over-optimization, and tying density to search intent.

Updated April 2026 · 6 min read

Keyword density is the percentage of a page’s total words that match a target term. It was once the single most discussed metric in SEO, the axis on which whole chapters of Google’s 2003 ranking algorithm turned. A decade of deliberate stuffing penalties, the Panda update, BERT, and the shift to semantic search has stripped density of most of its direct ranking power. But it remains useful as a diagnostic: it tells you whether a page is actually about what you think it is about, whether you are under-serving the primary intent, or whether the language has drifted into padding. This guide covers the basic calculation, historical versus modern SEO use, penalties for overuse, stemming and variation handling, what “natural frequency” means, and where LSI keywords actually fit.

Advertisement

The basic calculation

Keyword density is (occurrences of the keyword / total words) × 100. A 1,000-word article that mentions “email marketing” 15 times has an exact-match density of 1.5 percent. For a single-word keyword the counting is simple. For multi-word phrases the numerator stays the count of phrase occurrences, and the denominator stays total words, which dilutes density because one occurrence of a two-word phrase still counts as one in the numerator but consumes two slots in the denominator. Most density tools report the phrase count over the word count without adjusting, which is the convention.

Text:     1,000 words total
Phrase:   "email marketing" appears 15 times

Density = (15 / 1000) * 100 = 1.5%

Historical SEO: what density meant

In the first decade of search, keyword density was a strong ranking signal. Pages with densities in the 2-5 percent range for a target term outranked pages that mentioned the term only in passing. This led to a decade of over-optimization: thin pages stuffed with target keywords, doorway pages built to rank for single terms, and footer-text gardens designed purely to hit a density threshold. The early search engines treated high density as a positive signal because the alternative was worse—ignoring the term entirely.

Modern SEO: what density means now

Google’s 2011 Panda update, 2013 Hummingbird rewrite, and 2019 BERT and 2022 MUM-style language models have all moved ranking away from word-frequency matching toward semantic understanding. BERT can tell that a page about “running shoes” and a page about “athletic footwear for runners” are about the same topic even without identical keywords. Density is no longer the direct lever it once was. But it is still a useful diagnostic: if your page about “email marketing” never uses the phrase, or uses it only in the title, the page may be less focused than you think.

Stuffing penalties

Google’s spam policies explicitly list keyword stuffing as a violation. The detection is not tied to a precise density threshold—it looks for patterns that suggest mechanical insertion rather than natural writing. Repeating the keyword in every sentence, inserting it into irrelevant places, or filling footer text and alt attributes with variants all trigger flags. A page with 5 percent density written naturally can rank fine; a page with 2 percent density that reads like robot-generated padding can get suppressed. The modern rule: write for readers, let density follow.

Natural frequency

Natural frequency is the density you get when a knowledgeable writer addresses the topic without thinking about density at all. It varies by topic. Technical content about a specific product mentions the product name often—2 to 3 percent is common and normal. A broader article about a category mentions the category term less often because the writer uses pronouns, synonyms, and partial references. If your target density for a term is wildly above or below what a human writer would produce naturally on that topic, the density is signaling a problem with the writing.

Stemming and partial matches

A density tool that counts only exact matches under-reports the real prominence of a topic. “Email marketing”, “email marketer”, and “marketing emails” all signal the same topic to a modern search engine. Stemming collapses inflected forms (marketing, marketer, markets) to a common root. Lemmatization is stricter and maps word forms to dictionary headwords. Most density tools offer exact-match by default and stemmed-match as a toggle. For SEO analysis, stemmed counts are usually more honest.

LSI and semantic terms

“LSI keywords” is an SEO term borrowed loosely from Latent Semantic Indexing, a 1990s information-retrieval technique. Google does not use actual LSI in its production ranking, despite the name’s popularity in SEO content. What Google does use is topic modeling via transformer-based language models, which recognize that a page about “cameras” should probably mention “lens”, “shutter”, “aperture”, and “exposure”. These are often called LSI keywords in SEO tools but are better described as semantically related terms or co-occurring topic terms. Checking that your page covers the terms your competitors cover is more useful than hitting a density threshold on the primary keyword.

Competitor density analysis

Before optimizing your density, measure what top-ranking competitors use. Pull the top five organic results for your target query, strip navigation and boilerplate, and compute density for each. The median tells you what density Google considers appropriate for that query. If your page sits far outside that range (either much higher or much lower), the content is probably an outlier in either information density or relevance. Match the competitive baseline before trying to exceed it.

Density by document zone

Word position matters more than raw density. A keyword in the title, H1, first paragraph, URL, and first image alt attribute signals topic more strongly than the same keyword repeated ten times in a sidebar. Modern density analysis should weight different zones differently, or at least report density per zone (title, H1, intro, body, footer). An unfocused page with the right density in the wrong places underperforms a focused page with lower density where it counts.

Common mistakes

Targeting a fixed density number. “I want exactly 2 percent” leads to stilted writing with the keyword inserted mechanically. Write for the reader and measure density as a diagnostic.

Ignoring phrase variations. Counting only exact matches misses “marketing emails” when you are measuring “email marketing”. Use stemmed or lemmatized counts for an honest picture.

Counting everything on the page. Navigation, footer, sidebar, and boilerplate dilute or inflate the density measurement. Analyze the main content area only, as a search engine would.

Stuffing alt text and meta tags. Image alt attributes and meta descriptions full of keyword variants are an old pattern that still triggers penalties. Write alt text that describes the image; write descriptions that describe the page.

Ignoring anchor text. Internal and inbound anchor text counts toward how search engines associate a page with topics. Heavy anchor-text repetition with the same exact phrase is a signal of manipulation.

Forgetting to measure after edits. Adding a single section or rewriting the intro can shift density by half a percent. Re-measure after significant edits to verify the page still focuses where you intended.

Treating LSI keywords as magic. Dropping in a list of related terms does not substitute for writing about the topic thoroughly. Related terms should appear because you covered the topic, not because a tool said they should.

Run the numbers

Measure keyword density, phrase frequency, and term distribution with the keyword density checker. Pair with the readability score checker to make sure optimizing density has not made the prose harder to read, and the meta description length checker to confirm the key terms make it into the snippet that shows in search results.

Advertisement

Found this useful?Email