PostgreSQL Indexes Explained — B-Tree, GIN, BRIN, GiST

Key Takeaways

PostgreSQL supports six index access methods — B-tree, Hash, GiST, SP-GiST, GIN, and BRIN — each optimized for a different data shape and query pattern.
A BRIN index can be roughly 4,000x smaller than an equivalent B-tree on a 10-million-row, naturally-ordered table (CYBERTEC PostgreSQL).
When a query is estimated to return more than roughly 10% of a table's rows, the planner usually picks a sequential scan over an index scan.
CREATE INDEX CONCURRENTLY builds an index without blocking writes — the standard choice for adding an index to a live production table.

Magnifying glass placed on top of a printed document representing how a PostgreSQL index narrows a search to matching rows instead of scanning every page — An index narrows the search before PostgreSQL ever touches the table's actual rows. (Photo: Pexels)

What Is a PostgreSQL Index?

A PostgreSQL index is a separate on-disk structure that stores a sorted or otherwise organized copy of one or more column values, plus a pointer back to the matching table row. Instead of reading every row in sequence, called a sequential scan, the query planner can walk the much smaller index structure and jump directly to matching rows.

Unlike MySQL's InnoDB, PostgreSQL doesn't store table rows inside the primary key structure. Every PostgreSQL index, including one on the primary key, is a separate structure that points back into the table's heap. That's an architectural difference worth knowing if you're moving between the two engines — see the MySQL vs PostgreSQL comparison for more.

PostgreSQL's own documentation lists six built-in index access methods: B-tree, Hash, GiST, SP-GiST, GIN, and BRIN. Each targets a different indexable clause — equality, range, containment, proximity, or physical correlation — and picking the wrong one either wastes storage or leaves the planner unable to use the index at all.

PostgreSQL implements six index access methods — B-tree, Hash, GiST, SP-GiST, GIN, and BRIN — each suited to a different class of indexable operator (PostgreSQL Documentation, Index Types). Unlike MySQL's InnoDB, PostgreSQL never stores table data inside an index structure; every index, including on the primary key, is a separate structure pointing back to the table heap.

Index Types: B-Tree, GIN, GiST, and BRIN

CREATE INDEX without a USING clause defaults to B-tree, and that default covers most schemas. The other three types below solve specific problems B-tree can't handle efficiently.

Index type	Best for	Weak point
`B-tree`	Equality and range queries on sortable scalar data — the default for most columns	Not designed for multi-valued columns like JSONB or arrays
`GIN`	JSONB containment, array membership, full-text search (`tsvector`)	Slower to write; index maintenance cost is higher per row
`GiST`	Range types, geometric/spatial data (PostGIS), nearest-neighbor queries	Lossy for some operator classes, requiring a recheck of the actual row
`BRIN`	Very large, append-only tables where the column correlates with row insertion order	Only fast when data is physically ordered; useless on shuffled data

GIN stores one index entry per element rather than one per row — a JSONB document with 10 keys produces roughly 10 index entries. That's what makes containment queries like WHERE data @> '{"status": "active"}' fast, but it also means every write has to update potentially many entries, which is why GIN indexes are noticeably slower to maintain than B-tree.

BRIN takes the opposite approach: instead of indexing every row, it stores only a min/max summary for each range of physical table pages (128 pages by default). A query can skip entire page ranges whose summary doesn't overlap the search condition, without ever pointing to an individual row. That's why BRIN indexes are tiny compared to B-tree, but only useful when the indexed column's values roughly track the order rows were inserted — a created_at timestamp on an append-only log table is the textbook case.

Index size on a 10-million-row, naturally-ordered table

The size gap looks like a pure win for BRIN, but it comes with a real trade-off: BRIN only stores page-range summaries, so a lookup still has to re-check every row inside a matching range. On data that isn't naturally ordered, that recheck cost erases the benefit entirely — BRIN is a bet on physical correlation, not a universal replacement for B-tree.

On a 10-million-row table with values that correlate with physical insertion order, a B-tree index measured roughly 214 MB while an equivalent BRIN index measured roughly 48 KB — about 4,000 times smaller (CYBERTEC PostgreSQL). BRIN achieves this by storing only a summary per block range rather than an entry per row, so it works only when the indexed column tracks the table's physical row order.

Creating and Managing Indexes

The base syntax is the same across all six index types — only the USING clause changes.

Default B-tree:

CREATE INDEX idx_orders_user ON orders (user_id);

GIN, for a JSONB column:

CREATE INDEX idx_orders_metadata ON orders USING GIN (metadata);

BRIN, for an append-only timestamp column:

CREATE INDEX idx_events_created_at ON events USING BRIN (created_at);

Unique index:

CREATE UNIQUE INDEX idx_users_email ON users (email);

On a live production table, always add CONCURRENTLY:

CREATE INDEX CONCURRENTLY idx_orders_status ON orders (status);

A plain CREATE INDEX takes a lock that blocks writes for the entire build. CONCURRENTLY builds the index without that lock, at the cost of roughly double the build time and a requirement that it run outside a transaction block. If the build fails partway through, it can leave behind an invalid index — check pg_index.indisvalid and drop it before retrying.

PostgreSQL indexes - B-Tree, GIN, BRIN. What's the difference? (YouTube, 2025)

Every PostgreSQL index type shares the same CREATE INDEX ... USING method (column) syntax, defaulting to B-tree when USING is omitted. CREATE INDEX CONCURRENTLY avoids the write-blocking lock a normal index build takes, making it the standard approach for adding indexes to tables already serving production traffic (PostgreSQL Documentation).

Composite, Partial, and Covering Indexes

A composite index spans multiple columns and follows the same leftmost-prefix rule as MySQL: an index on (user_id, status, created_at) serves queries filtering on user_id alone, on user_id and status, or on all three — but not on status or created_at alone, since neither is a leftmost prefix of the sorted structure.

CREATE INDEX idx_orders_lookup ON orders (user_id, status, created_at);

PostgreSQL adds two refinements MySQL doesn't have natively: partial indexes and covering indexes via INCLUDE. A partial index only indexes rows matching a WHERE condition, which keeps the index small when you only ever query a subset of rows:

CREATE INDEX idx_orders_pending ON orders (created_at)
WHERE status = 'pending';

A covering index adds extra columns purely for retrieval, without making them part of the sort key, letting the planner satisfy a query entirely from the index without touching the table heap (an index-only scan):

CREATE INDEX idx_orders_covering ON orders (user_id)
INCLUDE (status, total);

Partial indexes are consistently underused in production schemas. A queue table where 95% of rows are already completed and only the pending rows are ever queried is exactly the case where a partial index cuts both the index size and the write overhead dramatically, since completed rows never enter the index at all.

A partial index in PostgreSQL indexes only the rows matching a WHERE predicate specified at creation time, reducing both index size and write maintenance cost when queries only ever target a known subset of rows. An INCLUDE clause adds non-key columns to an index purely for retrieval, enabling index-only scans that never touch the table heap.

When PostgreSQL Chooses a Sequential Scan

Having an index doesn't guarantee the planner uses it. PostgreSQL's cost-based optimizer estimates the cost of a sequential scan against the cost of an index scan for each query and picks whichever is cheaper — and for a large fraction of matching rows, a sequential scan usually wins.

An index scan is fastest when a query needs only a small fraction of a table's rows, since it avoids reading the rest of the table entirely. Once a query is estimated to return roughly 10% or more of a table's rows, the sequential scan's simpler, more predictable I/O pattern typically outperforms the random-access pattern of following an index across that many matches.

Small tables see the same effect for a different reason: if a table fits in a handful of pages, PostgreSQL just reads all of them, since the overhead of consulting an index isn't worth it. Why keep an index at all if the optimizer routinely ignores it? Because the row estimate that triggers a sequential scan today changes as the table grows — an index that looks unnecessary on a 500-row table becomes essential once that table hits 5 million rows.

PostgreSQL's planner favors a sequential scan over an index scan once the estimated result set approaches roughly 10% of a table's total rows, since sequential I/O has lower per-row overhead than following an index across a large fraction of the table (Crunchy Data, How to Read Postgres EXPLAIN). The same threshold effect applies to small tables, where a full scan of a few pages beats the overhead of an index lookup entirely.

Reading EXPLAIN ANALYZE Output

EXPLAIN shows the planner's chosen execution plan without running the query; EXPLAIN ANALYZE actually executes it and reports real elapsed time per plan node alongside the planner's estimate.

EXPLAIN ANALYZE
SELECT * FROM orders WHERE user_id = 42 AND status = 'shipped';

Look for Seq Scan versus Index Scan (or Bitmap Index Scan, used when the planner expects a moderate number of matches) in the plan's top line. A large gap between the estimated row count and the actual row count usually means the table's statistics are stale — run ANALYZE table_name; to refresh them.

To find indexes nobody's querying against, check pg_stat_user_indexes:

SELECT relname AS table_name, indexrelname AS index_name, idx_scan
FROM pg_stat_user_indexes
WHERE idx_scan = 0
ORDER BY relname;

An idx_scan of zero means the index has never been used by the planner since the statistics were last reset. As with MySQL's sys.schema_unused_indexes, confirm the observation window is long enough — check stats_reset in pg_stat_database — before dropping anything a monthly report query might still depend on.

The pg_stat_user_indexes system view exposes an idx_scan counter per index, incremented every time the planner uses that index to answer a query; a value of zero across a representative time window flags the index as a pure write-overhead candidate for removal (PostgreSQL Documentation, Cumulative Statistics System).

Common Mistakes

Using B-tree on a JSONB column you query for containment. A default B-tree index on a JSONB column can only support equality on the whole value. Use GIN for @>, ?, and similar containment operators.
Skipping CONCURRENTLY on a production table. A plain CREATE INDEX blocks all writes to the table for the duration of the build, which on a large table can mean minutes of blocked INSERT and UPDATE statements.
Choosing BRIN for shuffled data. BRIN only pays off when the indexed column correlates with physical row order. Applying it to a randomly-ordered column, like a UUID primary key, produces an index that can't meaningfully narrow a search.
Wrong column order in a composite index. Just like MySQL, the leftmost column should match your most common filter condition. An index on (status, user_id) gives little benefit to queries that mostly filter on user_id.
Never checking for unused indexes. Every index adds write overhead with zero benefit if the planner never selects it. Review pg_stat_user_indexes periodically, the same discipline covered in the MySQL indexes guide for sys.schema_unused_indexes.

Boxes organized on warehouse shelves representing how a BRIN index summarizes ranges of physically ordered table pages instead of indexing every row — A BRIN index works like labeling warehouse aisles instead of every box — fast to scan, but only useful if items are actually stored in order. (Photo: Pexels)

Visualise Indexes Before Writing DDL

Index type decisions get easier once your schema's actual query patterns are visible. Marking which columns are foreign keys, which hold JSONB, and which are append-only timestamps is a design-time decision that's far clearer on a diagram than buried in a migration file.

You can design your schema visually and export the correct PostgreSQL DDL, including primary keys and foreign key indexes, straight from the diagram. See the PostgreSQL data types guide for choosing the right column type before you decide which index method fits it, and the database schema examples post for complete indexed schemas across common application types.

Frequently Asked Questions

What is the default index type in PostgreSQL?

B-tree. Running CREATE INDEX without specifying USING creates a B-tree index, which handles equality and range queries on sortable data and covers the large majority of indexing needs — primary keys, foreign keys, and most WHERE clause lookups.

When should I use a GIN index instead of B-tree?

Use GIN for columns holding multiple values per row that you search inside: JSONB documents, text arrays, and full-text search tsvector columns. GIN stores one index entry per element rather than per row, making it fast for containment queries but slower to write than a B-tree.

What is a BRIN index good for in PostgreSQL?

BRIN indexes suit very large, append-only tables where the indexed column correlates with physical row order, such as a created_at column on a time-series table. A BRIN index can be roughly 4,000 times smaller than an equivalent B-tree on a 10-million-row table, at the cost of less precise row filtering.

How do I create an index without locking the table in PostgreSQL?

Use CREATE INDEX CONCURRENTLY. A plain CREATE INDEX takes a lock that blocks writes to the table for the build's duration. CONCURRENTLY builds the index in the background without blocking INSERT, UPDATE, or DELETE, at the cost of a longer build time and the requirement to run it outside a transaction block.

How do I find unused indexes in PostgreSQL?

Query pg_stat_user_indexes and look for rows where idx_scan equals zero. Confirm the statistics have accumulated over a representative period first, since a freshly reset counter or a recently restarted server will show every index as unused regardless of whether it's actually needed.

PostgreSQL Indexes Explained — B-Tree, GIN, BRIN, and GiST