PostgreSQL Interview Questions: Answers, Coding Prep & FAQs

01

What is PostgreSQL?

PostgreSQL is an open-source object-relational database known for correctness, SQL standards support, extensibility, strong transactions, advanced indexing, JSONB, full-text search, partitioning, replication, and mature operational features. It is often chosen when teams need reliability, complex queries, and strong data integrity.

02

What is MVCC in PostgreSQL?

MVCC, or Multi-Version Concurrency Control, lets readers and writers work concurrently by keeping multiple row versions. Readers see a consistent snapshot without blocking ordinary writers, and writers create new row versions instead of overwriting rows in place. MVCC improves concurrency but creates dead tuples that vacuum must clean.

03

What is a transaction in PostgreSQL?

A transaction groups SQL statements so they commit together or roll back together. PostgreSQL transactions provide ACID behavior: atomicity, consistency, isolation, and durability. Use transactions for workflows such as transferring funds, creating orders, or updating several related tables.

Example

BEGIN;
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
UPDATE accounts SET balance = balance + 100 WHERE id = 2;
COMMIT;

04

What are PostgreSQL isolation levels?

PostgreSQL supports Read Committed, Repeatable Read, and Serializable. Read Committed is the default and each statement sees a fresh committed snapshot. Repeatable Read keeps one snapshot for the transaction. Serializable provides the strongest isolation and may abort transactions to prevent anomalies.

Example

BEGIN TRANSACTION ISOLATION LEVEL SERIALIZABLE;
-- read and write data
COMMIT;

05

What is a deadlock in PostgreSQL?

A deadlock happens when transactions wait on each other in a cycle, so none can proceed. PostgreSQL detects deadlocks and aborts one transaction. Prevent them by locking rows in consistent order, keeping transactions short, and avoiding unnecessary locks.

06

What is EXPLAIN ANALYZE?

EXPLAIN shows the query plan PostgreSQL expects to use. EXPLAIN ANALYZE actually runs the query and reports real timing and row counts. It is essential for diagnosing slow queries because it reveals scans, joins, estimates, sort operations, and whether indexes are used.

Example

EXPLAIN ANALYZE
SELECT * FROM orders WHERE customer_id = 42;

07

What is a B-tree index?

B-tree is the default PostgreSQL index type. It works well for equality, range searches, ordering, and many common WHERE clauses. It is commonly used for primary keys, foreign keys, timestamps, status fields, and lookup columns.

Example

CREATE INDEX idx_orders_customer_id ON orders(customer_id);

08

What is a composite index?

A composite index includes multiple columns. Column order matters because PostgreSQL can use the leftmost prefix effectively. For example, an index on (customer_id, created_at) helps queries filtering by customer_id and ordering or filtering by created_at.

Example

CREATE INDEX idx_orders_customer_created
ON orders(customer_id, created_at DESC);

09

What is a partial index?

A partial index indexes only rows matching a predicate. It is useful when queries frequently target a subset, such as active users, unpaid invoices, or non-deleted records. Partial indexes are smaller and faster when the predicate matches query patterns.

Example

CREATE INDEX idx_users_active_email
ON users(email)
WHERE deleted_at IS NULL;

10

What is an expression index?

An expression index indexes the result of an expression instead of a raw column. It is useful for case-insensitive lookups, date extraction, computed keys, or normalized values. The query must use a matching expression for the index to help.

Example

CREATE INDEX idx_users_lower_email ON users (lower(email));

SELECT * FROM users WHERE lower(email) = 'a@example.com';

11

What are GIN indexes used for?

GIN indexes are useful for composite values where a row can contain many searchable elements, such as JSONB, arrays, and full-text search vectors. They are powerful for containment and membership queries but can be larger and more expensive to update than B-tree indexes.

Example

CREATE INDEX idx_events_payload_gin ON events USING gin(payload);

12

What are GiST indexes used for?

GiST indexes are generalized search tree indexes used for data types and operators such as geometric data, ranges, nearest-neighbor searches, and extensions like PostGIS. They are flexible but should be chosen based on operator support and query patterns.

13

Why might PostgreSQL not use an index?

PostgreSQL may skip an index if the table is small, the predicate is not selective, statistics are stale, the query expression does not match the index, the data type cast prevents usage, or a sequential scan is estimated cheaper. Use EXPLAIN ANALYZE before assuming the planner is wrong.

14

What are primary keys and foreign keys?

A primary key uniquely identifies rows in a table. A foreign key enforces that a value references an existing row in another table. Foreign keys protect data integrity and make relationships explicit, but they also add checks during writes.

Example

CREATE TABLE orders (
  id bigserial PRIMARY KEY,
  customer_id bigint NOT NULL REFERENCES customers(id)
);

15

What are constraints in PostgreSQL?

Constraints enforce rules in the database, such as NOT NULL, UNIQUE, CHECK, PRIMARY KEY, FOREIGN KEY, and EXCLUDE. They protect data even when multiple applications or scripts write to the same database.

Example

ALTER TABLE products
ADD CONSTRAINT price_positive CHECK (price > 0);

16

What is UPSERT in PostgreSQL?

UPSERT inserts a row or updates an existing row when a uniqueness conflict occurs. PostgreSQL implements it with INSERT ... ON CONFLICT. It is useful for idempotent writes, counters, sync jobs, and external identifiers.

Example

INSERT INTO users(email, name)
VALUES ('a@example.com', 'Asha')
ON CONFLICT (email)
DO UPDATE SET name = EXCLUDED.name;

17

What does RETURNING do?

RETURNING returns values from inserted, updated, or deleted rows. It avoids an extra SELECT after writes and is commonly used to get generated IDs, timestamps, or updated values.

Example

INSERT INTO posts(title)
VALUES ('Hello')
RETURNING id, created_at;

18

What are Common Table Expressions?

Common Table Expressions, or CTEs, define named temporary result sets using WITH. They improve readability for complex queries and recursive logic. In modern PostgreSQL, simple CTEs may be inlined, but materialization behavior can still matter for performance.

Example

WITH recent_orders AS (
  SELECT * FROM orders WHERE created_at >= now() - interval '7 days'
)
SELECT customer_id, count(*)
FROM recent_orders
GROUP BY customer_id;

19

What are window functions?

Window functions calculate values across related rows without collapsing the result like GROUP BY. They are useful for ranking, running totals, previous or next row comparisons, and per-group analytics.

Example

SELECT
  customer_id,
  total,
  row_number() OVER (PARTITION BY customer_id ORDER BY created_at DESC) AS rn
FROM orders;

20

What is JSONB in PostgreSQL?

JSONB stores JSON data in a binary format that supports indexing and efficient querying. It is useful for flexible attributes, event payloads, and integration data. Do not use JSONB as a replacement for relational modeling when fields need strong constraints and joins.

Example

SELECT * FROM events
WHERE payload @> '{"type":"signup"}'::jsonb;

21

When should you use arrays in PostgreSQL?

Arrays can model small lists of scalar values, such as tags or flags, when relational joins are not needed. Avoid arrays when individual elements need foreign keys, frequent updates, complex querying, or independent lifecycle management.

Example

SELECT * FROM posts WHERE 'postgres' = ANY(tags);

22

What are views?

A view is a stored query exposed like a table. Views simplify repeated queries, hide complexity, and provide stable interfaces. They do not store data by default, so performance depends on the underlying query.

Example

CREATE VIEW active_users AS
SELECT id, email FROM users WHERE deleted_at IS NULL;

23

What are materialized views?

A materialized view stores query results physically. It can speed expensive reports or dashboards, but it becomes stale until refreshed. Use it when data freshness requirements tolerate refresh intervals.

Example

CREATE MATERIALIZED VIEW daily_sales AS
SELECT date_trunc('day', created_at) AS day, sum(total) AS revenue
FROM orders
GROUP BY 1;

REFRESH MATERIALIZED VIEW daily_sales;

24

What are stored functions and procedures?

Stored functions and procedures run logic inside the database. They can centralize data-heavy operations and reduce round trips, but overusing them can hide business logic, complicate testing, and couple application behavior tightly to the database.

25

What are triggers in PostgreSQL?

Triggers run automatically in response to table events such as INSERT, UPDATE, or DELETE. They are useful for audit logs, denormalized counters, and invariants. They can also surprise developers because side effects happen outside visible application code.

26

What is table partitioning?

Partitioning splits a logical table into smaller physical partitions, commonly by range, list, or hash. It helps manage large tables, retention, and query pruning when predicates match the partition key. Poor partition strategy can add complexity without improving performance.

Example

CREATE TABLE events (
  id bigserial,
  created_at date NOT NULL,
  payload jsonb
) PARTITION BY RANGE (created_at);

27

What is VACUUM in PostgreSQL?

VACUUM cleans up dead row versions created by MVCC so space can be reused and table bloat is controlled. VACUUM also helps prevent transaction ID wraparound. Regular vacuuming is essential for write-heavy databases.

28

What is autovacuum?

Autovacuum automatically runs vacuum and analyze based on table activity. If it is too weak for a write-heavy workload, tables can bloat, queries can slow down, and wraparound risk can grow. Monitor autovacuum activity, dead tuples, and table bloat.

29

What is ANALYZE in PostgreSQL?

ANALYZE collects table statistics used by the query planner. Stale or inaccurate statistics can cause bad query plans. Autovacuum usually runs analyze automatically, but manual ANALYZE can help after large data changes.

Example

ANALYZE orders;

30

What is connection pooling?

Connection pooling reuses database connections instead of opening one per request. PostgreSQL connections are relatively heavy, so high-traffic apps often use PgBouncer or application pools. Pool size should match database capacity and workload, not simply the number of web requests.

31

What is WAL in PostgreSQL?

WAL, or Write-Ahead Logging, records changes before they are applied to data files. It supports crash recovery, replication, and point-in-time recovery. Heavy write workloads can be limited by WAL volume, disk throughput, or replication lag.

32

What is replication in PostgreSQL?

Replication copies data from a primary server to one or more replicas. Streaming replication is commonly used for read scaling, high availability, and disaster recovery. Replicas can lag behind the primary, so applications must understand read-after-write requirements.

33

What is point-in-time recovery?

Point-in-time recovery, or PITR, restores a backup and replays WAL to a specific time before an incident. It is useful after accidental deletes, bad migrations, or corruption. PITR requires base backups, archived WAL, and tested restore procedures.

34

How do roles and privileges work?

PostgreSQL uses roles for users and groups. Privileges control access to databases, schemas, tables, sequences, functions, and more. Production systems should use least privilege and avoid running applications as superusers.

Example

CREATE ROLE app_user LOGIN PASSWORD 'change-me';
GRANT CONNECT ON DATABASE appdb TO app_user;
GRANT SELECT, INSERT, UPDATE ON users TO app_user;

35

What are schemas and search_path in PostgreSQL?

A schema is a namespace inside a database that groups objects such as tables, views, and functions. search_path controls which schemas PostgreSQL checks when an object name is not qualified. In production, qualify important objects or manage search_path carefully to avoid security and ambiguity problems.

Example

CREATE SCHEMA billing;
CREATE TABLE billing.invoices (id bigserial PRIMARY KEY);

SET search_path TO billing, public;

36

What is row-level security?

Row-level security, or RLS, restricts which rows a role can see or modify based on policies. It is useful for multi-tenant systems, but policies must be tested carefully because a mistake can expose or block data.

Example

ALTER TABLE invoices ENABLE ROW LEVEL SECURITY;

CREATE POLICY tenant_isolation ON invoices
USING (tenant_id = current_setting('app.tenant_id')::uuid);

37

What is full-text search in PostgreSQL?

Full-text search tokenizes documents and queries into lexemes, then ranks matches. It is useful for basic search without a separate search engine. Use tsvector, tsquery, and GIN indexes for performant search.

Example

CREATE INDEX idx_posts_search
ON posts USING gin(to_tsvector('english', title || ' ' || body));

SELECT * FROM posts
WHERE to_tsvector('english', title || ' ' || body) @@ plainto_tsquery('postgres');

38

What are advisory locks?

Advisory locks are application-defined locks managed by PostgreSQL. They are useful for leader election, scheduled job locking, or protecting shared resources. They require discipline because PostgreSQL does not know what business object the lock represents.

Example

SELECT pg_try_advisory_lock(12345);
-- do protected work
SELECT pg_advisory_unlock(12345);

39

How do joins work in PostgreSQL?

Joins combine rows from tables based on conditions. PostgreSQL can use nested loop, hash join, or merge join strategies depending on data size, indexes, statistics, and sort order. EXPLAIN helps show which strategy was chosen.

40

What is the difference between INNER JOIN and LEFT JOIN?

INNER JOIN returns only matching rows from both tables. LEFT JOIN returns all rows from the left table and matching rows from the right table, using NULLs where no match exists. LEFT JOIN is useful for optional relationships and missing-data reports.

Example

SELECT customers.email, orders.id
FROM customers
LEFT JOIN orders ON orders.customer_id = customers.id;

41

How do you find slow queries in PostgreSQL?

Use logs, pg_stat_statements, EXPLAIN ANALYZE, application tracing, and database metrics. Look at total time, mean time, rows read, rows returned, shared buffer hits, temporary files, lock waits, and query frequency.

42

What is pg_stat_statements?

pg_stat_statements is an extension that tracks normalized query statistics. It helps identify high-cost queries by total time, average time, calls, rows, and I/O. It is one of the most useful extensions for performance tuning.

Example

SELECT query, calls, total_exec_time, mean_exec_time
FROM pg_stat_statements
ORDER BY total_exec_time DESC
LIMIT 10;

43

What are prepared statements in PostgreSQL?

Prepared statements let PostgreSQL parse and plan a statement once, then execute it multiple times with different parameters. They can improve repeated query performance and help prevent SQL injection when parameters are bound correctly. Be aware that generic plans are not always optimal for every parameter distribution.

Example

PREPARE find_user(text) AS
SELECT id, email FROM users WHERE email = $1;

EXECUTE find_user('a@example.com');

44

What is a lock wait?

A lock wait happens when one transaction waits for another transaction lock. Long lock waits can slow or block production traffic. Monitor blocking queries, keep transactions short, and avoid running unsafe migrations during peak traffic.

Example

SELECT pid, wait_event_type, wait_event, query
FROM pg_stat_activity
WHERE wait_event_type = 'Lock';

45

How do you safely add an index in production?

Use CREATE INDEX CONCURRENTLY for large production tables so writes are not blocked the same way as a normal index build. It takes longer and cannot run inside a transaction block, but it is safer for live systems.

Example

CREATE INDEX CONCURRENTLY idx_orders_created_at
ON orders(created_at);

46

How do migrations affect PostgreSQL performance?

Migrations can lock tables, rewrite data, backfill rows, or create heavy indexes. Safe migrations split schema changes from data backfills, use concurrent indexes, batch updates, set timeouts, and test on production-like data volume.

47

What is query optimization in PostgreSQL?

Query optimization means improving query shape, indexes, statistics, schema design, and configuration so PostgreSQL can execute work efficiently. Start with measurements: EXPLAIN ANALYZE, pg_stat_statements, row counts, selectivity, and real latency.

48

What are common PostgreSQL security mistakes?

Common mistakes include using superuser accounts for apps, weak passwords, open network access, missing TLS, overbroad grants, unsafe dynamic SQL, unencrypted backups, exposed stateful replicas, and storing secrets in logs or migration files.

49

What are common PostgreSQL anti-patterns?

Common anti-patterns include missing indexes on foreign keys, indexing every column blindly, long idle transactions, unbounded connection pools, using JSONB for strongly relational data, ignoring autovacuum, running blocking migrations in peak hours, and tuning without EXPLAIN.

50

How would you design a simple PostgreSQL schema in an interview?

A strong demo schema uses primary keys, foreign keys, constraints, indexes that match queries, timestamps, and realistic relationships. Explain both data integrity and access patterns, because good schema design depends on how data is queried and updated.

Example

CREATE TABLE customers (
  id bigserial PRIMARY KEY,
  email text NOT NULL UNIQUE,
  created_at timestamptz NOT NULL DEFAULT now()
);

CREATE TABLE orders (
  id bigserial PRIMARY KEY,
  customer_id bigint NOT NULL REFERENCES customers(id),
  total numeric(12,2) NOT NULL CHECK (total >= 0),
  created_at timestamptz NOT NULL DEFAULT now()
);

CREATE INDEX idx_orders_customer_created
ON orders(customer_id, created_at DESC);

Top 50 PostgreSQL Interview Questions

What is PostgreSQL?

What is MVCC in PostgreSQL?

What is a transaction in PostgreSQL?

What are PostgreSQL isolation levels?

What is a deadlock in PostgreSQL?

What is EXPLAIN ANALYZE?

What is a B-tree index?

What is a composite index?

What is a partial index?

What is an expression index?

What are GIN indexes used for?

What are GiST indexes used for?

Why might PostgreSQL not use an index?

What are primary keys and foreign keys?

What are constraints in PostgreSQL?

What is UPSERT in PostgreSQL?

What does RETURNING do?

What are Common Table Expressions?

What are window functions?

What is JSONB in PostgreSQL?

When should you use arrays in PostgreSQL?

What are views?

What are materialized views?

What are stored functions and procedures?

What are triggers in PostgreSQL?

What is table partitioning?

What is VACUUM in PostgreSQL?

What is autovacuum?

What is ANALYZE in PostgreSQL?

What is connection pooling?

What is WAL in PostgreSQL?

What is replication in PostgreSQL?

What is point-in-time recovery?

How do roles and privileges work?

What are schemas and search_path in PostgreSQL?

What is row-level security?

What is full-text search in PostgreSQL?

What are advisory locks?

How do joins work in PostgreSQL?

What is the difference between INNER JOIN and LEFT JOIN?

How do you find slow queries in PostgreSQL?

What is pg_stat_statements?

What are prepared statements in PostgreSQL?

What is a lock wait?

How do you safely add an index in production?

How do migrations affect PostgreSQL performance?

What is query optimization in PostgreSQL?

What are common PostgreSQL security mistakes?

What are common PostgreSQL anti-patterns?

How would you design a simple PostgreSQL schema in an interview?

Use PostgreSQL interview prep to move into practice and application.

Popular Tutorials

Ready to Level Up Your Skills?