PostgreSQL Online Tutorials

PostgreSQL (/ˌpoʊstɡrɛskjuˈɛl/ POHST-gres-kew-EL), also known as Postgres, is a free and open-source relational database management system (RDBMS) emphasizing extensibility and SQL compliance. PostgreSQL features transactions with atomicity, consistency, isolation, durability (ACID) properties, automatically updatable views, materialized views, triggers, foreign keys, and stored procedures. It is supported on all major operating systems, including Windows, Linux, macOS, FreeBSD, and OpenBSD, and handles a range of workloads from single machines to data warehouses, data lakes, or web services with many concurrent users.

PostgreSQL
Developer(s): PostgreSQL Global Development Group
Initial release: 8 July 1996
Stable release: 17.2 / 21 November 2024
Repository: git.postgresql.org/gitweb/?p=postgresql.git
Written in: C (and C++ for the LLVM dependency)
Type: RDBMS
License: PostgreSQL License (free and open-source, permissive)
Website: www.postgresql.org

PostgreSQL License
Publisher: PostgreSQL Global Development Group; Regents of the University of California
Debian FSG compatible: Yes
FSF approved: Yes
OSI approved: Yes
GPL compatible: Yes
Copyleft: No
Linking from code with a different licence: Yes
Website: postgresql.org/about/licence

The PostgreSQL Global Development Group focuses only on developing a database engine and closely related components. This core is, technically, what comprises PostgreSQL itself, but there is an extensive developer community and ecosystem that provides other important feature sets that might, traditionally, be provided by a proprietary software vendor. These include special-purpose database engine features, like those needed to support a geospatial or temporal database, or features which emulate other database products. Also available from third parties are a wide variety of user and machine interface features, such as graphical user interfaces or load balancing and high availability toolsets. The large third-party PostgreSQL support network of people, companies, products, and projects, even though not part of the PostgreSQL Global Development Group, is essential to the PostgreSQL database engine's adoption and use and makes up the PostgreSQL ecosystem writ large.

PostgreSQL was originally named POSTGRES, referring to its origins as a successor to the Ingres database developed at the University of California, Berkeley. In 1996, the project was renamed PostgreSQL to reflect its support for SQL. After a review in 2007, the development team decided to keep the name PostgreSQL and the alias Postgres.

History

PostgreSQL evolved from the Ingres project at the University of California, Berkeley. In 1982, the leader of the Ingres team, Michael Stonebraker, left Berkeley to make a proprietary version of Ingres. He returned to Berkeley in 1985, and began a post-Ingres project to address the problems with contemporary database systems that had become increasingly clear during the early 1980s. He won the Turing Award in 2014 for these and other projects, and techniques pioneered in them.

The new project, POSTGRES, aimed to add the fewest features needed to completely support data types. These features included the ability to define types and to fully describe relationships – something used widely, but maintained entirely by the user. In POSTGRES, the database understood relationships, and could retrieve information in related tables in a natural way using rules. POSTGRES used many of the ideas of Ingres, but not its code.

Starting in 1986, published papers described the basis of the system, and a prototype version was shown at the 1988 ACM SIGMOD Conference. The team released version 1 to a small number of users in June 1989, followed by version 2 with a rewritten rules system in June 1990. Version 3, released in 1991, again rewrote the rules system, and added support for multiple storage managers and an improved query engine. By 1993, the number of users began to overwhelm the project with requests for support and features. After releasing version 4.2 on June 30, 1994, primarily a cleanup, the project ended. Berkeley released POSTGRES under an MIT License variant, which enabled other developers to use the code for any purpose. At the time, POSTGRES used an Ingres-influenced POSTQUEL query language interpreter, which could be interactively used with a console application named monitor.

In 1994, Berkeley graduate students Andrew Yu and Jolly Chen replaced the POSTQUEL query language interpreter with one for the SQL query language, creating Postgres95. The monitor console was also replaced by psql. Yu and Chen announced the first version (0.01) to beta testers on May 5, 1995. Version 1.0 of Postgres95 was announced on September 5, 1995, with a more liberal license that enabled the software to be freely modifiable.

On July 8, 1996, Marc Fournier at Hub.org Networking Services provided the first non-university development server for the open-source development effort. With the participation of Bruce Momjian and Vadim B. Mikheev, work began to stabilize the code inherited from Berkeley.

In 1996, the project was renamed to PostgreSQL to reflect its support for SQL. The online presence at the website PostgreSQL.org began on October 22, 1996. The first PostgreSQL release formed version 6.0 on January 29, 1997. Since then developers and volunteers around the world have maintained the software as The PostgreSQL Global Development Group.

The project continues to make releases available under its free and open-source software PostgreSQL License. Code comes from contributions from proprietary vendors, support companies, and open-source programmers.

Multiversion concurrency control (MVCC)

PostgreSQL manages concurrency through multiversion concurrency control (MVCC), which gives each transaction a "snapshot" of the database, allowing changes to be made without affecting other transactions. This largely eliminates the need for read locks, and ensures the database maintains ACID principles. PostgreSQL offers four levels of transaction isolation: Read Uncommitted, Read Committed, Repeatable Read and Serializable. Because PostgreSQL is immune to dirty reads, requesting the Read Uncommitted transaction isolation level provides Read Committed instead. PostgreSQL supports full serializability via the serializable snapshot isolation (SSI) method. The PostgreSQL MVCC implementation is prone to performance issues that require tuning when under a heavy write load which updates existing rows.
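
The snapshot idea can be illustrated with a small toy model. This is a sketch under simplifying assumptions, not PostgreSQL's actual implementation: the names RowVersion, created_by, deleted_by and visible_value are hypothetical, and a snapshot is reduced to the set of transaction ids that had committed when it was taken.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RowVersion:
    value: str
    created_by: int                   # id of the transaction that wrote this version
    deleted_by: Optional[int] = None  # id of the transaction that superseded it, if any


def visible_value(versions, committed_at_snapshot):
    """Return the value a snapshot sees, or None if no version is visible.

    committed_at_snapshot is the set of transaction ids that had committed
    when the reading transaction took its snapshot (a toy simplification).
    """
    for version in versions:
        created = version.created_by in committed_at_snapshot
        deleted = version.deleted_by in committed_at_snapshot
        if created and not deleted:
            return version.value
    return None


# Transaction 1 inserted the row; transaction 2 later updated it.
row = [
    RowVersion("balance=100", created_by=1, deleted_by=2),
    RowVersion("balance=150", created_by=2),
]

old_snapshot = {1}     # taken before transaction 2 committed
new_snapshot = {1, 2}  # taken after transaction 2 committed

print(visible_value(row, old_snapshot))  # balance=100
print(visible_value(row, new_snapshot))  # balance=150
```

Both readers get an answer without waiting for a lock, because the old version stays available until no snapshot needs it. Keeping and later cleaning up those old versions is also where the write-heavy tuning cost mentioned above comes from.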

Storage and replication

Replication

PostgreSQL includes built-in binary replication based on shipping the changes (write-ahead logs (WAL)) to replica nodes asynchronously, with the ability to run read-only queries against these replicated nodes. This allows splitting read traffic among multiple nodes efficiently. Earlier replication software that allowed similar read scaling normally relied on adding replication triggers to the master, increasing load.
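
As a rough illustration of asynchronous log shipping, the toy model below has a primary append every change to an ordered log, which a replica replays later. The class and attribute names are hypothetical, and real WAL records describe low-level storage changes rather than key/value pairs.

```python
class Primary:
    def __init__(self):
        self.data = {}
        self.wal = []  # ordered list of (key, value) change records

    def write(self, key, value):
        self.wal.append((key, value))  # log the change first
        self.data[key] = value         # then apply it locally


class Replica:
    def __init__(self):
        self.data = {}
        self.replayed = 0  # number of log records applied so far

    def replay(self, wal):
        # Apply only the records this replica has not seen yet.
        for key, value in wal[self.replayed:]:
            self.data[key] = value
        self.replayed = len(wal)


primary = Primary()
replica = Replica()

primary.write("a", 1)
primary.write("b", 2)

# Asynchronous: the replica lags until the log reaches it.
lag_before = len(primary.wal) - replica.replayed

replica.replay(primary.wal)
lag_after = len(primary.wal) - replica.replayed
```

Read-only queries against the replica see whatever has been replayed so far, which is why an asynchronous replica can briefly return slightly stale data.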

PostgreSQL includes built-in synchronous replication that ensures that, for each write transaction, the master waits until at least one replica node has written the data to its transaction log. Unlike other database systems, the durability of a transaction (whether it is asynchronous or synchronous) can be specified per-database, per-user, per-session or even per-transaction. This can be useful for workloads that do not require such guarantees. Synchronous commit may not be wanted for all data, because waiting for confirmation that the transaction has reached the synchronous standby slows performance.

Standby servers can be synchronous or asynchronous. Synchronous standby servers can be specified in the configuration which determines which servers are candidates for synchronous replication. The first in the list that is actively streaming will be used as the current synchronous server. When this fails, the system fails over to the next in line.
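
The selection rule described above can be sketched in a few lines. The function and server names are hypothetical; the sketch only models "first candidate in the list that is actively streaming".

```python
def current_sync_standby(candidates, streaming):
    """Return the first configured candidate that is actively streaming."""
    for name in candidates:
        if name in streaming:
            return name
    return None  # no candidate is available


candidates = ["standby_a", "standby_b", "standby_c"]  # order from the configuration

normal = current_sync_standby(candidates, {"standby_a", "standby_b"})
after_failure = current_sync_standby(candidates, {"standby_b", "standby_c"})
```

With all standbys healthy the first entry is chosen. Once standby_a stops streaming, the role passes to standby_b, the next in line.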

Synchronous multi-master replication is not included in the PostgreSQL core. Postgres-XC, which is based on PostgreSQL, provides scalable synchronous multi-master replication. It is licensed under the same license as PostgreSQL. A related project is called Postgres-XL. Postgres-R is yet another fork. Bidirectional replication (BDR) is an asynchronous multi-master replication system for PostgreSQL.

Tools such as repmgr make managing replication clusters easier.

Several asynchronous trigger-based replication packages are available. These remain useful even after introduction of the expanded core abilities, for situations where binary replication of a full database cluster is inappropriate:

  • Slony-I
  • Londiste, part of SkyTools (developed by Skype)
  • Bucardo multi-master replication (developed by Backcountry.com)
  • SymmetricDS multi-master, multi-tier replication

Indexes

PostgreSQL includes built-in support for regular B-tree and hash table indexes, and four index access methods: generalized search trees (GiST), generalized inverted indexes (GIN), Space-Partitioned GiST (SP-GiST) and Block Range Indexes (BRIN). In addition, user-defined index methods can be created, although this is quite an involved process. Indexes in PostgreSQL also support the following features:

  • Expression indexes can be created with an index of the result of an expression or function, instead of simply the value of a column.
  • Partial indexes, which only index part of a table, can be created by adding a WHERE clause to the end of the CREATE INDEX statement. This allows a smaller index to be created.
  • The planner is able to use multiple indexes together to satisfy complex queries, using temporary in-memory bitmap index operations (useful for data warehouse applications for joining a large fact table to smaller dimension tables such as those arranged in a star schema).
  • k-nearest neighbors (k-NN) indexing (also referred to as KNN-GiST) provides efficient searching of "closest values" to that specified, useful for finding similar words, or close objects or locations with geospatial data. This is achieved without exhaustive matching of values.
  • Index-only scans often allow the system to fetch data from indexes without ever having to access the main table.
  • Block Range Indexes (BRIN).
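
Expression indexes and partial indexes, from the list above, can be pictured with a dictionary-based toy index. This is only a conceptual sketch: build_index and the sample rows are hypothetical, and a real B-tree is an ordered structure rather than a hash map.

```python
from collections import defaultdict


def build_index(rows, key, predicate=lambda row: True):
    """Map key(row) to row positions, indexing only rows that satisfy predicate."""
    index = defaultdict(list)
    for position, row in enumerate(rows):
        if predicate(row):
            index[key(row)].append(position)
    return dict(index)


users = [
    {"email": "Ann@Example.com", "active": True},
    {"email": "bob@example.com", "active": False},
    {"email": "CAROL@example.com", "active": True},
]

# Expression index: the key is the result of a function, not the raw column.
by_lower_email = build_index(users, key=lambda r: r["email"].lower())

# Partial index: the predicate plays the role of the WHERE clause.
active_only = build_index(
    users,
    key=lambda r: r["email"].lower(),
    predicate=lambda r: r["active"],
)
```

The partial index holds two entries instead of three, which is the size saving the list item refers to. A lookup for an inactive user cannot be answered from it.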

Schemas

PostgreSQL schemas are namespaces, allowing objects of the same kind and name to co-exist in a single database. They are not to be confused with a database schema (the abstract, structural, organizational specification that defines how every table's data relates to data within other tables). All PostgreSQL database objects, except for a few global objects such as roles and tablespaces, exist within a schema. Schemas cannot be nested: a schema cannot contain another schema. The permission system controls access to schemas and their content. By default, newly created databases have only a single schema called public, but other schemas can be added and the public schema isn't mandatory.

A search_path setting determines the order in which PostgreSQL checks schemas for unqualified objects (those without a prefixed schema). By default, it is set to $user, public ($user refers to the currently connected database user). This default can be set on a database or role level, but as it is a session parameter, it can be freely changed (even multiple times) during a client session, affecting that session only.

Non-existent schemas, or other schemas not accessible to the logged-in user, that are listed in search_path are silently skipped during object lookup.

New objects are created in whichever valid schema (one that can be accessed) appears first in the search_path.
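
The lookup and creation rules for search_path can be sketched as follows. This is a toy model with hypothetical function names. It treats "accessible" as simply "exists in the dictionary", so permissions are not modelled.

```python
def resolve(name, search_path, schemas, current_user):
    """Return the schema-qualified name for an unqualified object, or None."""
    for entry in search_path:
        schema = current_user if entry == "$user" else entry
        objects = schemas.get(schema)  # None when the schema does not exist
        if objects is not None and name in objects:
            return f"{schema}.{name}"
    return None


def creation_schema(search_path, schemas, current_user):
    """Return the first valid schema in the path, where new objects would go."""
    for entry in search_path:
        schema = current_user if entry == "$user" else entry
        if schema in schemas:
            return schema
    return None


schemas = {
    "public": {"orders", "customers"},
    "reporting": {"orders"},
}

default_path = ["$user", "public"]      # no schema named "alice" exists
custom_path = ["reporting", "public"]

default_hit = resolve("orders", default_path, schemas, "alice")
custom_hit = resolve("orders", custom_path, schemas, "alice")
target = creation_schema(default_path, schemas, "alice")
```

Because no schema named alice exists, the $user entry is skipped silently and the default path resolves orders to public.orders. Putting reporting first changes the answer to reporting.orders.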

Data types

A wide variety of native data types are supported, including:

  • Boolean
  • Arbitrary-precision numerics
  • Character (text, varchar, char)
  • Binary
  • Date/time (timestamp/time with/without time zone, date, interval)
  • Money
  • Enum
  • Bit strings
  • Text search type
  • Composite
  • HStore, an extension-enabled key-value store within PostgreSQL
  • Arrays (variable-length and can be of any data type, including text and composite types) up to 1 GB in total storage size
  • Geometric primitives
  • IPv4 and IPv6 addresses
  • Classless Inter-Domain Routing (CIDR) blocks and MAC addresses
  • XML supporting XPath queries
  • Universally unique identifier (UUID)
  • JavaScript Object Notation (JSON), and a faster binary JSONB (not the same as BSON)

In addition, users can create their own data types which can usually be made fully indexable via PostgreSQL's indexing infrastructures – GiST, GIN, SP-GiST. Examples of these include the geographic information system (GIS) data types from the PostGIS project for PostgreSQL.

There is also a data type called a domain, which is the same as any other data type but with optional constraints defined by the creator of that domain. This means any data entered into a column using the domain will have to conform to whichever constraints were defined as part of the domain.

Range types are data types that represent a range of data. These can be discrete ranges (e.g., all integer values 1 to 10) or continuous ranges (e.g., any time between 10:00 am and 11:00 am). The built-in range types available include ranges of integers, big integers, decimal numbers, time stamps (with and without time zone) and dates.
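
A range value can be modelled as a pair of bounds plus a flag for whether each bound is included. The Range class below is a hypothetical sketch of that idea, not PostgreSQL's representation. Times are expressed as minutes after midnight to keep the example short.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    lower: float
    upper: float
    lower_inclusive: bool = True
    upper_inclusive: bool = False

    def contains(self, value):
        """Return True when value lies inside the range, honouring the bound flags."""
        above = value >= self.lower if self.lower_inclusive else value > self.lower
        below = value <= self.upper if self.upper_inclusive else value < self.upper
        return above and below


# Discrete example: all integers from 1 to 10, both ends included.
one_to_ten = Range(1, 10, lower_inclusive=True, upper_inclusive=True)

# Continuous example: 10:00 am up to, but not including, 11:00 am.
morning_slot = Range(10 * 60, 11 * 60)

print(one_to_ten.contains(10))         # True
print(morning_slot.contains(10 * 60))  # True
print(morning_slot.contains(11 * 60))  # False
```

Excluding the upper bound lets back-to-back slots such as 10:00 to 11:00 and 11:00 to 12:00 share an endpoint without overlapping.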

Custom range types can be created to make new types of ranges available, such as IP address ranges using the inet type as a base, or float ranges using the float data type as a base. Range types support inclusive and exclusive range boundaries using the [] and () characters respectively.

PostgreSQL can link to other systems to retrieve data via foreign data wrappers (FDWs). These can take the form of any data source, such as a file system, another relational database management system (RDBMS), or a web service. This means that regular database queries can use these data sources like regular tables, and even join multiple data-sources together.

Interfaces

PostgreSQL supports a binary communication protocol that allows applications to connect to the database server. The protocol is versioned (currently 3.0, as of PostgreSQL 7.4) and has a detailed specification.
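
To give a feel for what "binary protocol" means here, the sketch below frames two client messages by hand. The layout follows my understanding of the published protocol 3.0 message formats (a 32-bit big-endian length that counts itself, and null-terminated strings), and it is simplified. The helper names are hypothetical, and nothing is sent to a server.

```python
import struct

# Protocol 3.0: major version in the high 16 bits, minor in the low 16 bits.
PROTOCOL_VERSION_3_0 = (3 << 16) | 0


def startup_message(user, database):
    """Frame a startup packet: length, protocol version, then key/value pairs."""
    body = struct.pack("!I", PROTOCOL_VERSION_3_0)
    for key, value in (("user", user), ("database", database)):
        body += key.encode() + b"\x00" + value.encode() + b"\x00"
    body += b"\x00"  # terminator after the last pair
    return struct.pack("!I", len(body) + 4) + body  # length includes itself


def query_message(sql):
    """Frame a simple query: type byte 'Q', length, null-terminated SQL text."""
    payload = sql.encode() + b"\x00"
    return b"Q" + struct.pack("!I", len(payload) + 4) + payload


startup = startup_message("alice", "testdb")
query = query_message("SELECT 1")
```

Client libraries such as libpq build and parse these frames internally, so applications rarely handle them directly.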

The official client implementation of this communication protocol is a C API, libpq. In addition, the officially supported ECPG tool allows SQL commands to be embedded in C code. Both are part of the standard PostgreSQL distribution.

Third-party libraries for connecting to PostgreSQL are available for many programming languages, including C++, Java, Julia, Python, Node.js, Go, and Rust.

Procedural languages

Procedural languages allow developers to extend the database with custom subroutines (functions), often called stored procedures. These functions can be used to build database triggers (functions invoked on modification of certain data) and custom data types and aggregate functions. Procedural languages can also be invoked without defining a function, using a DO command at SQL level.

Languages are divided into two groups: Procedures written in safe languages are sandboxed and can be safely created and used by any user. Procedures written in unsafe languages can only be created by superusers, because they allow bypassing a database's security restrictions, but can also access sources external to the database. Some languages like Perl provide both safe and unsafe versions.

PostgreSQL has built-in support for three procedural languages:

  • Plain SQL (safe). Simpler SQL functions can get expanded inline into the calling (SQL) query, which saves function call overhead and allows the query optimizer to "see inside" the function.
  • Procedural Language/PostgreSQL (PL/pgSQL) (safe), which resembles Oracle's Procedural Language for SQL (PL/SQL) procedural language and SQL/Persistent Stored Modules (SQL/PSM).
  • C (unsafe), which allows loading one or more custom shared libraries into the database. Functions written in C offer the best performance, but bugs in code can crash and potentially corrupt the database. Most built-in functions are written in C.

In addition, PostgreSQL allows procedural languages to be loaded into the database through extensions. Three language extensions are included with PostgreSQL to support Perl, Tcl, and Python. For Python, the current Python 3 is used, and the discontinued Python 2 is no longer supported as of PostgreSQL 15. Both were supported previously, defaulting to Python 2, while old and new versions couldn't be used in the same session. External projects provide support for many other languages, including PL/Java, JavaScript (PL/V8), PL/Julia, PL/R, PL/Ruby, and others.

Triggers

Triggers are events triggered by the action of SQL data manipulation language (DML) statements. For example, an INSERT statement might activate a trigger that checks if the values of the statement are valid. Most triggers are only activated by either INSERT or UPDATE statements.

Triggers are fully supported and can be attached to tables. Triggers can be per-column and conditional, in that UPDATE triggers can target specific columns of a table, and triggers can be told to execute under a set of conditions as specified in the trigger's WHEN clause. Triggers can be attached to views by using the INSTEAD OF condition. Multiple triggers are fired in alphabetical order. In addition to calling functions written in the native PL/pgSQL, triggers can also invoke functions written in other languages like PL/Python or PL/Perl.
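
The firing order and the conditional behaviour can be sketched with a toy dispatcher. The trigger names and the fire_triggers function are hypothetical. The sketch models only "sort by name, then run each trigger whose condition holds".

```python
def fire_triggers(triggers, old_row, new_row):
    """Run triggers in name order, skipping those whose condition is false."""
    fired = []
    for name in sorted(triggers):
        condition, action = triggers[name]
        if condition is None or condition(old_row, new_row):
            action(old_row, new_row)
            fired.append(name)
    return fired


audit_log = []

triggers = {
    "b_audit": (None, lambda old, new: audit_log.append(("audit", new))),
    "c_price_changed": (
        lambda old, new: old["price"] != new["price"],  # the trigger's condition
        lambda old, new: audit_log.append(("price", old["price"], new["price"])),
    ),
    "a_validate": (None, lambda old, new: audit_log.append(("validate", new))),
}

unchanged = fire_triggers(triggers, {"price": 10}, {"price": 10})
changed = fire_triggers(triggers, {"price": 10}, {"price": 12})
```

Because the order depends on names, a prefix such as a_ or b_ is a common way to make the intended sequence explicit.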

Asynchronous notifications

PostgreSQL provides an asynchronous messaging system that is accessed through the NOTIFY, LISTEN and UNLISTEN commands. A session can issue a NOTIFY command, along with the user-specified channel and an optional payload, to mark a particular event occurring. Other sessions are able to detect these events by issuing a LISTEN command, which can listen to a particular channel. This functionality can be used for a wide variety of purposes, such as letting other sessions know when a table has been updated or for separate applications to detect when a particular action has been performed. Such a system removes the need for continuous polling by applications to see if anything has yet changed, reducing unnecessary overhead. Notifications are fully transactional, in that messages are not sent until the transaction they were sent from is committed. This eliminates the problem of messages being sent for an action being performed which is then rolled back.
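
The transactional behaviour of notifications can be sketched with a toy in-process model. NotificationBus and Transaction are hypothetical names. The point illustrated is only that messages are held until commit and discarded on rollback.

```python
from collections import defaultdict


class NotificationBus:
    def __init__(self):
        self.listeners = defaultdict(list)

    def listen(self, channel, callback):
        self.listeners[channel].append(callback)

    def deliver(self, channel, payload):
        for callback in self.listeners[channel]:
            callback(payload)


class Transaction:
    def __init__(self, bus):
        self.bus = bus
        self.pending = []

    def notify(self, channel, payload=""):
        self.pending.append((channel, payload))  # held until commit

    def commit(self):
        for channel, payload in self.pending:
            self.bus.deliver(channel, payload)
        self.pending.clear()

    def rollback(self):
        self.pending.clear()  # listeners never see these messages


bus = NotificationBus()
received = []
bus.listen("orders", received.append)

rolled_back = Transaction(bus)
rolled_back.notify("orders", "order 1 created")
rolled_back.rollback()

committed = Transaction(bus)
committed.notify("orders", "order 2 created")
seen_before_commit = list(received)
committed.commit()
```

After running, received contains only the message from the committed transaction, so listeners never react to work that was undone.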

Many connectors for PostgreSQL provide support for this notification system (including libpq, JDBC, Npgsql, psycopg and node.js) so it can be used by external applications.

PostgreSQL can act as an effective, persistent "pub/sub" server or job server by combining LISTEN with FOR UPDATE SKIP LOCKED.
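
The job-server pattern relies on each worker claiming the first row that nobody else has locked, instead of waiting for a locked one. The toy function below imitates that skipping behaviour with a plain dictionary. The names are hypothetical, and in PostgreSQL the locks are row locks held by the database, not application state.

```python
def claim_next_job(jobs, locked, worker):
    """Lock and return the first job no other worker holds, else None."""
    for job_id in jobs:
        if job_id not in locked:
            locked[job_id] = worker
            return job_id
    return None


jobs = ["job-1", "job-2", "job-3"]
locked = {}

first = claim_next_job(jobs, locked, "worker-a")
second = claim_next_job(jobs, locked, "worker-b")  # skips job-1, already locked
third = claim_next_job(jobs, locked, "worker-c")
fourth = claim_next_job(jobs, locked, "worker-d")  # nothing left to claim
```

Each worker receives a different job and none blocks on another's lock. A notification on a channel can then be used to wake idle workers when new jobs arrive.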

Rules

Rules allow the "query tree" of an incoming query to be rewritten; they are an automatically invoked macro language for SQL. "Query Re-Write Rules" are attached to a table/class and "Re-Write" the incoming DML (select, insert, update, and/or delete) into one or more queries that either replace the original DML statement or execute in addition to it. Query Re-Write occurs after DML statement parsing and before query planning.

The functionality rules provide was, in almost every way, later duplicated with the introduction of newer types of triggers. The use of triggers is usually preferred over rules as it is easier to reason about trigger behavior and interactions than when equivalent rules are used.

Other querying features

  • Transactions
  • Full-text search
  • Views
    • Materialized views
    • Updateable views
    • Recursive views
  • Inner, outer (full, left, and right), and cross joins
  • Sub-selects
    • Correlated sub-queries
  • Regular expressions
  • Common table expressions and writable common table expressions
  • Encrypted connections via Transport Layer Security (TLS); current versions do not use vulnerable SSL, even with that configuration option
  • Domains
  • Savepoints
  • Two-phase commit
  • The Oversized-Attribute Storage Technique (TOAST) is used to transparently store large table attributes (such as big MIME attachments or XML messages) in a separate area, with automatic compression.
  • Embedded SQL is implemented using a preprocessor. SQL code is first written embedded in C code. The code is then run through the ECPG preprocessor, which replaces the SQL with calls to a code library. The resulting code can then be compiled with a C compiler. Embedding also works with C++, but the preprocessor does not recognize all C++ constructs.
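
The TOAST item in the list above describes moving large attribute values out of the main row and compressing them transparently. A toy version of that idea is sketched below. ToyTable and THRESHOLD are hypothetical, the size limit is arbitrary, and zlib stands in for whatever compression the server actually applies.

```python
import zlib

THRESHOLD = 2000  # hypothetical size limit in bytes for inline storage


class ToyTable:
    def __init__(self):
        self.rows = {}        # row id -> ("inline", bytes) or ("external", row id)
        self.side_store = {}  # row id -> compressed bytes kept out of line

    def put(self, row_id, value):
        if len(value) > THRESHOLD:
            self.side_store[row_id] = zlib.compress(value)
            self.rows[row_id] = ("external", row_id)
        else:
            self.rows[row_id] = ("inline", value)

    def get(self, row_id):
        kind, content = self.rows[row_id]
        if kind == "external":
            return zlib.decompress(self.side_store[content])
        return content  # the caller cannot tell where the value was stored


table = ToyTable()
table.put("small", b"x" * 10)
table.put("large", b"abc" * 5000)
```

Callers of get receive the original bytes either way, which is the sense in which the storage is transparent.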

Concurrency model

The PostgreSQL server is process-based (not threaded), and uses one operating system process per database session. Multiple sessions are automatically spread across all available CPUs by the operating system. Many types of queries can also be parallelized across multiple background worker processes, taking advantage of multiple CPUs or cores. Client applications can use threads and create multiple database connections from each thread.

Security

PostgreSQL manages its internal security on a per-role basis. A role is generally regarded to be a user (a role that can log in), or a group (a role of which other roles are members). Permissions can be granted or revoked on any object down to the column level, and can allow or prevent the visibility/creation/alteration/deletion of objects at the database, schema, table, and row levels.
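
Role membership can be sketched as a small graph walk. The example assumes that a member role inherits the privileges of the groups it belongs to. All role names, grants and function names are hypothetical.

```python
def effective_roles(role, memberships):
    """Return the role plus every group it belongs to, directly or indirectly."""
    seen = set()
    stack = [role]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(memberships.get(current, ()))
    return seen


def has_privilege(role, privilege, obj, memberships, grants):
    """Check whether the role, or any group it is a member of, holds the grant."""
    return any(
        (member, privilege, obj) in grants
        for member in effective_roles(role, memberships)
    )


memberships = {
    "alice": ["analysts"],        # alice is a member of the analysts group
    "analysts": ["readers"],      # analysts is itself a member of readers
}

grants = {
    ("readers", "SELECT", "orders"),
    ("analysts", "SELECT", "orders.total"),  # a column-level grant
    ("admins", "DELETE", "orders"),
}

can_read = has_privilege("alice", "SELECT", "orders", memberships, grants)
can_delete = has_privilege("alice", "DELETE", "orders", memberships, grants)
```

Granting to a group role once and managing membership separately keeps per-user grants to a minimum.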

PostgreSQL's SECURITY LABEL feature (an extension to SQL standards) allows for additional security, with a bundled loadable module that supports label-based mandatory access control (MAC) based on Security-Enhanced Linux (SELinux) security policy.

PostgreSQL natively supports a wide range of external authentication mechanisms, including:

  • Password: either SCRAM-SHA-256, MD5 or plain-text
  • Generic Security Services Application Program Interface (GSSAPI)
  • Security Support Provider Interface (SSPI)
  • Kerberos
  • ident (maps O/S user-name as provided by an ident server to database user-name)
  • Peer (maps local user name to database user name)
  • Lightweight Directory Access Protocol (LDAP)
    • Active Directory (AD)
  • RADIUS
  • Certificate
  • Pluggable authentication module (PAM)

The GSSAPI, SSPI, Kerberos, peer, ident and certificate methods can also use a specified "map" file that lists which users matched by that authentication system are allowed to connect as a specific database user.

These methods are specified in the cluster's host-based authentication configuration file (pg_hba.conf), which determines what connections are allowed. This allows control over which user can connect to which database, where they can connect from (IP address, IP address range, domain socket), which authentication system will be enforced, and whether the connection must use Transport Layer Security (TLS).
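
How such a file selects an authentication method can be sketched as a first-match search over an ordered list of records. The rule set below is hypothetical and simplified (real records also carry a connection type and options). Stopping at the first matching record reflects my understanding of how pg_hba.conf is evaluated.

```python
import ipaddress

RULES = [
    # (database, user, client network, method)
    ("all", "all", "127.0.0.1/32", "scram-sha-256"),
    ("sales", "alice", "10.0.0.0/8", "cert"),
    ("all", "all", "0.0.0.0/0", "reject"),
]


def auth_method(database, user, client_ip, rules=RULES):
    """Return the method of the first record matching the connection, else None."""
    address = ipaddress.ip_address(client_ip)
    for rule_db, rule_user, network, method in rules:
        if rule_db not in ("all", database):
            continue
        if rule_user not in ("all", user):
            continue
        if address in ipaddress.ip_network(network):
            return method
    return None  # no record matched


local = auth_method("sales", "bob", "127.0.0.1")
alice_remote = auth_method("sales", "alice", "10.1.2.3")
bob_remote = auth_method("sales", "bob", "10.1.2.3")
```

Order matters in this model: moving the catch-all reject record to the top would block every connection, including the ones the later records were meant to allow.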

Standards compliance

PostgreSQL claims high, but not complete, conformance with the latest SQL standard ("as of the version 17 release in September 2024, PostgreSQL conforms to at least 170 of the 177 mandatory features for SQL:2023 Core conformance", and no other databases fully conformed to it). One exception is the handling of unquoted identifiers like table or column names. In PostgreSQL they are folded, internally, to lower case characters whereas the standard says that unquoted identifiers should be folded to upper case. Thus, Foo should be equivalent to FOO not foo according to the standard. Other shortcomings concern the absence of temporal tables allowing automatic logging of row versions during transactions with the possibility of browsing in time (FOR SYSTEM TIME predicate), although relatively SQL compliant third-party extensions are available.
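
The identifier-folding difference can be shown with a short function. fold_identifier is a hypothetical helper that handles only the simple cases: unquoted names are folded, and double-quoted names keep their case.

```python
def fold_identifier(identifier, standard=False):
    """Fold an identifier the PostgreSQL way (lower case) or the standard way (upper case)."""
    if len(identifier) >= 2 and identifier[0] == '"' and identifier[-1] == '"':
        return identifier[1:-1]  # quoted: case is preserved
    return identifier.upper() if standard else identifier.lower()


print(fold_identifier("Foo"))                 # foo
print(fold_identifier("Foo", standard=True))  # FOO
print(fold_identifier('"Foo"'))               # Foo
```

In practice this means a table created as "Foo" with quotes must always be referenced with quotes in PostgreSQL, because the unquoted Foo folds to foo.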

Benchmarks and performance

Many informal performance studies of PostgreSQL have been done. Performance improvements aimed at improving scalability began heavily with version 8.1. Simple benchmarks between version 8.0 and version 8.4 showed that the latter was more than ten times faster on read-only workloads and at least 7.5 times faster on both read and write workloads.

The first industry-standard and peer-validated benchmark was completed in June 2007, using the Sun Java System Application Server (proprietary version of GlassFish) 9.0 Platform Edition, UltraSPARC T1-based Sun Fire server and PostgreSQL 8.2. This result of 778.14 SPECjAppServer2004 JOPS@Standard compares favourably with the 874 JOPS@Standard with Oracle 10 on an Itanium-based HP-UX system.

In August 2007, Sun submitted an improved benchmark score of 813.73 SPECjAppServer2004 JOPS@Standard. With the system under test at a reduced price, the price/performance improved from $84.98/JOPS to $70.57/JOPS.

The default configuration of PostgreSQL uses only a small amount of dedicated memory for performance-critical purposes such as caching database blocks and sorting. This limitation is primarily because older operating systems required kernel changes to allow allocating large blocks of shared memory. PostgreSQL.org provides advice on basic recommended performance practice in a wiki.

In April 2012, Robert Haas of EnterpriseDB demonstrated PostgreSQL 9.2's linear CPU scalability using a server with 64 cores.

Matloob Khushi performed benchmarking between PostgreSQL 9.0 and MySQL 5.6.15 for their ability to process genomic data. In his performance analysis he found that PostgreSQL extracts overlapping genomic regions eight times faster than MySQL using two datasets of 80,000 each forming random human DNA regions. Insertion and data uploads in PostgreSQL were also better, although general searching ability of both databases was almost equivalent.

Platforms

PostgreSQL is available for the following operating systems: Linux (all recent distributions); macOS (OS X) version 10.14 and newer, with 64-bit ARM and x86-64 installers available and tested; Windows, with installers available and tested for 64-bit Windows Server 2022 and 2016;[90] FreeBSD; OpenBSD;[91] NetBSD; DragonFlyBSD; and, without official binary executables (though unofficial ones are likely available), Solaris[92] and illumos.

PostgreSQL can be expected to work on any of the following instruction set architectures (and operating systems): 64-bit x86-64 and 32-bit x86 on Windows and other operating systems; and, on operating systems other than Windows, 64-bit ARM[93] and the older 32-bit ARM (including older versions such as ARMv6 in the Raspberry Pi[94]), RISC-V, z/Architecture, S/390, PowerPC (including 64-bit Power ISA), SPARC (also 64-bit), MIPS and PA-RISC. It was also known to work on some other platforms, although these have not been tested for years, that is, not with the latest versions.[95]

Database administration

Open source front-ends and tools for administering PostgreSQL include:

psql Session Example[96]
regression=# select foo;
ERROR:  column "foo" does not exist
CONTEXT:  PL/pgSQL function "test1" while casting return value to function's return type
LINE 1: select foo;
               ^
regression=# \q
peter@localhost testdb=> \a \t \x
Output format is aligned.
Tuples only is off.
Expanded display is on.
regression=# select '\x';
WARNING:  nonstandard use of escape in a string literal
LINE 1: select '\x';
               ^
HINT:  Use the escape string syntax for escapes, e.g., E'\r\n'.
 ?column? 
----------
 x
(1 row)

regression=# select E'\x';
piro=> \set foo 30;
piro=> select * from test where foo <= :foo;
 foo | bar 
-----+-----
  10 | 
  20 | 
(2 rows)
testdb=> \set foo 'my_table'
testdb=> SELECT * FROM :"foo";

testdb=> \set content `cat my_file.txt`
testdb=> INSERT INTO my_table VALUES (:'content');
regression=# select (
regression(# 1);
 ?column? 
----------
        1
(1 row)
piro=> select (
piro(> '
piro'> ' || $$
piro$> $$)
piro-> from "
piro"> foo";
ERROR:  relation "
foo" does not exist
LINE 5: from "
             ^
testdb=> CREATE TABLE my_table (
first integer not null default 0,
second text) ; -- end of command
CREATE TABLE

=# SELECT '0x10'::mpz AS "hex", '10'::mpz AS "dec",
-#        '010'::mpz AS oct, '0b10'::mpz AS bin; -- Table output
 hex | dec | oct | bin
-----+-----+-----+-----
 16  | 10  | 8   | 2
(1 row)
regression=# select schemaname from  pg_tables limit 3; -- One field output
 schemaname 
------------
 pg_catalog
 pg_catalog
 pg_catalog
(3 rows)

=# select 10.0, 1e-6, 1E+6;
 ?column? | ?column? | ?column? 
----------+----------+----------
     10.0 | 0.000001 |  1000000
(1 row)

regression=# begin;
BEGIN
regression=# create table asdf (foo serial primary key);
NOTICE:  CREATE TABLE will create implicit sequence "asdf_foo_seq" for serial column "asdf.foo"
NOTICE:  CREATE TABLE / PRIMARY KEY will create implicit index "asdf_pkey" for table "asdf"
CREATE TABLE
regression=# insert into asdf values (10) returning foo;
 foo 
-----
  10
(1 row)

INSERT 0 1
regression=# ROLLBACK ;
ROLLBACK
psql
The primary front-end for PostgreSQL is the psql command-line program, which can be used to enter SQL queries directly, or execute them from a file. In addition, psql provides a number of meta-commands and various shell-like features to facilitate writing scripts and automating a wide variety of tasks; for example tab completion of object names and SQL syntax.
pgAdmin
The pgAdmin package is a free and open-source graphical user interface (GUI) administration tool for PostgreSQL, which is supported on many computer platforms.[97] The program is available in more than a dozen languages. The first prototype, named pgManager, was written for PostgreSQL 6.3.2 from 1998, and rewritten and released as pgAdmin under the GNU General Public License (GPL) in later months. The second incarnation (named pgAdmin II) was a complete rewrite, first released on January 16, 2002. The third version, pgAdmin III, was originally released under the Artistic License and then released under the same license as PostgreSQL. Unlike prior versions that were written in Visual Basic, pgAdmin III is written in C++, using the wxWidgets[98] framework allowing it to run on most common operating systems. The query tool includes a scripting language called pgScript for supporting admin and development tasks. In December 2014, Dave Page, the pgAdmin project founder and primary developer,[99] announced that with the shift towards web-based models, work has begun on pgAdmin 4 with the aim to facilitate cloud deployments.[100] In 2016, pgAdmin 4 was released. The pgAdmin 4 backend was written in Python, using Flask and the Qt framework.[101]
phpPgAdmin
phpPgAdmin is a web-based administration tool for PostgreSQL written in PHP and based on the popular phpMyAdmin interface originally written for MySQL administration.[102]
PostgreSQL Studio
PostgreSQL Studio allows users to perform essential PostgreSQL database development tasks from a web-based console. PostgreSQL Studio allows users to work with cloud databases without the need to open firewalls.[103]
TeamPostgreSQL
AJAX/JavaScript-driven web interface for PostgreSQL. Allows browsing, maintaining and creating data and database objects via a web browser. The interface offers tabbed SQL editor with autocompletion, row editing widgets, click-through foreign key navigation between rows and tables, favorites management for commonly used scripts, among other features. Supports SSH for both the web interface and the database connections. Installers are available for Windows, Macintosh, and Linux, and a simple cross-platform archive that runs from a script.[104]
LibreOffice, OpenOffice.org
LibreOffice and OpenOffice.org Base can be used as a front-end for PostgreSQL.[105][106]
pgBadger
The pgBadger PostgreSQL log analyzer generates detailed reports from a PostgreSQL log file.[107]
pgDevOps
pgDevOps is a suite of web tools to install & manage multiple PostgreSQL versions, extensions, and community components, develop SQL queries, monitor running databases and find performance problems.[108]
Adminer
Adminer is a simple web-based administration tool for PostgreSQL and others, written in PHP.
pgBackRest
pgBackRest is a backup and restore tool for PostgreSQL that provides support for full, differential, and incremental backups.[109]
pgaudit
pgaudit is a PostgreSQL extension that provides detailed session and/or object audit logging via the standard logging facility provided by PostgreSQL.[110]
WAL-E
WAL-E is a backup and restore tool for PostgreSQL that provides support for physical (WAL-based) backups, written in Python.[111]
DBeaver
DBeaver is a free and open-source GUI administration tool for PostgreSQL with visual entity diagrams and IntelliSense-style autocompletion features. A commercial PRO license is also available.

A number of companies offer proprietary tools for PostgreSQL. They often consist of a universal core that is adapted for various specific database products. These tools mostly share the administration features with the open source tools but offer improvements in data modeling, importing, exporting or reporting.

Notable users

Notable organizations and products that use PostgreSQL as the primary database include:

  • Microsoft, used for a petabyte-scale “Release Quality View” (RQV) analytics dashboard, which tracks quality of Windows updates analyzing 20K types of metrics from over 800M Windows devices.[112]
  • In 2009, the social-networking website Myspace used Aster Data Systems's nCluster database for data warehousing, which was built on unmodified PostgreSQL.[113][114]
  • Geni.com uses PostgreSQL for their main genealogy database.[115]
  • OpenStreetMap, a collaborative project to create a free editable map of the world.[116]
  • Afilias, domain registries for .org, .info and others.[117][118]
  • Sony Online multiplayer online games.[119]
  • BASF, shopping platform for their agribusiness portal.[120]
  • Reddit social news website.[121]
  • Skype VoIP application, central business databases.[122]
  • Sun xVM, Sun's virtualization and datacenter automation suite.[123]
  • MusicBrainz, open online music encyclopedia.[124]
  • The International Space Station – to collect telemetry data in orbit and replicate it to the ground.[125]
  • MyYearbook social-networking site.[126]
  • Instagram, a mobile photo-sharing service.[127]
  • Disqus, an online discussion and commenting service.[128]
  • TripAdvisor, travel-information website of mostly user-generated content.[129]
  • Yandex, a Russian internet company, switched its Yandex.Mail service from Oracle to Postgres.[130]
  • Amazon Redshift, part of AWS, a columnar online analytical processing (OLAP) system based on ParAccel's Postgres modifications.
  • National Oceanic and Atmospheric Administration's (NOAA) National Weather Service (NWS), Interactive Forecast Preparation System (IFPS), a system that integrates data from the NEXRAD weather radars, surface, and hydrology systems to build detailed localized forecast models.[118][131]
  • The United Kingdom's national weather service, the Met Office, has begun swapping Oracle for PostgreSQL in a strategy to deploy more open source technology.[131][132]
  • WhitePages.com had been using Oracle and MySQL, but when it came to moving its core directories in-house, it turned to PostgreSQL. Because WhitePages.com needs to combine large sets of data from multiple sources, PostgreSQL's ability to load and index data at high rates was a key to its decision to use PostgreSQL.[118]
  • FlightAware, a flight tracking website.[133]
  • Grofers, an online grocery delivery service.[134]
  • The Guardian migrated from MongoDB to PostgreSQL in 2018.[135]
  • YugabyteDB implements the PostgreSQL query layer as its default SQL mode.
  • OpenAI uses PostgreSQL as part of its primary API service.[136]

Service implementations

Some notable vendors offer PostgreSQL as software as a service:

  • Heroku, a platform as a service provider, has supported PostgreSQL since its start in 2007.[137] It offers value-add features like full database roll-back (the ability to restore a database from any specified time),[138] which is based on WAL-E, open-source software developed by Heroku.[139]
  • In January 2012, EnterpriseDB released a cloud version of both PostgreSQL and their own proprietary Postgres Plus Advanced Server with automated provisioning for failover, replication, load-balancing, and scaling. It runs on Amazon Web Services.[140] Since 2015, Postgres Advanced Server has been offered as ApsaraDB for PPAS, a relational database as a service on Alibaba Cloud.[141]
  • VMware has offered vFabric Postgres (also termed vPostgres[142]) for private clouds on VMware vSphere since May 2012.[143] The company announced End of Availability (EOA) of the product in 2014.[144]
  • In November 2013, Amazon Web Services announced the addition of PostgreSQL to their Relational Database Service offering.[145][146]
  • In November 2016, Amazon Web Services announced the addition of PostgreSQL compatibility to their cloud-native Amazon Aurora managed database offering.[147]
  • In May 2017, Microsoft Azure announced Azure Databases for PostgreSQL.[148]
  • In May 2019, Alibaba Cloud announced PolarDB for PostgreSQL.[149]
  • Jelastic Multicloud Platform as a Service has provided container-based PostgreSQL support since 2011. It also offers automated asynchronous master-slave replication of PostgreSQL.[150]
  • In June 2019, IBM Cloud announced IBM Cloud Hyper Protect DBaaS for PostgreSQL.[151]
  • In September 2020, Crunchy Data announced Crunchy Bridge.[152]
  • In June 2022, Neon.tech announced Neon Serverless Postgres.[153]
  • In October 2023, Nile announced Nile Postgres Platform.[154]

Release history

| Release | First release | Latest minor version | Latest release | End of life[155] | Status | Milestones |
|---|---|---|---|---|---|---|
| 6.0 | 1997-01-29 | | | | | First formal release of PostgreSQL, unique indexes, pg_dumpall utility, ident authentication |
| 6.1 | 1997-06-08 | 6.1.1 | 1997-07-22 | | Old version, no longer maintained | Multicolumn indexes, sequences, money data type, GEQO (GEnetic Query Optimizer) |
| 6.2 | 1997-10-02 | 6.2.1 | 1997-10-17 | | Old version, no longer maintained | JDBC interface, triggers, server programming interface, constraints |
| 6.3 | 1998-03-01 | 6.3.2 | 1998-04-07 | 2003-03-01 | Old version, no longer maintained | SQL-92 subselect ability, PL/pgTCL |
| 6.4 | 1998-10-30 | 6.4.2 | 1998-12-20 | 2003-10-30 | Old version, no longer maintained | VIEWs (then only read-only) and RULEs, PL/pgSQL |
| 6.5 | 1999-06-09 | 6.5.3 | 1999-10-13 | 2004-06-09 | Old version, no longer maintained | MVCC, temporary tables, more SQL statement support (CASE, INTERSECT, and EXCEPT) |
| 7.0 | 2000-05-08 | 7.0.3 | 2000-11-11 | 2004-05-08 | Old version, no longer maintained | Foreign keys, SQL-92 syntax for joins |
| 7.1 | 2001-04-13 | 7.1.3 | 2001-08-15 | 2006-04-13 | Old version, no longer maintained | Write-ahead log, outer joins |
| 7.2 | 2002-02-04 | 7.2.8 | 2005-05-09 | 2007-02-04 | Old version, no longer maintained | PL/Python, OIDs no longer required, internationalization of messages |
| 7.3 | 2002-11-27 | 7.3.21 | 2008-01-07 | 2007-11-27 | Old version, no longer maintained | Schema, table function, prepared query[156] |
| 7.4 | 2003-11-17 | 7.4.30 | 2010-10-04 | 2010-10-01 | Old version, no longer maintained | Optimization on JOINs and data warehouse functions[157] |
| 8.0 | 2005-01-19 | 8.0.26 | 2010-10-04 | 2010-10-01 | Old version, no longer maintained | Native server on Microsoft Windows, savepoints, tablespaces, point-in-time recovery[158] |
| 8.1 | 2005-11-08 | 8.1.23 | 2010-12-16 | 2010-11-08 | Old version, no longer maintained | Performance optimization, two-phase commit, table partitioning, index bitmap scan, shared row locking, roles |
| 8.2 | 2006-12-05 | 8.2.23 | 2011-12-05 | 2011-12-05 | Old version, no longer maintained | Performance optimization, online index builds, advisory locks, warm standby[159] |
| 8.3 | 2008-02-04 | 8.3.23 | 2013-02-07 | 2013-02-07 | Old version, no longer maintained | Heap-only tuples, full text search,[160] SQL/XML, ENUM types, UUID types |
| 8.4 | 2009-07-01 | 8.4.22 | 2014-07-24 | 2014-07-24 | Old version, no longer maintained | Window functions, column-level permissions, parallel database restore, per-database collation, common table expressions and recursive queries[161] |
| 9.0 | 2010-09-20 | 9.0.23 | 2015-10-08 | 2015-10-08 | Old version, no longer maintained | Built-in binary streaming replication, hot standby, in-place upgrade ability, 64-bit Windows[162] |
| 9.1 | 2011-09-12 | 9.1.24 | 2016-10-27 | 2016-10-27 | Old version, no longer maintained | Synchronous replication, per-column collations, unlogged tables, serializable snapshot isolation, writeable common table expressions, SELinux integration, extensions, foreign tables[163] |
| 9.2 | 2012-09-10[164] | 9.2.24 | 2017-11-09 | 2017-11-09 | Old version, no longer maintained | Cascading streaming replication, index-only scans, native JSON support, improved lock management, range types, pg_receivexlog tool, space-partitioned GiST indexes |
| 9.3 | 2013-09-09 | 9.3.25 | 2018-11-08 | 2018-11-08 | Old version, no longer maintained | Custom background workers, data checksums, dedicated JSON operators, LATERAL JOIN, faster pg_dump, new pg_isready server monitoring tool, trigger features, view features, writeable foreign tables, materialized views, replication improvements |
| 9.4 | 2014-12-18 | 9.4.26 | 2020-02-13 | 2020-02-13 | Old version, no longer maintained | JSONB data type, ALTER SYSTEM statement for changing config values, ability to refresh materialized views without blocking reads, dynamic registration/start/stop of background worker processes, Logical Decoding API, GiN index improvements, Linux huge page support, database cache reloading via pg_prewarm, reintroducing Hstore as the column type of choice for document-style data.[165] |
| 9.5 | 2016-01-07 | 9.5.25 | 2021-02-11 | 2021-02-11 | Old version, no longer maintained | UPSERT, row level security, TABLESAMPLE, CUBE/ROLLUP, GROUPING SETS, and new BRIN index[166] |
| 9.6 | 2016-09-29 | 9.6.24 | 2021-11-11 | 2021-11-11 | Old version, no longer maintained | Parallel query support, PostgreSQL foreign data wrapper (FDW) improvements with sort/join pushdown, multiple synchronous standbys, faster vacuuming of large table |
| 10 | 2017-10-05 | 10.23 | 2022-11-10 | 2022-11-10 | Old version, no longer maintained | Logical replication,[167] declarative table partitioning, improved query parallelism |
| 11 | 2018-10-18 | 11.22 | 2023-11-09 | 2023-11-09 | Old version, no longer maintained | Increased robustness and performance for partitioning, transactions supported in stored procedures, enhanced abilities for query parallelism, just-in-time (JIT) compiling for expressions[168][169] |
| 12 | 2019-10-03 | 12.22 | 2024-11-21 | 2024-11-21 | Old version, no longer maintained | Improvements to query performance and space utilization; SQL/JSON path expression support; generated columns; improvements to internationalization, and authentication; new pluggable table storage interface.[170] |
| 13 | 2020-09-24 | 13.18 | 2024-11-21 | 2025-11-13 | Old version, yet still maintained | Space savings and performance gains from de-duplication of B-tree index entries, improved performance for queries that use aggregates or partitioned tables, better query planning when using extended statistics, parallelized vacuuming of indexes, incremental sorting[171][172] |
| 14 | 2021-09-30 | 14.15 | 2024-11-21 | 2026-11-12 | Old version, yet still maintained | Added SQL-standard SEARCH and CYCLE clauses for common table expressions, allow DISTINCT to be added to GROUP BY[173][174] |
| 15 | 2022-10-13 | 15.10 | 2024-11-21 | 2027-11-11 | Old version, yet still maintained | Implements SQL-standard MERGE statement. PL/Python now only supports current Python 3, and plpythonu now means Python 3, no longer the discontinued Python 2. |
| 16 | 2023-09-14 | 16.6 | 2024-11-21 | 2028-11-09 | Old version, yet still maintained | Improvements to logical replication, pg_stat_io view (for I/O metrics)[175] |
| 17 | 2024-09-26 | 17.2 | 2024-11-21 | 2029-11-08 | Current stable version | Performance boosts to the vacuum process, I/O layer, and query execution, expanding JSON functionality, more features to MERGE and improving COPY; enhances logical replication for high availability and upgrades, improvements to security, operations, monitoring, and analysis.[176] |
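
As a rough illustration of how the release and end-of-life columns of the release history can be read together, the following Python sketch maps a full version string to its major release and checks whether that release is still maintained on a given day. The names `END_OF_LIFE`, `major_version` and `is_maintained` are hypothetical and not part of any PostgreSQL tooling; the dates are a subset copied from the release history, and treating a release as unmaintained from its end-of-life date onward is an assumption of this sketch.

```python
from datetime import date

# End-of-life dates copied from the release history (subset only).
END_OF_LIFE = {
    "9.6": date(2021, 11, 11),
    "12": date(2024, 11, 21),
    "13": date(2025, 11, 13),
    "14": date(2026, 11, 12),
    "15": date(2027, 11, 11),
    "16": date(2028, 11, 9),
    "17": date(2029, 11, 8),
}


def major_version(version: str) -> str:
    """Return the major release that a full version string belongs to.

    Releases before 10 are identified by two components (9.6.24 -> 9.6);
    from 10 onward a single number is used (17.2 -> 17).
    """
    parts = version.split(".")
    if int(parts[0]) >= 10:
        return parts[0]
    return ".".join(parts[:2])


def is_maintained(version: str, on: date) -> bool:
    """True if `on` falls before the release's end-of-life date.

    Assumption: the end-of-life date itself already counts as unmaintained.
    Raises KeyError for releases missing from END_OF_LIFE.
    """
    return on < END_OF_LIFE[major_version(version)]
```

For example, `is_maintained("13.18", date(2024, 12, 23))` evaluates to true under these assumptions, while the same check for `"12.22"` evaluates to false, matching the status shown for releases 13 and 12.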

See also

  • Comparison of relational database management systems
  • Database scalability
  • List of databases using MVCC
  • LLVM (llvmjit is the JIT engine used by PostgreSQL)

Further reading

  • Obe, Regina; Hsu, Leo (July 8, 2012). PostgreSQL: Up and Running. O'Reilly. ISBN 978-1-4493-2633-3.
  • Krosing, Hannu; Roybal, Kirk (June 15, 2013). PostgreSQL Server Programming (second ed.). Packt Publishing. ISBN 978-1-84951-698-3.
  • Riggs, Simon; Krosing, Hannu (October 27, 2010). PostgreSQL 9 Administration Cookbook (second ed.). Packt Publishing. ISBN 978-1-84951-028-8.
  • Smith, Greg (October 15, 2010). PostgreSQL 9 High Performance. Packt Publishing. ISBN 978-1-84951-030-1.
  • Gilmore, W. Jason; Treat, Robert (February 27, 2006). Beginning PHP and PostgreSQL 8: From Novice to Professional. Apress. p. 896. ISBN 1-59059-547-5. Archived from the original on July 8, 2009. Retrieved April 28, 2009.
  • Douglas, Korry (August 5, 2005). PostgreSQL (second ed.). Sams. p. 1032. ISBN 0-672-32756-2.
  • Matthew, Neil; Stones, Richard (April 6, 2005). Beginning Databases with PostgreSQL (second ed.). Apress. p. 664. ISBN 1-59059-478-9. Archived from the original on April 9, 2009. Retrieved April 28, 2009.
  • Worsley, John C.; Drake, Joshua D. (January 2002). Practical PostgreSQL. O'Reilly Media. p. 636. ISBN 1-56592-846-6.

External links

  • Official website and wiki
  • A Software Catalog of related projects and products
  • The official Main Source Code Repository (for browsing), and the Developer FAQ
  • The official Reference for PostgreSQL Documentation Authors
  • All official PostgreSQL Source Code Repositories
  • PostgreSQL on GitHub