We are proud to announce that there are now over 300 million sequences in GQ-Pat, including 256 million nucleotide sequences and over 45 million protein sequences. And these protein sequences aren't just automated translations of nuceotides like TrEMBL. All of these sequences are in fact found in patents and patent applications from patent authorities around the world.
To put this accomplishment in perspective, when the Human Genome Project formally began in 1990, there were fewer than 40,000 sequences in GenBank, before being transferred from Stanford to the newly created National Center for Biotechnology Information (NCBI).
As of the summer of 2015, according to the NCBI's GenBank Statistics page, there are 185 million nucleotide sequences in GenBank/EMBL/DDBJ consortium, the world's gold standard for sequence databases.* GenBank represents merely slightly more than half of GQ-Pat!