All download files including the archive files are now in a publicly accessible Google Storage Bucket. Downloads page links have been updated.

Statistics & download files

Statistics
Locus Group Total by Locus Group Locus Type Total by Locus Type
protein-coding gene 19268 text file json custom gene with protein product 19268 text file json custom
gene with protein product 19268 text file json custom
non-coding RNA 9317 text file json custom RNA, Y 4 text file json custom
RNA, Y 4 text file json custom
RNA, cluster 119 text file json custom
RNA, long non-coding 5979 text file json custom
RNA, micro 1912 text file json custom
RNA, misc 29 text file json custom
RNA, ribosomal 60 text file json custom
RNA, small nuclear 51 text file json custom
RNA, small nucleolar 568 text file json custom
RNA, transfer 591 text file json custom
RNA, vault 4 text file json custom
pseudogene 14454 text file json custom T cell receptor pseudogene 37 text file json custom
T cell receptor pseudogene 37 text file json custom
immunoglobulin pseudogene 203 text file json custom
pseudogene 14214 text file json custom
other 996 text file json custom T cell receptor gene 200 text file json custom
T cell receptor gene 200 text file json custom
complex locus constituent 69 text file json custom
endogenous retrovirus 110 text file json custom
fragile site 116 text file json custom
immunoglobulin gene 229 text file json custom
readthrough 150 text file json custom
region 46 text file json custom
unknown 68 text file json custom
virus integration site 8 text file json custom
Total Approved Symbols 44035 text file json custom
Alternative Loci Statistics
Locus Group Total by Locus Group Locus Type Total by Locus Type
protein-coding gene 23 text file json custom gene with protein product 23 text file json custom
gene with protein product 23 text file json custom
non-coding RNA 1 text file json custom RNA, long non-coding 1 text file json custom
RNA, long non-coding 1 text file json custom
pseudogene 17 text file json custom T cell receptor pseudogene 1 text file json custom
T cell receptor pseudogene 1 text file json custom
pseudogene 16 text file json custom
other 7 text file json custom T cell receptor gene 5 text file json custom
T cell receptor gene 5 text file json custom
immunoglobulin gene 1 text file json custom
unknown 1 text file json custom
Total Approved Symbols 48 text file json custom

Last updated: 01/04/25 11:42:41

Semantic web downloads

What is a Web Ontology Language (OWL)?
The W3C Web Ontology Language (OWL) is a Semantic Web language designed to represent rich and complex knowledge about things, groups of things, and relations between things. OWL is a computational logic-based language such that knowledge expressed in OWL can be exploited by computer programs, e.g., to verify the consistency of that knowledge or to make implicit knowledge explicit. OWL documents, known as ontologies, can be published in the World Wide Web and may refer to or be referred from other OWL ontologies.
Where can I find an OWL file for the HGNC?

Our friends at SciBite have created an OWL file for us. You can find the file at https://storage.googleapis.com/public-download-files/hgnc/owl/owl/hgnc.owl.

This HGNC OWL file contains all genes in HGNC organised in a shallow hierarchy, classified by their locus type and gene group. The ontology contains approved gene symbol, approved gene name, previous names and symbols and mappings to external databases. Please contact help@scibite.com if you have any questions.

How do I search/download HGNC data using the semantic web?
You need to either use the URL of our hgnc.owl or download the file, and use the file or URL in a tool such as Stanford University's Protégé. To learn how to use Protégé check out their short training course.

Complete dataset download links

Archived HGNC datasets

The HGNC archive the complete HGNC dataset and withdrawn files (both tab separated and JSON formats) each month and each quarter. View our HGNC data archive help page for more information about the files and the archive itself.

Help & information

A comprehensive list of all the columns contained within the download files, can be found in the Statistics & downloads help page.

Data release policy

No restrictions are imposed on access to, or use of, the data provided by the HGNC, which are provided to enhance knowledge and encourage progress in the scientific community. The HGNC provides these data in good faith, but make no warranty, express or implied, nor assume any legal liability or responsibility for any purpose for which they are used.