All download files including the archive files are now in a publicly accessible Google Storage Bucket. Downloads page links have been updated.

Gene group DB table files

The files within this directory contain data found in the gene group associated tables within our database and are in a comma separated value format. Each value is quoted within double quotes and all have a header line denoting the column titles.

File/tables descriptions

family
Contains the following columns:
  • id: gene group primary key.
  • abbreviation: abbreviated name of the group; usually a common root symbol of the genes within
  • name: group name
  • external_note: HGNC note about the group
  • pubmed_ids: Associated pubmed IDs
  • desc_comment: Description of the group
  • desc_label: Label for the description
  • desc_source: Where the description came from
  • desc_go: The GO term connected to the description
  • typical_gene: Typical member gene of the group
hierarchy
Relationships between groups, step by step. Contains the following columns:
  • parent_fam_id: The group ID of the group above the child (sub) group. Foreign key for family.id
  • child_fam_id: The group ID of the group below the parent (super) group. Foreign key for family.id
hierarchy_closure
Relationships between groups showing the full hierarchical ascyclic graph from a group down and the distance from the super group.
  • parent_fam_id:
  • child_fam_id: The group ID of the group below the super group. Foreign key for family.id
  • distance: How far the child/sub group is from the super group
external_resource
External resources linked to the gene group. Contains the following columns:
  • id: The primary ID for the external resource
  • name: Name of the resource
  • url: The URL of the resource
  • description: A description of the resource
  • approved: Resource uses approved gene symbols and or IDs
family_has_external_resource
A linking many to many table to join the family table to external resource table. Contains the following columns:
  • family_id: Foreign key for the family table
  • ext_id:: Foreign key for the external_resource table
gene_has_family
A linking many to many table to join the family table to HGNC gene data. Contains the following columns:
  • hgnc_id: The HGNC ID for the gene. Foreign key to link to gene tables etc.
  • family_id: The group ID. Foreign key for the family table.

CSV group download files