C12orf50

Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-coding gene that in humans encodes the C12orf50 protein. The accession ID for this gene is NM_152589. C12orf50 is located at 12q21.32, where it covers 55.42 kb, from 88429231 to 88373811 (NCBI 37, August 2010), on the reverse strand. Neighboring genes include RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream of C12orf50 and on the same strand. LOC107984542 and C12orf29 are both downstream; LOC107984542 is on the opposite strand, while C12orf29 is on the same strand. C12orf50 has six isoforms, and this article focuses on isoform X1. The isoform X1 transcript is 1711 nucleotides long and encodes a protein of 414 amino acids (aa).
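
The quoted gene span can be checked from the two coordinates given above. The following is a minimal sketch; the constant and function names are illustrative, and the span is taken as the plain difference between the coordinates (not an inclusive base count), which is an assumption about how the 55.42 kb figure was derived.

```python
# Coordinates quoted in the text (NCBI 37). The gene is on the reverse
# strand, so the first coordinate is numerically larger than the second.
GENE_FIRST = 88_429_231
GENE_LAST = 88_373_811


def span_kb(first: int, last: int) -> float:
    """Return the distance between two coordinates in kilobases.

    Uses the plain difference (assumption); an inclusive base count
    would add one base to the total.
    """
    return abs(first - last) / 1000


if __name__ == "__main__":
    print(f"{span_kb(GENE_FIRST, GENE_LAST):.2f} kb")
```

Taking the absolute difference keeps the helper independent of strand orientation, so it works whether coordinates are listed in ascending or descending order.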

Function
Gene ontology annotations indicate that C12orf50 enables mRNA binding and protein binding. It is also involved in poly(A)+ mRNA export from the nucleus.

Isoforms
The C12orf50 gene has six isoforms.

Gene expression
In an analysis of tissue-specific gene expression, RNA-seq was performed on tissue samples from 95 human individuals representing 27 different tissues in order to determine the tissue specificity of protein-coding genes. The analysis found that expression of C12orf50 is very low in most human tissues, with the exception of the testis, to which its expression was restricted.

Protein
Uncharacterized protein Chromosome 12 Open Reading Frame 50 is a protein in humans, encoded by the C12orf50 gene. The protein accession ID is Q8NA57. The protein is 414 aa long, with a predicted molecular weight of about 47.2 to 47.3 kDa and a predicted isoelectric point of 8.79. It contains a CCCH-type zinc finger domain with a C-X8-C-X5-C-X3-H motif; the domain extends from the start of the protein to the 44th amino acid. The protein also has three disordered regions: amino acids 136 to 168 (33 aa), 297 to 333 (37 aa), and 346 to 414 (69 aa).
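
The C-X8-C-X5-C-X3-H notation reads as a cysteine, any eight residues, a cysteine, any five residues, a cysteine, any three residues, and a histidine. The sketch below shows one way to express that motif as a regular expression and to check the inclusive lengths of the disordered regions. The function names and the toy sequence are hypothetical and used only for illustration; the toy sequence is not the C12orf50 sequence.

```python
import re

# C-X8-C-X5-C-X3-H written as a regular expression over one-letter
# amino acid codes: "X" (any residue) becomes [A-Z].
CCCH_PATTERN = re.compile(r"C[A-Z]{8}C[A-Z]{5}C[A-Z]{3}H")

# Disordered regions quoted in the text, as 1-based inclusive positions.
DISORDERED_REGIONS = [(136, 168), (297, 333), (346, 414)]


def find_ccch(sequence: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) positions of each
    non-overlapping CCCH motif match in a protein sequence."""
    return [(m.start() + 1, m.end()) for m in CCCH_PATTERN.finditer(sequence)]


def region_length(first: int, last: int) -> int:
    """Length of a region given 1-based inclusive end points."""
    return last - first + 1


if __name__ == "__main__":
    # Toy sequence (hypothetical): two leading residues, then one motif.
    toy = "MA" + "C" + "A" * 8 + "C" + "A" * 5 + "C" + "A" * 3 + "H" + "GG"
    print(find_ccch(toy))
    print([region_length(a, b) for a, b in DISORDERED_REGIONS])
```

Positions are reported in 1-based inclusive form to match the residue numbering used in the text, which is why a region from 136 to 168 counts as 33 residues rather than 32.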

Structure
The predicted tertiary structure of C12orf50 has two beta-sheets near the beginning of the protein, within the zinc finger domain, and a helix spanning amino acids 106 to 124. These features are conserved throughout mammalian orthologs. The structure also contains a large number of coiled regions. A negatively charged cluster (acidic domain) spans amino acids 87 to 111, lying before and at the beginning of the helix. The promoter, 5' UTR, and 3' UTR are very well conserved.

Localization
C12orf50 is predicted to localize to the nucleus with a probability of 47.8% and to the cytoplasm with a probability of 30.4%. This was confirmed by immunohistochemistry and immunofluorescence from Sigma-Aldrich, which showed positivity in both the nucleus and the cytoplasm. The protein contains a nuclear localization signal and an acidic domain. The orthologs also confirm that C12orf50 is localized in the nucleus and cytoplasm.

Protein interactions
Two proteins, GAPDHS and GOLGA2, interact with C12orf50. Glyceraldehyde-3-phosphate dehydrogenase, spermatogenic (GAPDHS) is an enzyme that may play an important role in regulating the switch between different energy-producing pathways, and it is required for sperm motility and male fertility.

Post-translational modifications
C12orf50 is predicted to undergo phosphorylation, O-glycosylation, and C-mannosylation. The predicted phosphorylation sites are at amino acids 262, 349, and 370. The predicted O-glycosylation sites are at amino acids 139, 238, and 374. The predicted C-mannosylation sites are at amino acids 13, 102, 292, and 388.

Evolution
C12orf50 evolves at a rate close to that of fibrinogen alpha, which makes it relatively fast-evolving. Orthologs of C12orf50 have been found in mammals, reptiles, birds, and caecilian amphibians. No orthologs were found in frogs, fish, invertebrates, or fungi. The mammalian orthologs are the most similar to the human protein, with the exception of the platypus ortholog, and diverged from humans between 6.4 and 180 million years ago. The reptilian orthologs are the next most similar and diverged around 318 million years ago, at the same time as the birds. The caecilian amphibian orthologs are the least similar and diverged around 351.7 million years ago.

Orthologs
C12orf50 has orthologs in mammals, birds, reptiles, and caecilian amphibians. No orthologs were found in frogs, invertebrates, plants, fungi, or yeast. The table below shows some of the orthologs that can be found using BLAST.

Paralogs
C12orf50 has two paralogs: ZC3H11A and ZC3H11B. The zinc finger domain is conserved in both paralogs.