User:Egpolar

User:Egpolar/sandbox From Wikipedia, the free encyclopedia Jump to navigationJump to search

Contents 1	C10ORF25 (Chromosome 10 open reading frame 25) 1.1	Gene 1.2	RNA 1.2.1	Expression 1.3	Protein 1.3.1	Cellular Localization 1.3.2	Domains and Motifs 1.3.3	Post-Translational Modifications 1.3.4	Structure 1.4	Orthologs 1.5	Function 1.5.1	Interacting proteins 1.6	Clinical Significance 1.7	References C10ORF25 (Chromosome 10 open reading frame 25) Chromosome 10 open reading frame 25 (c10orf25) is a protein which in humans is encoded by C10orf25 gene.

Gene

Figure 1: Image of Chromosome 10 showing the location of C10orf25. The red line indicates the exact location of the gene, seen on the long arm of the chromosome.[1] C10orf25 is 668 base pairs long and it encodes a protein that is 122 amino acids long.[2]The gene is found on the long arm of chromosome 10, and its specific location is 10q11.21.[3]

The C10orf25 gene contains about 12 different gt-at intron regions. The gene is unique in having 5 alternatively spliced variants and 3 unspliced forms. The gene also consists of 3 non-overlapping alternative last exons and 3 alternative promoters. There are also 7 validated alternative polyadenylation sites throughout the gene.[4] The protein coding sequence consists of 6 exon sequences.

RNA The alias of C10orf25 is known as ZNF22-AS1 which is 8,972 base pairs long.[1] The mRNA transcript is 668 base pairs long to encode a protein that is 122 amino acids long.

Figure 3: Conceptual Translation of human C10orf25 with labeled domains, motifs, post-translational modifications and conserved amino acids are recognized in bold.

Figure 2: Schematic of C10orf25 and its promoter sequence. The dark green region represents the promoter region. The dark brown regions are the 5’ UTR and 3’UTR regions, whereas the light brown regions are the exons. The black lines represent the introns. The arrow represents the transcriptional start site. Expression Gene C10orf25 has been found to be highly expressed in the appendix, fat cells and is moderately expressed in the kidney, thyroid, and nervous system tissues ike the brain and cerebellum.[3] The gene has also shown significant prevalence in expression in the testis, lungs, and liver.[5] Small significances in expression have also been discovered in adipose tissue, ovary, the placenta, hypothalamus, and spleen.[5]

Protein The encoded protein C10orf25 weighs 14.4kDa. The isoelectric point of the protein is predicted to be 10.26. The protein is composed of mostly proline which is a neutral amino acid. The next most common amino acids serine, arginine, and leucine. Serine is a neutral amino acid, arginine is positively charged, and leucine is nonpolar. According to the KR-ED analysis, the protein is more positively charged.[6]

Cellular Localization This protein is predicted to be located in the extracellular region.[7] DeepLoc predicts C10orf25 to be localized in the mitochondria and membrane. The top localization likelihood scores in cell were for the mitochondrion at 0.4413 and the nucleus score at 0.1209.[8] Data from PSORT II shows the protein is localized in the membrane, so it is highly possible it is also localized in the membrane and has a function there.[9] C10orf25 is a secretory protein in the mitochondria and the membrane.

Domains and Motifs The C-terminus is located on the inside of the protein.

Figure 4: Human C10orf25 protein is depicted in light blue, the mitochondrial localization sequence is highlighted in red, the cleavage signal peptide is highlighted in yellow, the conserved phosphorylation sites are denoted by yellow circles, the conserved O-glycosylation sites[10] are denoted by green squares, and the conserved N-glycosylation sites[11] are denoted by pink rectangles. Made by IBS Cuckoo. Post-Translational Modifications The cleavage signal peptide is between amino acids 1-28 and the cleavage peptide is between amino acids 28 and 29.[12]

NetPhos predicts C10orf25 undergoes phosphorylation post-translation at 21 different sites on the protein by 9 different kinases and 9 sites by unspecified kinases. The amino acid with the highest level of phosphorylation is serine. The bars that go over the line of threshold is due to multiple kinases that target the same serine, threonine, or tyrosine.[13]

Structure The protein composed of 5 beta sheets and 6 alpha helices. AlphaFold predicts the structure of C10orf25 to be of low to medium confidence. The protein structure is also highly conserved.[14]

Figure 5: 3D predicted structure fold of C10orf25 from AlphaFold Orthologs Orthologs for protein C10orf25 are found in mammals only, mostly in primates and marine mammals. There were no orthologs present in Marsupials, Monotremes, Birds, Reptilia, Amphibia, Fish, or Plants. Down below is the ortholog chart that shows the gives the genus and species, the common name, taxon, percent similarity, and percent identity.[3]

C10orf25	Genus and Species	Common Name	Taxon	Similarity (%)	Identity (%)	Accesion # Mammalia	Homo Sapiens	Human	primates	100	100 Mammalia	Pan troglodytes	Chimpanzee	primates	97.5	97.54	XP_016773654.1 Mammalia	Pongo abelii	Sumatran orangutan	primates	62.1	60.4	XP_024109826.1 Mammalia	Nomascus leucogenys	Northern white-cheeked gibbon	primates	82.9	81.4	XP_030655093.1 Mammalia	Colobus angolensis palliatus	Angola colobus	primates	87.2	83.25	XP_0118909174.1 Mammalia	Macaca mulatta	Indochinese rhesus macaque	primates	84	80.8	XP_015002402.1 Mammalia	Macaca nemestrina	Southern pig-tailed macaque	primates	83.2	80	XP_024650102.1 Mammalia	Pilicolobus tephrosceles	Ugandan red colobus	primates	88.9	84.92	XP_023054351.3 Mammalia	Mandrillus leucophaeus	Drill	primates	87.9	86.29	XP_011833969.1 Mammalia	Theropithecus gelada	Gelada	primates	82.4	80	XP_025253169.1 Mammalia	Aotus nancymaae	Nancy Ma's night monkey	primates	74	70.08	XP_021527937.1 Mammalia	Microcebus murinus	Gray mouse lemur	primates	49.6	47.37	XP_020136089.1 Mammalia	Propithecus coquereli	Coquerel's sifaka	primates	47.2	40.7	XP_012509356.1 Mammalia	Balaenoptera acutorostrata scammoni	Minke whale	Artiodactyla	23.9	56.9	XP_028020796.1 Mammalia	Balaenoptera acutorostrata scammoni	Minke whale	Cetacea	23.9	56.9	XP_028020796.1

Figure 7: Phylogeny tree showing the lines of evolutionary descent of C10orf25 made using Phylogeny.

Figure 6: Date of Divergence of C10orf25 compared to the evolutionary rate of Cytochrome C and Fibrinogen Alpha Chain.

The C10orf25 protein is estimated to have first appeared in Minke Whales 94 million years ago. This was the only marine mammal the protein was found in. The gene only appears vertebrates and evolved from Minke Whales to primates. The gene family for C10orf25 is relatively small and is a moderately early diverging lineage.[15]

There are no paralogs for C10orf25.[3]

C10orf25 is evolving moderately slowly compared to reference sequences cytochrome C and fibrinogen alpha chain.The percent similarity and identity for cytochrome C sequence and Fibrinogen Alpha Chain sequence were calculated in EMBOSS Needle against the sequences for C10orf25. Overall, this graph shows the mutation rate of C10orf25 in comparison to mutation rates of cytochrome c and fibrinogen alpha chain. From the graph below we can see that C10orf25 is evolving relatively quickly compared to fibrinogen and cytochrome C.

Function The C10orf25 protein plays a role in gene regulation and expression in the nuclei and is secreted in the mitochondria.

Interacting proteins The C10orf25 protein interacts with two histone proteins: HIST1H1C and HIST3H3.[16] HIST3H3 is is a core component of nucleosomes. HIST1H1C is a histone H1 protein that binds to linker DNA between nucleosomes forming the macromolecular structure known as the chromatin fiber.[17] It also interacts with a Wnt Ligand Secretion Mediator protein, WLS [6][18]. WLS plays a key role in the regulation of subcellular location, expression, binding and organelle-specific association of Wnt proteins. It regulates Wnt protein sorting and secretions in a regulatory feedback mechanism. WLS is associated with diseases like Zaki Syndrome and Volkmann Contracture. [19]

C10orf25 protein also interacts with COG1212 between amino acids 71-115 [20], which is the only protein that is a part of protein superfamily cl42799, in Conserved Protein Domain Family KdsB.[21] COG1212 is an acid synthetase/

Clinical Significance The C10orf25 gene has been tested and confirmed associated with Alzheimer’s Disease.4 A study performed a chromosome-10 specific association study to Alzheimer’s including 1,412 SNPs to identify genes that leave people susceptible for late-onset Alzheimer disease. C10orf25 showed a hit for the sample done in the UK. The marker found for the gene is rs2297492.[22]

Another study discovered that C10orf25 in a GWAS study evaluating unique and shared genes associated with cognitive deficits had a p-value score of 0.00017. The gene only showed one domain in the GWAS. The polygenic risk score analyses of the GWAS hits showed a significant negative correlation for verbal learning and memory, which is what gene C10orf25 is associated with. The study was able to conclude that negative cognitive domain scores are associated to a higher schizophrenia genetic risk.[23]

A study, Lysine-specific demethylase 1 depletion effect on neuroblastoma cell line experimented on the effect of lysine demethylase 1 depletion on a neuroblastoma cell line The study found that when the cell line was depleted of LSD1, expression of C10orf25 was higher than the control cell line that was not depleted of LSD1.[24]

The study, tested the effect of sodium butyrate, chemotherapy, and a combination of the two on Raji Burkitt’s lymphoma cells. The expression of C10orf25 in R.B. lymphoma cells was much higher than the control when being treated with sodium butyrate. When treated with only chemotherapy, the expression of C10orf25 was less than when only treated with sodim butyrate. When the R.B. lymphoma cell line was treated with both sodium butyrate and chemotherapy, the expression of C10orf25 was much higher than the latter results.[25]

The study, Antiretroviral therapy effect on brain of patients with HIV-associated neurocognitive disorders evaluated the effect of antiretroviral therapy on patients with HIV-associated neurocognitive disorders. The results determined both overexpression and underexpression of C10orf25 in these cells in both the uninfected control and effected, in which both had groups that received the therapy and another that was untreated. There was more underexpression of C10orf25 in the group with the HIV disease that received the antiretroviral therapy, this indicates the possible correlation of C10orf25 playing a role in HIV-associated neurocognitive disorders.[26]

References [Gene Cards entry on ZNF22-AS1 Gene https://www.genecards.org/cgi-bin/carddisp.pl?gene=ZNF22-AS1&keywords=c10orf25 "ZNF22-AS1 Gene"]. Gene Cards. : Check |url= value (help) "Homo sapiens chromosome 10 open reading frame 25, mRNA (cDNA clone MGC:163267 IMAGE: 40146426), complete cds". NCBI (National Center for Biotechnology Information). "Chromosome 10 open reading frame 25 [Homo sapiens]". NCBI (National Center for Biotechnology Information). "C10orf25". AceView. "GDS426 / 86187_at". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16. "EBI Tools: Job not available". www.ebi.ac.uk. Retrieved 2022-12-16. "EMBOSS: pepwheel". www.bioinformatics.nl. Retrieved 2022-12-15. "C10orf25". DeepLoc. "C10orf25". PSORT II. "NetOGlyc-4.0 - redirect". www.cbs.dtu.dk. Retrieved 2022-12-16. "Services". https://www.healthtech.dtu.dk. Retrieved 2022-12-16. : External link in |website= (help) "C10orf25". BioGrid4.4. "Services". https://www.healthtech.dtu.dk. Retrieved 2022-12-16. : External link in |website= (help) "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2022-12-16. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-16. Fasci, Domenico; Ingen, Hugo van; Scheltema, Richard A.; Heck, Albert J. R. (2018-10-01). "Histone Interaction Landscapes Visualized by Crosslinking Mass Spectrometry in Intact Cell Nuclei". Molecular & Cellular Proteomics. 17 (10): 2018–2033. doi:10.1074/mcp.RA118.000924. ISSN 1535-9476. PMC 6166682. PMID 30021884. "GPS-Prot Navigator". gpsprot.org. Retrieved 2022-12-16. "C10orf25 protein (human) - STRING interaction network". string-db.org. Retrieved 2022-12-16. "WLS Gene - GeneCards | WLS Protein | WLS Antibody". www.genecards.org. Retrieved 2022-12-16. "Error". www.genome.jp. Retrieved 2022-12-16. "CDD Conserved Protein Domain Family: KdsB". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16. Grupe, Andrew; Li, Yonghong; Rowland, Charles; Nowotny, Petra; Hinrichs, Anthony L.; Smemo, Scott; Kauwe, John S. K.; Maxwell, Taylor J.; Cherny, Sara; Doil, Lisa; Tacey, Kristina; Luchene, Ryan van; Myers, Amanda; Vrièze, Fabienne Wavrant-De; Kaleem, Mona (2006-01-01). "A Scan of Chromosome 10 Identifies a Novel Locus Showing Strong Association with Late-Onset Alzheimer Disease". The American Journal of Human Genetics. 78 (1): 78–88. doi:10.1086/498851. ISSN 0002-9297. PMC 1380225. PMID 16385451. Nakahara, Soichiro; Medland, Sarah; Turner, Jessica A.; Calhoun, Vince D.; Lim, Kelvin O.; Mueller, Bryon A.; Bustillo, Juan R.; O'Leary, Daniel S.; Vaidya, Jatin G.; McEwen, Sarah; Voyvodic, James; Belger, Aysenil; Mathalon, Daniel H.; Ford, Judith M.; Guffanti, Guia (2018-11-01). "Polygenic risk score, genome-wide association, and gene set analyses of cognitive domain deficits in schizophrenia". Schizophrenia Research. 201: 393–399. doi:10.1016/j.schres.2018.05.041. ISSN 0920-9964. www.ncbi.nlm.nih.gov https://www.ncbi.nlm.nih.gov/geo/tools/profileGraph.cgi?ID=GDS5281:1552422. Retrieved 2022-12-16. : Missing or empty |title= (help) "GDS5386 / 4649". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16. www.ncbi.nlm.nih.gov https://www.ncbi.nlm.nih.gov/geo/tools/profileGraph.cgi?ID=GDS4231:1552422. Retrieved 2022-12-16. : Missing or empty |title= (help)