Autophagy-related protein 101

Autophagy-related protein 101 also known as ATG101 is a protein that in humans is encoded by the C12orf44 gene (chromosome 12 open reading frame 44).

Autophagy is the process of sequestering target proteins, organelles, aggregates, and other cytoplasmic species inside large membrane-bound vesicles and delivering them to lysosomes for degradation. The ATG101 protein is localized in the cytoplasm, but can possibly also be found bound to a structure known as a phagophore, involved in the initial steps of autophagy. The gene is highly conserved among mammals, as well as showing conservation among most eukaryotes. It is thought to directly interact with ATG13 in the ULK1 complex, which may be important for activating phagophores.

Function
ATG101 is one of dozens of diverse proteins named for their involvement in autophagy, a process well conserved among most eukaryotic organisms. However, ATG101 is not homologous to any of the other ATG proteins. ATG101 interacts with essential autophagy protein ATG13 in mammals, which is an ULK1-interacting protein. ULK1 (unc-51-like kinase 1) is thought to be important in the activation of macroautophagy in mammals. ATG101 is suggested to protect ATG13 from proteasomal degradation, thereby stabilizing levels of ATG13 found in cells and regulating levels of macroautophagy. According to published papers, ATG101 is said to localize to the isolation membrane, also known as the phagophore. This attachment to the phagophore could possibly explain the palmitoylation site found on the third amino acid, which is a cysteine. Phagophores are responsible for recognizing and surrounding materials that are to be degraded in the lysosome.

Protein properties
The ATG101 protein contains an domain DUF1649 (domain of unknown function 1649).

ATG101 is highly conserved among most vertebrates, including animals such as mice, whose common ancestor with humans was thought to have diverged between 65 and 85 million years ago. This may be evidence that ATG101 is significant in a way that it must remain conserved structurally in order to retain function. Only 2 missense SNPs are known in the DNA sequence for ATG101, showing that no other versions of this protein exist, also suggesting its structural importance.

In this conceptual translation, the palmitoylation site on the third amino acid can be seen, as well as a possible phosphorylation site shown surrounded by brackets. The amino acids are aligned above the coding region of the mRNA, which is also numbered on the sides. In ATG101, this protein is valine-rich.

Gene expression
No concrete data is present establishing ATG101 in any particular area. By looking at multiple expression sets in both humans and mice, it has been hypothesized to be ubiquitously expressed in tissues.

Gene neighborhood
Codes for a general receptor protein that is relatively unknown, located on the same strand as C12orf44 as seen below, spans 8924 bp. This gene encodes nerve growth factor IB that is a member of the steroid-thyroid hormone-retinoid receptor superfamily. Expression is induced by phytohemagglutinin in human lymphocytes and by serum stimulation of arrested fibroblasts. The encoded protein acts as a nuclear transcription factor. Translocation of the protein from the nucleus to mitochondria induces apoptosis. Location is on the same strand as C12orf44, spans 8097 bp. Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. Located after C12orf44 on the chromosome, it lies on the same strand and is 946 bp in length. This gene encodes an epithelial keratin that is weakly expressed in the tongue. It is located on the opposite strand to C12orf44 and is 23,005 bp long.
 * GRASP GRP1 (general receptor for phosphoinositides 1)-associated scaffold protein -
 * NR4A1 (nuclear receptor subfamily 4, group A, member 1) -
 * OR7E47P (olfactory receptor, family 7, subfamily E, member 47 pseudogene) -
 * KRT80 (keratin 80) -