CRACD-like protein

CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.

Gene
CRACDL is a protein-coding gene. The table above presents the gene's alias, location, size and accession number.

mRNA
There are 11 splice isoforms of the CRACDL. The validated isoform has 10 exons.

Protein
The table above presents the protein's alias, size, and accession number. The CRACD-L protein is proline rich and asparagine, isoleucine, phenylalanine, and tyrosine poor.

Domains and motifs


The CRACD-L protein has one domain called the DUF4592 motif and spans amino acids 131–239. This domain is highly conserved among the CRACDL orthologs. The DUF4592 motif is depicted in both the conceptual translation and schematic figures.

Post translational modifications
CRACDL is phosphorylated at the Ser92 and Ser490 amino acids. The KIAA1211L protein is also predicted to have five different SUMOylation sites located at Lys134, Lys375, Lys866, Lys874, and Lys914. Both the phosphorylated sites and the SUMOylation sites are depicted in the conceptual translation and schematic figures.

Secondary structure
The CRACD-L protein predicted secondary structure is composed of 50% alpha helixes, 8.9% beta sheets, and 17.9% turns. The high number of turns is consistent with the fact that CRACD-L is proline rich.

Subcellular location
The CRACD-L protein is predicted to be located in the nucleus. The orthologs, including the elephant shark, horse, rock dove, and chimp, are also predicted to be located in the nucleus. The nuclear location signal is located on amino acids 25-43 which is depicted in both the conceptual translation and schematic figures. . This signal is conserved throughout the orthologs. Additionally, this location (amino acids 24-43) is positively charged, probably due to the high amount of lysine at this location. Finally, it is predicted that CRACD-L is mainly localized to the microtubules and centrosome and sometimes localized to the cytokinetic bridge.

Expression
The gene is highly expressed in the cerebral cortex of the brain. The CRACD-L protein is located in many different tissue types, including the brain, the hippocampus, the lung, breast carcinoma, the islets of Langerhans, the pancreas, the kidney, and 38 other tissues. Additionally, it is expressed an average amount compared to other human proteins.

Regulation of transcription
The promoter region of CRACDL is approximately 1340 base pairs with various predicted transcription factors. The glial cells missing homolog 1 and the oligodendrocyte lineage transcription factors are notable because CRACDL is highly expressed in the brain. Furthermore, the Estrogen-related receptor alpha is also a notable transcription factor due to CRACDL's low expression levels when estrogen receptors are knocked down. Furthermore, CRACDL is predicted to be SUMOylated. The 3' UTR of CRACDL is predicted to be a targeted by miRNA-132, which is depicted in the conceptual translation figure.

Interacting proteins
Glycogen Synthase Kinase 3 Beta (GSK3B)

GSK3B is a protein kinase that regulates transcription factors and microtubules. As such, it phosphorylates proteins, decreasing their ability to bind and stabilize microtubules. The proteins it phosphorylates are the principle components of neurofibrillary tangles in Alzheimer disease. The protein is needed for the establishment of neuronal polarity and axon outgrowth and phosphorylates proteins in neuroblastoma cells. Furthermore, it is associated with bipolar disease and is active in breast cancer cells.

As such, the predicted interaction between CRACDL and GSK3B is likely because CRACDL is highly expressed in the brain, associated with bipolar disorder and breast cancer, and is localized on the microtubules. The interaction between GSK3B and CRACDL was predicted using anti bait coimmunoprecipitation, pull down, tandem affinity purification, fluorescence polarization spectroscopy, protein kinases assay, two hybrid, and confocal microscopy experiments.

CRACD-L protein is also predicted to interact with Alpha-synuclein (SNCA), E3 Ubiquitin-Protein Ligase Mdm2 (MDM2), Serine/Threonine-Protein Kinase PAK 1 (PAK 1), and DNA Replication Factor Cdt1 (CDT1).

Clinical significance
CRACDL is associated with depression, bipolar disorder, and schizophrenia. Additionally, CRACDL is associated with various cancers including ovarian, breast, etc.

Paralogs
KIAA1211 is the paralog to KIAA1211L. KIAA1211 is located on chromosome 4 and has 1233 amino acids. Its percent identity to KIAA1211L is 21%. The KIAA1211 has an ortholog in the bacteria Proteus vulgarism, indicating the paralog duplicated 4290 million years ago, before KIAA1211L.

Orthologs
Below is the table of various KIAA1211L orthologs. It includes closely, intermediately, and distantly related orthologs. The most distant ortholog is the elephant shark, indicating KIAA1211L duplicated 473 MYA. The amino acids conserved among all the KIAA1211L orthologs are depicted in the conceptual translation.

Phylogeny
The CRACDL gene is similar and conserved in mammals, birds, reptiles, amphibians, and fish. It is not conserved in bacteria, archaea, protists, plants, fungus, trichoplax, and invertebrates.