CFAP97D2

CFAP97D2
CFAP97D2 (Cilia and Flagella Associated Protein 97 Domain containing 2) is a protein that in humans is encoded by the CFAP97D2 gene.

Gene
Homo sapiens CFAP97D2 gene (XM_017020910.2) is a 68,395 base pair gene that encodes mRNA transcripts ranging from 952 nucleotides to 1841 nucleotides. It is located on Chromosome 13 and found at locus 13q34 on the plus strand. CFAP97D2 is on Chromosome 13. This gene encodes 166 amino acids that make up Cilia and Flagella Associated Protein 97 containing Domain 2 (CFAP97D2) protein. The CFAP97D2 gene is also known as C17orf105 Homolog gene.

Isoforms
There are 5 mRNA transcripts produced from the CFAP97D2 gene. These transcripts encode 5 different CFAP97D2 isoforms: X1, X2, X3, 1, and 2. Isoform X1 is 166 AA, the longest isoform of Homo sapiens CFAP97D2. Isoform X2 has 2 deletirious mRNA mutations at the end of exon 5 accounting for a single amino acid deletion. The final protein transcript is a 165 amino acids. Isoform X3 has numerous mRNA insertions in Exons 3, 4, and 5. The final protein length is 101 AA. Isoforms 1 and  2 have complete deleterious mRNA mutations of Exon 4 and encode final protein lengths of 99 and 98 AA respectively.

Expression
RNA sequencing revealed CFAP97D2 expression in the brain, lungs, pancreas, testis, fallopian tubes, and cervix.

Primary Structure
Homo sapiens CFAP97D2 Isoform X1 is a basic protein with a predicted PI of 10.4 and Mw 19.3 kaD. There is nuclear leucine zipper motif (AA #37-52) and nuclear localization signal (AA #102-116).

Secondary Structure
By definition, the leucine zipper region of Homo sapiens CFAP97D2 (AA #37-52) is an alpha helix.

Tertiary Structure
CFAP97D2 is characterized by coiled-coiled regions and 2 alpha helices

CFAP97 Gene Family
The CFAP97 gene family contains 3 genes: CFAP97, CFAP97D1, and CFAP97D2 and is characterized by the KIAA1430 gene domain. CFAP97D1 has longer evolutionary conservation than CFAP97D2 with an estimated date of divergence 431 MYA.

Strict Orthologs
CFAP97D2 is a highly conserved protein found in primates, rodents, bats, even-toed ungulates, otarlidae, birds, reptiles, and bony fish. Primate and rodent CFAP97D2 proteins are most recently related (% identity range: 74-100%). Bats, even-toed ungulates, otariidae, and birds are moderately related to Homo sapiens CFAP97D2 (% identity range: 55.9-73%) and reptile and body fish species are most distantly related (% identity range: 25.6-42%). The asiactic toad is an outlier as a reptile with 60.4% identity with Homo sapiens CFAP97D2.

Paralogs and Distant Homologs
CFAP97D1 is conserved in both invertebrate and vertebrate species. The CFAP97D1 genes found in vertebrate species are paralogs of CFAP97D2. The invertebrate CFAP97D1 genes existing prior to 431 MYA are distant CFAP97D2 homologs due to these paralogs' shared evolutionary history.

Evolution
CFAP97D1 has longer evolutionary conservation than CFAP97D2. CFAP97D1's slow and consistent conservation is evidenced by its little divergence from its ortholog ancestors over 600 million years while CFAP97D2 has changed rapidly over 431 million years. Present only in vertebrate species, CFAP97D2's rapid evolution indicates that it is under different selective pressures than CFAP97D1 and therefore serves in a different functional capacity.