Neurogenomics

Neurogenomics is the study of how the genome of an organism influences the development and function of its nervous system. This field intends to unite functional genomics and neurobiology in order to understand the nervous system as a whole from a genomic perspective.

The nervous system in vertebrates is made up of two major types of cells – neuroglial cells and neurons. Hundreds of different types of neurons exist in humans, with varying functions – some of them process external stimuli; others generate a response to stimuli; others organize in centralized structures (brain, spinal ganglia) that are responsible for cognition, perception, and regulation of motor functions. Neurons in these centralized locations tend to organize in giant networks and communicate extensively with each other. Prior to the availability of expression arrays and DNA sequencing methodologies, researchers sought to understand the cellular behaviour of neurons (including synapse formation and neuronal development and regionalization in the human nervous system) in terms of the underlying molecular biology and biochemistry, without any understanding of the influence of a neuron's genome on its development and behaviour. As our understanding of the genome has expanded, the role of networks of gene interactions in the maintenance of neuronal function and behaviour has garnered interest in the neuroscience research community. Neurogenomics allows scientists to study the nervous system of organisms in the context of these underlying regulatory and transcriptional networks. This approach is distinct from neurogenetics, which emphasizes the role of single genes without a network-interaction context when studying the nervous system.

Advent of high-throughput biology
In 1999, Cirelli & Tononi first reported the association of genome-wide brain gene expression profiling (using microarrays) with a behavioural phenotype in mice. Since then, global brain gene expression data, derived from microarrays, has been aligned to various behavioural quantitative trait loci (QTLs) and reported in several publications. However, microarray based approaches have their own problems that confound analysis – probe saturation can result in very small measurable variance of gene expression between genetically unique individuals, and the presence of single nucleotide polymorphisms (SNPs) can result in hybridization artifacts. Furthermore, due to their probe-based nature, microarrays can miss out on many types of transcripts (ncRNAs, miRNAs, and mRNA isoforms). Probes can also have species-specific binding affinities that can confound comparative analysis.

Notably, the association between behavioural patterns and high penetrance single gene loci falls under the purview of neurogenetics research, wherein the focus is to identify a simple causative relationship between a single, high penetrance gene and an observed function/behaviour. However, it has been shown that several neurological diseases tend to be polygenic, being influenced by multiple different genes and regulatory regions instead of one gene alone. There has hence been a shift from single gene approaches to network approaches for studying neurological development and diseases, a shift that has been greatly propelled by the advent of next generation sequencing methodologies.

Next-generation sequencing approaches
Twin studies have revealed that schizophrenia, bipolar disorder, autism spectrum disorder (ASD), and attention deficit hyperactivity disorder (ADHD) are highly heritable, genetically complex psychiatric disorders. However, linkage studies have largely failed at identifying causative variants for psychiatric disorders such as these, primarily because of their complex genetic architecture. Multiple low penetrance risk variants can be aggregated in affected individuals and families, and sets of causative variants could vary across families. Studies along these lines have determined a polygenic basis for several psychiatric disorders. Several independently occurring de novo mutations in patients Alzheimer's disease have been found to disrupt a shared set of functional pathways involved with neuronal signalling, for example. The quest to understand the causative biology of psychiatric disorders is hence greatly assisted by the ability to analyse entire genomes of affected and unaffected individuals in an unbiased manner.

With the availability of massively parallel next generation sequencing methodologies, scientists have been able to look beyond the probe based captures of expressed genes. RNA-seq, for example, identifies 25-60% more expressed genes than microarrays do. In the upcoming field of neurogenomics, it is hoped that by understanding the genomic profiles of different parts of the brain, we might be able to improve our understanding of how the interactions between genes and pathways influence cellular function and development. This approach is expected to be able to identify the secondary gene networks that are disrupted in neurological disorders, subsequently assisting drug development stratagems for brain diseases. The BRAIN initiative launched in 2013, for example, seeks to "inform the development of future treatments for brain disorders, including Alzheimer's disease, epilepsy, and traumatic brain injury".

Rare variant association studies (RVAS) have highlighted the role of de novo mutations in several congenital and early-childhood-onset disorders like autism. Several of these protein disrupting mutations have been able to be identified only with the aid of whole genome sequencing efforts, and validated with RNA-Seq. Additionally, these mutations are not statistically enriched in individual genes, but rather, exhibit patterns of statistical enrichment in groups of genes associated with networks regulating neurological development and maintenance. Such a discovery would have been impossible with prior gene-centric approaches (neurogenetics, behavioural neuroscience). Neurogenomics allows for a high-throughput system-based approach for understanding the polygenic basis of neuropsychiatric disorders.

Imaging studies and optical mapping
When autism was identified as a distinct biological disorder in the 1980s, researchers found that autistic individuals showed a brain growth abnormality in the cerebellum in their early developmental years. Subsequent research has indicated that 90% of autistic children have a larger brain volume than their peers by 2 to 4 years of age, and show an expansion in the white and gray matter content in the cerebrum. The white and gray matter in the cerebrum is associated with learning and cognition respectively, and the formation of amyloid plaques in the white matter has been associated with Alzheimer's disease. These findings highlighted the influence of structural variance in the brain on psychiatric disorders, and have motivated the use of imaging technologies to map regions of divergence between healthy and diseased brains. Furthermore, while it may not always be possible to retrieve biological specimens from different areas live human brains, neuroimaging techniques offer a noninvasive means to understanding the biological basis of neurological disorders. It is hoped that an understanding of localization patterns of different psychiatric diseases could in turn inform network analysis studies in neurogenomics.

MRI
Structural Magnetic Resonance Imaging (MRI) can be used to identify the structural composition of the brain. Particularly in the context of neurogenomics, MRI has played an extensive role in the study of Alzheimer's disease over the past four decades. It was initially used to rule out other causes of dementia, but recent studies indicated the presence of characteristic changes in patients with Alzheimer's disease. As a result, MRI scans are currently being used as a neuroimaging tool to help identify the temporal and spatial pathophysiology of Alzheimer's disease, such as specific cerebral alterations and amyloid imaging.

The ease and non-invasive nature of MRI scans has motivated research projects that trace the development and onset of psychiatric diseases in the brain. Alzheimer disease has become a key candidate in this topographical approach to psychiatric diseases. For example, MRI scans are currently being used to track the resting and task-dependent functional profiles of brains in children with autosomal dominant Alzheimer disease. These studies have found indications of early onset brain alterations in at-risk individuals for Alzheimer's disease. The Autism Center of Excellence at University of California, San Diego, is also conducting MRI studies with children between 12 and 42 months, in the hopes of characterizing brain development abnormalities in children who present behavioural symptoms of autism.

Additional research has indicated that there are specifics patterns of atrophy in the cerebrum (as a repercussion of neurodegeneration) in different neurological disorders and diseases. These disease-specific patterns of progression of atrophy can be identified with MRI scans, and provide a clinical phenotype context to neurogenomic research. The temporal information about disease progression provided by this approach can also potentially inform the interpretation of gene network-level perturbations in psychiatric diseases.

Optical mapping
One prohibitive feature of 2nd generation sequencing methodologies is the upper limit on the genomic range accessible by mate-pairing. Optical mapping is an emerging methodology used to span large-scale variants that cannot usually be detected using paired end reads. This approach has been successfully applied to detect structural variants in oligodendroglioma, a type of brain cancer. Recent work has also highlighted the versatility of optical maps in improving existing genome assemblies. Chromosomal rearrangements, microdeletions, and large-scale translocations have been associated with impaired neurological and cognitive function, for example in hereditary neuropathy and neurofibromatosis. Optical mapping can significantly improve variant detection and inform gene interaction network models for the diseased state in neurological disorders.

Studying other brain diseases
Apart from neurological disorders, there are additional diseases that manifest in the brain and have formed exemplar use-case scenarios for the application of brain imaging in network analysis. In a classic example of imaging-genomic analyses, a research study in 2012 compared MRI scans and gene expression profiles of 104 glioma patients in order to distinguish treatment outcomes and identify novel targetable genomic pathways in Glioblastoma Multiforme (GBM). Researchers found two distinct groups of patients with significantly different organization of white matter (invasive vs non-invasive). Subsequent pathway analysis of the gene expression data indicated mitochondrial dysfunction as the top canonical pathway in an aggressive, low-mortality GBM phenotype.

Expansion of brain imaging approaches to other diseases can be used to rule out other medical illnesses while diagnosing psychiatric disorders, but cannot be used to inform the presence or absence of a psychiatric disorder.

In humans
The current approaches in collecting gene expression data in human brains are to use either microarrays or RNA-seq. Currently, it is rare to gather "live" brain tissue – only when treatments involve brain surgery is there a chance that brain tissue is collected during the procedure. This is the case with epilepsy.

Currently, gene expression data is usually collected on post mortem brains and this is often a barrier to neurogenomics research in humans. After death, the amount of time between death and when the data from the post mortem brain is collected is known as the post mortem interval (PMI). Since RNA degrades after death, a fresh brain is optimal – but not always available. This in turn can influence a variety of downstream analyses. Consideration should be taken of the following factors when working with 'omics data collected from post-mortem brains: Differential diagnosis also remains a critical pre-analytical confounder of cohort-wide studies of spectrum neurological disorders. Specifically, this has been noted to be a problem for Alzheimer's disease and autism spectrum disorder studies. Furthermore, as our understanding of the diverse symptoms and genomic underpinnings of various neurogenomic disorders improves, the diagnostic criteria itself undergoes rearrangements and review.
 * Ideally, human brains should be controlled for PMIs for a given study.
 * The cause of death is also an important variable to consider in the collection of human brain samples for the purposes of neurogenomics research. For example, brain samples of individuals with clinical depression are often collected after suicide. Certain conditions of death, such as drug overdose or self-inflicted gunshot, will alter the expression of the brain.
 * Another issue with studying gene expression in brains is the cellular heterogeneity of brain tissue samples. Bulk brain samples may vary in proportions of specific cell populations from case to case. This can impact the gene expression signatures and may significantly change differential expression analysis.
 * One approach to address this issue is to use single cell RNA-seq. This would control for a specific cell type. However, this solution is only applicable where studies are not cell-type specific.

Animal models
Ongoing genomics research in neurological disorders tends to use animal models (and corresponding gene homologs) to understand the network interactions underlying a particular disorder due to ethical issues surrounding the retrieval of biological specimens from live human brains. This, too, is not without its roadblocks.

Neurogenomic research with a model organism is contingent on the availability of a fully sequenced and annotated reference genome. Additionally, the RNA profiles (miRNA, ncRNA, mRNA) of the model organism need to be well catalogued, and any inferences applied from them to humans must have a basis in functional/sequence homology.

Zebrafish
Zebrafish development relies on gene networks that are highly conserved among all vertebrates. Additionally, with an extremely well annotated set of 12,000 genes and 1,000 early development mutants that are actually visible in the optically clear zebrafish embryos and larvae, zebrafish offer a sophisticated system for mutagenesis and real-time imaging of developing pathologies. This early development model has been employed to study the nervous system at cellular resolution. The zebrafish model system has already been used to study neuroregeneration and severe polygenic human diseases like cancer and heart disease. Several zebrafish mutants with behavioural variations in response to cocaine and alcohol dosage have been isolated and can also form a basis for studying the pathogenesis of behavioural disorders.

Rodent
Rodent models have been preeminent in studying human disorders. These models have been extensively annotated with gene homologs of several monogenic disorders in humans. Knockout studies of these homologs have led to expansion of our understanding of network interactions of genes in human tissues. For example, the FMR1 gene has been implicated with autism from a number of network studies. Using a knockout of FMR1 in mice creates the model for Fragile X Syndrome, one of the disorders in the Autism spectrum.

Mice xenografts are particularly useful for drug discovery, and were extremely important in the discovery of early anti-psychotic drugs. The development of animal models for complex psychiatric diseases has also improved over the last few years. Rodent models have demonstrated behavioural phenotype changes resembling a positive schizophrenia state, either after genetic manipulation or after treatment with drugs that target the areas of the brain suspected to influence hyperactivity or neurodevelopment. Interest has been generated in identifying the network disruptions mediated by these laboratory manipulations, and collection of genomic data from rodent studies has contributed significantly to a better understanding of the genomics of psychiatric diseases.

The first mouse brain transcriptome was generated in 2008. Since then, extensive work has been done with building social-stress mice models to study the pathway level expression signatures of various psychiatric diseases. A recent paper simulated features of Post Traumatic Stress Disorder (PTSD) in mice, and profiled the entire transcriptome of these mice. The authors found differential regulation in many biological pathways, some of which were implicated in anxiety disorders (hyperactivity, fear response), mood disorders, and impaired cognition. These findings are backed by extensive transcriptomic analyses of anxiety disorders, and expression level changes in biological pathways involved with fear learning and memory are thought to contribute to the behavioural manifestations of these disorders. It is thought that functional enrichment of genes involved in long term synaptic potentiation, depression, and plasticity has an important role to play in the acquisition, consolidation, and maintenance of traumatic memories underlying anxiety disorders.

Experimental mice models for psychiatric disorders
A common approach to using a mouse model is to apply an experimental treatment to a pregnant mouse in order to affect a whole litter. However, a key issue in the field is the treatment of litters in a statistical analysis. Most studies consider the total number of offspring produced as that may lead to an increase in statistical power. However, the correct way is to count by the number of litters and to normalize based on litter size. It was found that several autism studies incorrectly performed their statistical analyses based on total number of offspring instead of number of litters.

Several anxiety disorders such as post-traumatic stress disorder (PTSD) involve heterogeneous changes in several different brain regions, such as the hippocampus, amygdala, and nucleus accumbens. The cellular encoding of traumatic events and the behavioral responses triggered by such events has been shown to lie primarily in changes in signaling molecules associated with synaptic transmission.

Global gene expression profiling of the various gene regions implicated in fear and anxiety processing, using mice models, has led to the identification of temporally and spatially distinct sets of differentially expressed genes. Pathway analysis of these genes has indicated possible roles in neurogenesis and anxiety-related behavioural responses, alongside other functional and phenotypic observations.

Mice models for brain research have contributed significantly to drug development and increased our understanding of the genomic underpinnings of several neurological diseases in the last generation. Chlorpromazine, the first antipsychotic drug (discovered in 1951), was identified as a viable treatment option after it was shown to suppress response to aversive stimuli in rats in a behavioural screen.

Challenges
The modelling and assessment of latent symptoms (thoughts, verbal learning, social interactions, cognitive behaviour) remains a challenge when using model organisms to study psychiatric disorders with a complex genetic pathology. For example, a given genotype+phenotype in a mouse model must imitate the genomic underpinnings of a phenotype observed in a human.

This is a particularly crucial item of consideration in spectrum disorders such as autism. Autism is a disorder whose symptoms can be divided into two categories: (i) deficits of social interactions and (ii) repetitive behaviours and restricted interests. Since mice tend to be more social creatures amongst all members of the order Rodentia currently being used as model organisms, mice are generally used to model human psychiatric disorders as closely as possible. Particularly for autism, the following work-arounds are currently in place to emulate human behavioural symptoms: In any of these experiments, the 'autistic' mice have a 'normal' socializing partner and the scientists observing the mice are unaware ("blind") to the genotypes of the mice.
 * For the first diagnostic category of impaired social behaviour, mice are subject to a social assay intended to represent typical autistic social deficits. Normal social behaviour for mice includes sniffing, following, physical contact and allogrooming. Vocal communication could be used as well.
 * There are a number of ways the second diagnostic category can be observed in mice. Examples of repetitive behaviours can include excessive circling, self-grooming and excessive digging. Usually these behaviours would be performed consistently within a long measurement of time (i.e. self-grooming for 10 minutes).
 * While repetitive behaviours are easily observable, it is difficult to characterize actual restricted interests of mice. One aspect of restricted interests of autistic individuals is the "insistence of sameness"—the concept that autistic individuals require their environment to remain consistent. If that environment should change, the individual would experience stress and anxiety. There has been reported success in confirming a mouse model of autism by changing the mouse's environment.

Gene expression in the brain
The gene expression profile of the central nervous system (CNS) is unique. Eighty percent of all human genes are expressed in the brain; 5,000 of these genes are solely expressed in the CNS. The human brain has the highest amount of gene expression of all studied mammalian brains. In comparison, tissues outside of the brain will have more similar expression levels in comparison to their mammalian counterparts. One source of the increased expression levels in the human brain is from the non-protein coding region of the genome. Numerous studies have indicated that the human brain have a higher level of expression in regulatory regions in comparison to other mammalian brains. There is also notable enrichment for more alternative splicing events in the human brain.

Spatial differences
Gene expression profiles also vary within specific regions of the brain. A microarray study showed that the transcriptome profile of the CNS clusters together based on region. A different study characterized the regulation of gene expression across 10 different regions based on their eQTL signals. The cause of the varying expression profiles relates to function, neuron migration and cellular heterogeneity of the region. Even the three layers of the cerebral cortex have distinct expression profiles.

A study completed at Harvard Medical School in 2014 was able to identify developmental lineages stemming from single base neuronal mutations. The researchers sequenced 36 neurons from the cerebral cortex of three normal individuals, and found that highly expressed genes, and neural associated genes, were significantly enriched for single-neuron SNVs. These SNVs, in turn, were found to be correlated with chromatin markers of transcription from fetal brain.

Development patterns in humans
Gene expression of the brain changes throughout the different phases of life. The most significant levels of expression are found during early development, with the rate of gene expression being highest during fetal development. This results from the rapid growth of neurons in the embryo. Neurons at this stage are undergoing neuronal differentiation, cell proliferation, migration events and dendritic and synaptic development. Gene expression patterns shift closer towards specialized functional profiles during embryonic development, however, certain developmental steps are still ongoing at parturition. Consequently, gene expression profiles of the two brain hemispheres appear asymmetrical at birth. At birth, gene expression profiles appear asymmetrical between brain hemispheres. As development continues, the gene expression profiles become similar between the hemispheres. Given a healthy adult, expression profiles stay relatively consistent from the late twenties into the late forties. From the fifties onwards, there is significant decrease in the expression of genes important for regular function. Despite this, there is an increase in the diversity of genes being expressed across the brain. This age related change in expression may be correlated with GC content. At later stages of life, there is an increase in the induction of low GC-content pivotal genes as well as an increase in the repression of high GC-content pivotal genes. Another cause of the shift in gene diversity is the accumulation of mutations and DNA damage. Gene expression studies show that genes that accrue these age-related mutations are consistent between individuals in the aging population. Genes that are highly expressed at development decrease significantly at late stages in life, whereas genes that are highly repressed at development increase significantly at the late stages.

Evolution of the mammalian brain
The evolution of Homo sapiens since the divergence from the primate common ancestor has shown a marked expansion in the size and complexity of the brain, especially in the cerebral cortex. In comparison to primates, the human cerebral cortex has a larger surface area but differs only slightly in thickness. Many large scale studies in understanding the differences of the human brain from other species have indicated expansion of gene families and changes in alternative splicing to be responsible for the corollary increase in cognitive capabilities and cooperative behaviour in humans. However, we are yet to determine the exact phenotypic consequences of all these changes. One difficulty is that only primates have developed subdivisions in their cerebral cortex, making the modeling of human specific neurological problems difficult to mimic in rodents.

Sequence data is used to understand the evolutionary genetic changes which led to the development of the human CNS. We can then understand how the neurological phenotypes differ between species. Comparative genomics entails comparison of sequence data across a phylogeny to pinpoint the genotypic changes that occur within specific lineages, and understand how these changes might have arisen. The increase in high quality mammalian reference sequences generally makes comparative analysis better as it increases statistical power. However, the increase in number of species in a phylogeny does risk adding unnecessary noise as the alignments of the orthologous sequences usually decrease in quality. Furthermore, different classes of species will have significant differences in their phenotypes.

Despite this, comparative genomics has allowed us to connect the genetic changes found in a phylogeny to specific pathways. In order to determine this, lineages are tested for the functional changes that accrue over time. This is often measured as a ratio of nonsynonymous substitutions to synonymous substitutions or the dN/dS ratio (sometimes, further abbreviated to ω). When the dN/dS ratio is greater than 1, this indicates positive selection. A dN/dS ratio equal to 1 is evidence of no selective pressures. A dN/dS ratio less than 1 indicates negative selection. For example, the conserved regions of the genome will generally have a dN/dS ratio of less than 1 since any changes to those positions will likely be detrimental. Of the genes expressed in the human brain, it is estimated that 342 of them have a dN/dS ratio greater than 1 in the human lineage in comparison to other primate lineages. This indicates positive selection on the human lineage for brain phenotypes. Understanding the significance of the positive selection is generally the next step. For example, ASPM, CDK5RAP2 and NIN are genes that are positively selected for on the human lineage and have been directly correlated with brain size. This finding may help elucidate why human brains are larger than other mammalian brains.

Network level expression differences between species
It is thought that gene expression changes, being the ultimate response for any genetic changes, are a good proxy for understanding phenotypic differences within biological samples. Comparative studies have revealed a range of differences in the transcriptional controls between primates and rodents. For example, the gene CNTNAP2 is specifically enriched for in the prefrontal cortex. The mouse homolog of CNTNAP2 is not expressed in the mouse brain. CNTNAP2 has been implicated in cognitive functions of language as well as neurodevelopmental disorders such as Autism Spectrum Disorder. This suggests that the control of expression plays a significant role in the development in unique human cognitive function. As a consequence, a number of studies have investigated the brain specific enhancers. Transcription factors such as SOX5 have been found to be positively selected for on the human lineage. Gene expression studies in humans, chimpanzees and rhesus macaques, have identified human specific co-expression networks, and an elevation in gene expression in the human cortex in comparison to primates.

Disorders
Neurogenomic disorders manifest themselves as neurological disorders with a complex genetic architecture and a non-Mendelian-like pattern of inheritance. Some examples of these disorders include Bipolar disorder and Schizophrenia. Several genes may be involved in the manifestation of the disorder, and mutations in such disorders are generally rare and de novo. Hence it becomes extremely unlikely to observe the same (potentially causative) variant in two unrelated individuals affected with the same neurogenomic disorder. Ongoing research has implicated several de novo exonic variations and structural variations in Autism Spectrum Disorder (ASD), for example. The allelic spectrum of the rare and common variants in neurogenomic disorders therefore necessitates a need for large cohort studies in order to effectively exclude low effect variants and identify the overarching pathways frequently mutated in the different disorders, rather than specific genes and specific high penetrance mutations.

Whole genome sequencing (WGS) and whole exome sequencing (WES) has been used in Genome Wide Association Studies (GWAS) to characterize genetic variants associated with neurogenomic disorders. However, the impact of these variants cannot always be verified because of the non-Mendelian inheritance patterns observed in several of these disorders. Another prohibitive feature in network analysis is the lack of large-scale datasets for many psychiatric (neurogenomic) diseases. Since several diseases with neurogenomic underpinnings tend to have a polygenic basis, several nonspecific, rare, and partially penetrant de novo mutations in different patients can contribute to the same observed range of phenotypes, as is the case with Autism Spectrum Disorder and schizophrenia. Extensive research in alcohol dependence has also highlighted the need for high-quality genomic profiling of large sample sets when studying polygenic, spectrum disorders.

The 1000 Genomes Project was a successful demonstration of how a concerted effort to acquire representative genomic data from the broad spectrum of humans can result in identification of actionable biological insights for different diseases. However, a large-scale initiative like this is still lacking in the field of neurogenomic disorders specifically.

Modelling psychiatric disorders in neurogenomics research – issues
One major GWAS study identified 13 new risk loci for schizophrenia. Studying the impact of these candidates would ideally demonstrate a schizophrenia phenotype in animal models, which is usually difficult to observe due to its manifestation as a latent personality. This approach is able to determine the molecular impact the candidate gene. Ideally the candidate genes would have a neurological impact, which in turn would suggest that it plays a role in the neurological disorder. For example, in the aforementioned schizophrenia GWAS study, Ripke and colleagues determined that these candidate genes were all involved in calcium signalling. Alternatively, one can study these variants in model organisms in the context of affected neurological function. It is important to note that the high penetrance variants of these disorders tend to be de novo mutations.

A further complication to studying neurogenomic disorders is the heterogeneous nature of the disorder. In many of these disorders, the mutations observed from case to case do not stay consistent. In autism, an affected individual may experience a large amount of deleterious mutations in gene X. A different affected individual may not have any significant mutations on gene X but have a large amount of mutations in gene Y. The alternative is to determine if gene X and gene Y impact the same biochemical pathway—one that influences a neurological function. A bioinformatics network analysis is one approach to this problem. Network analyses methodologies provide a generalized, systems overview of a molecular pathway.

One final complication to consider is the comorbidity of neurogenomic genes. Several disorders, especially at the more severe ends of the spectrum tend to be comorbid with each other. For example, more severe cases of ASD tend to be associated with intellectual disability (ID). This raises the question of whether or not there are true, unique ASD genes and unique ID genes or if there are just genes just associated with neurological function that can be mutated into an abnormal phenotype. One confounding factor may be the actual diagnostic category and methods of the spectrum disorders as symptoms between severe disorders may be similar. One study investigated the comorbid symptoms between groups of ID and ASD, and found no significant difference between the symptoms of ID children, ASD children with ID and ASD children without ID. Future research may help establish a more stringent genetic basis for the diagnoses of these disorders.

Network analysis
The main goal of network analysis in neurogenomics is to identify statistically significant nonrandom associations between genes that contain risk variants. While several algorithm implementations of this approach already exist, the general steps for network analysis remain the same.
 * The analytical process starts out with the identification of a biological network based on experimental validation. This can be a gene co-expression network, or a protein-protein interaction (PPI) network. The nodes of the network will be clustered.
 * Subsequently, a specific list of genes with known associations to a particular phenotype of interest is generated. This list could be determined by experimental data, agnostic of genetic studies in psychiatric disorders. This is referred to as a 'hit list'.
 * Genes that belong to the hit list as well as the biological network selected in the first step are marked as such.
 * This is followed by a guilt-by-association (GBA) step. This means that clusters within the biological network that have a significant amount of genes from the hit list are investigated further using functional enrichment tools and database querying for the pathways in which these high scoring cluster genes participate
 * Thus the biological associations of the high-scoring, experimentally implicated cluster members are investigated, expanding the search area from beyond the initial hit list to include gene members of additional pathways that may have significant association with the initial biological network under consideration. This results in a set of candidate genes.

The underlying principle of this approach is that the genes that cluster together, will also jointly affect the same molecular pathway. Again, they would ideally be part of a neurological function. The candidate genes can then be used to prioritize variants for wet lab validation.

Neuropharmacology
Historically, due to the behavioural stimulation manifested as a symptom in several the neurogenomic disorders, the therapies would rely mostly on anti-psychotics or antidepressants. These classes of medications would suppress common symptoms of the disorders, but with questionable efficacy. The biggest barrier to neruopharmacogenomic research was the cohort sizes. Given newly available large-cohort sequencing data, there has been a recent push to expand therapeutic options. The heterogenous nature of neurological diseases is the key motivation for personalized medicine approaches to their therapies. It is rare to find single high penetrance causative genes in neurological diseases. The genomic profiles understandably vary between cases, and logically, the therapies would need to vary between cases. Further complicating the issue is that many of these disorders are spectrum disorders. Their genetic etiology will vary within this spectrum. For example, severe ASD is associated with high penetrance de novo mutations. Milder forms of ASD is usually associated with a mixture of common variants.

The key issue then is the translation of these newly identified genetic variants (from Copy Number Variant studies, candidate gene sequencing and high throughput sequencing technologies) into an intervention for patients with neurogenomic disorders. One aspect will be if the neurological disorder are medically actionable (i.e. is there a simple metabolic pathway that a therapy can target). For example, specific cases of ASD have been associated with microdeletions on TMLHE gene. This gene codes for the enzyme of carnitine biosynthesis. Supplements to elevate carnitine levels appeared to alleviate certain ASD symptoms but the study was confounded by many influencing factors. As mentioned earlier, using a gene network approach will help identify relevant pathways of interest. Many neuropharmacogenomic approaches have focused on targeting the downstream products of these pathways.

Blood brain barrier
Studies in animal models for several brain diseases has shown that the blood brain barrier (BBB) undergoes modification at many levels; for example, the surface glycoprotein composition can influence the types of HIV-1 strains transported by the BBB. The BBB has been found to be key in the onset of Alzheimer's disease. It is extremely difficult, however, to be able to study this in humans due to obvious restrictions with accessing the brain and retrieving biological specimens for sequencing or morphological analysis. Mice models of the BBB and models of disease states have served well in conceptualizing the BBB as a regulatory interface between disease and good health in the brain.

Personalized neurobiology
The heterogenous nature of neurological diseases is the key motivation for personalized medicine approaches to their therapies. Genomic samples of individual patients could be used to identify predictive factors, or to better understand the specific prognosis of a neurogenomic disease, and use this information to guide treatment options. While there is a clear clinical utility to this approach, the adaptation of this approach is still nonexistent.

There are various issues prohibiting the application of personalized genomics to the assessment, diagnosis, and treatment of psychiatric disorders.
 * Firstly, the causative network biology of several spectrum disorders with neurogenomic underpinnings is not fully understood yet, in spite of extensive studies conducted with disorders like Autism Spectrum and schizophrenia. Thus, the analytical validity of standing hypotheses concerning the etiology of neurogenomic disorders has still not been fully established and is subject to debate and controversy.
 * The clinical validity of genetic variants that have shown to be highly correlated with specific neurogenomic disorders is often a major cause of concern. The interpretation of these test results, and subsequent decision making, are a complicated undertaking given the polygenic nature of many of these disorders. Complicating things further, it has been shown that pre-emptive intervention in major psychiatric disorders does not always reduce the risk for the disorder. Such intervention might not even be available for at-risk offspring of affected adults, thereby limiting the 'medical actionability' of the data.
 * Ethical concerns have also been raised regarding the safeguarding of personal genomic information, and how best to approach the burden of incidental findings and family risk assessment.
 * Consanguinity and in-breeding can lead to selective enrichment of rare, otherwise low penetrance genetic mutations attributed to various symptoms of neurogenomic disorders. Thus, the interpretation of family-specific genetic mutations and/or network-level disruptions in the onset of a rare psychiatric disorder requires careful consideration of the motivations of participants included in the study.
 * That said, these issues can be addressed by effective education and counseling, and collection of genomic data from patients with psychiatric disorders should not be disqualified solely on this basis. The data itself serves as a dynamic health resource and can significantly further our understanding of the genomic basis of several psychiatric disorders.