
Functional Virtual Flow Cytometry: A Visual Analytic Approach for Characterizing Single-Cell Gene Expression Patterns.

1. Background

Single-cell RNA sequencing (scRNA-seq) is becoming a powerful tool for studying heterogeneity and subtypes in cell populations. Many bioinformatics and computational tools have been developed to visualize, cluster, and categorize cells based on their expression profiles [1, 2]. Different algorithmic approaches such as principal component analysis (PCA) or multidimensional scaling (MDS) [3], nonnegative matrix factorization [4], minimum spanning tree (MST) [5, 6], latent variable modeling [7], diffusion map [8, 9], and spline models [10] have all been applied and implemented for such purposes. Moreover, it has been shown that the cells in a population often do not form distinct "clusters." Instead, the cells form a continuous distribution over the space of featured genes and gene signatures [1]. It is therefore of great interest to identify both the distribution patterns (e.g., wishbone pattern and bifurcation) that imply important biological processes such as stem cell differentiation and the gene signatures that can be used to reveal such patterns.

However, this effort often leads to a "chicken-and-egg" situation. Since the patterns may not be readily perceivable from whole genome data, methods such as PCA and MDS may not always be effective. The analysis therefore often ends up as an iterative process with a subjective selection of genes of interest. Another commonly adopted workflow is to first cluster the cells based on their expression profiles and identify "gene signatures" that differentiate the clusters, followed by enrichment analysis on these signature genes for potential biological functions or processes involved in the separation of the cells. Since many genes can be involved in the differential analysis, the functional enrichment signals can be diluted.

In this paper, we propose a visual analytic workflow called functional virtual flow cytometry (FVFC) for identifying functional gene groups that can effectively separate the cells using scRNA-seq data. We specifically take advantage of gene coexpression network analysis (GCNA). GCNA aims to identify modules of genes with similar expression profiles. It has been well known that the coexpressed genes often are functionally or structurally related [11-16]. Therefore, instead of surveying all the genes, by focusing on the coexpressed gene clusters, we can directly study the cells based on functional gene groups with increased statistical power [17].

Our method is innovative in the following ways. First, it focuses on the gene modules with clear functional relationships (coexpression) and thus greatly enhances the statistical power. Secondly, only the gene modules that are "informative" among the single cells are used. Specifically, we focus on the modules that show bimodal or multimodal distributions among the cells to ensure that the genes have the power to separate the cell population. Thirdly, we apply spatial statistical methods to detect combinations of gene modules that lead to interesting spatial patterns or separation of the cells and thus identify the gene signatures associated with the underlying biological processes. Last but not least, instead of developing this workflow as an "algorithm," we implement it as a visual analytic workflow, allowing researchers to interactively select gene modules and cell distribution patterns of interest for further investigation. To this end, we take advantage of the scatter plot matrix (SPLOM) combined with various visual cues derived from spatial statistical calculation. We demonstrate our workflow using two large single-cell studies on brain and cancer, respectively.

2. Methods

2.1. Workflow. Figure 1 outlines the workflow of our approach, which contains three stages. Given a set of processed scRNA-seq data, the first stage carries out the coexpression network analysis and the summarization of each network module into a single "eigengene," as well as enrichment analysis to determine the functional or structural relationships for each module. The second stage analyzes each eigengene to select the ones with more information content, in particular, the bimodal ones. Then scatterplots are generated for every pair of informative eigengenes. The scatterplots are further analyzed using spatial statistical parameters to determine if they form interesting patterns, specifically if there is clustering or clumping in the scatterplot, implying potential relationships between the two gene modules associated with the two eigengenes. In the final stage, the scatterplots are colored based on the spatial statistical parameters, and interesting patterns are further examined for their functional relevance. Overall, this workflow provides an intuitive visual analytic approach for researchers to quickly explore the relationships among functional gene groups in single-cell populations. The details of the steps in the workflow are discussed in the following sections.

2.2. Weighted Gene Coexpression Network Analysis. The first stage in Figure 1 is to carry out gene coexpression network analysis. The detailed workflow for this stage is illustrated in Figure 2. Given a set of M genes and their expression levels over N cells, the gene expression profile can be expressed with a matrix

$$ G = \begin{bmatrix} g_1 \\ g_2 \\ \vdots \\ g_M \end{bmatrix} \in \mathbb{R}^{M \times N}, \tag{1} $$

where the N-dimensional row vector $g_i = [g_{i1}, \ldots, g_{iN}]$ is the expression profile for the ith gene across the samples (i = 1, 2, ..., M). Then the pairwise correlation matrix C can be represented by

$$ C = \begin{bmatrix} c_{11} & \cdots & c_{1M} \\ \vdots & \ddots & \vdots \\ c_{M1} & \cdots & c_{MM} \end{bmatrix}, \tag{2} $$

where $c_{ij}$ is the correlation coefficient between the ith gene vector $g_i$ and the jth gene vector $g_j$. In our experiment, we use Spearman rank correlation coefficients in the pairwise correlation matrix, since RNA-seq data cannot be assumed to follow the Gaussian distribution required by Pearson correlation.
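
The Spearman correlation matrix described above can be sketched as follows. This is a minimal illustration using only NumPy, not the implementation used in the study; the function names `average_ranks` and `spearman_matrix` are hypothetical. It relies on the fact that the Spearman coefficient equals the Pearson coefficient computed on ranks, with tied values given their average rank.

```python
import numpy as np

def average_ranks(x):
    """Return 1-based ranks of x, giving tied values their average rank."""
    x = np.asarray(x, dtype=float)
    order = np.argsort(x, kind="mergesort")
    ranks = np.empty(len(x), dtype=float)
    ranks[order] = np.arange(1, len(x) + 1)
    # Average the ranks within each group of tied values.
    _, inverse, counts = np.unique(x, return_inverse=True, return_counts=True)
    rank_sums = np.bincount(inverse, weights=ranks)
    return rank_sums[inverse] / counts[inverse]

def spearman_matrix(G):
    """Spearman correlation between the rows (genes) of an M x N matrix G.

    Rows with constant expression have undefined correlation (NaN).
    """
    R = np.vstack([average_ranks(row) for row in G])
    return np.corrcoef(R)  # Pearson correlation of the ranks

# Toy example: 3 genes measured over 4 cells.
G = np.array([[1.0, 2.0, 3.0, 4.0],
              [10.0, 20.0, 30.0, 400.0],
              [4.0, 3.0, 2.0, 1.0]])
C = spearman_matrix(G)
```

In the toy example, the first two genes are monotonically related, so their coefficient is 1 even though the relationship is not linear, while the third gene is perfectly anticorrelated with the first.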

After the correlation matrix is computed, we apply a recently developed algorithm called Normalized lmQCM [15]. Compared to the widely adopted gene coexpression network analysis software package WGCNA [18], this algorithm takes a network mining approach that allows overlaps between modules and guarantees a lower bound on the density of the detected modules. The output of lmQCM is a set of gene modules $M_1, M_2, \ldots, M_L$, where each module $M_k$ is composed of a group of $N_k$ coexpressed genes. The number of modules L and the sizes of the modules are determined by the four parameters of the lmQCM algorithm. While the detailed choice of parameters is discussed in [15], the most important parameter is $\gamma$, which is the threshold for the weight of the first edge of any module and thus controls the number of modules. Usually we choose $\gamma$ to ensure that the maximum size of a module is not too large (i.e., less than 500 genes). In addition, we focus on gene modules with at least 10 genes so that meaningful functional enrichment analysis can be applied.

For each gene module detected by lmQCM, [M.sub.k] can be represented by a gene expression matrix. If we want to compare one gene module against another, it is advantageous to take only a representative of that module rather than taking all the genes. We use PCA to reduce the gene module data meaningfully and take the first principal component as a summary of that module. This first principal component is called "eigengene" in this context. Computationally, we take the submatrix of G for [M.sub.k] as

$$ G_k = \begin{bmatrix} g_{k_1} \\ g_{k_2} \\ \vdots \\ g_{k_{N_k}} \end{bmatrix} \in \mathbb{R}^{N_k \times N}. \tag{3} $$

$G_k$ is centered and standardized as $G'_k$ such that each row has zero mean and unit norm. Let $G'_k = U S V^T$ be the singular value decomposition of $G'_k$. Then the first column of $V$ (denoted as $v_1$) is the "eigengene" for $M_k$ up to a sign, since $V$ is an orthonormal matrix whose determinant is 1 or -1. Since the eigengene $w_k$ should reflect the directions of the majority of genes in $G_k$, its projection on the majority of the genes should be positive. Thus, if $\sum \mathrm{sgn}(G_k v_1) < 0$, then $w_k = -v_1$; otherwise, $w_k = v_1$. So each gene module detected by lmQCM corresponds to one "eigengene."
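
The eigengene computation can be sketched with NumPy as below. This is an illustration under stated assumptions rather than the authors' code: the function name `eigengene` is hypothetical, and genes with constant expression (zero norm after centering) are simply left as zero rows, a case the text does not address.

```python
import numpy as np

def eigengene(G_k):
    """Eigengene of a module given its N_k x N expression matrix G_k."""
    G_k = np.asarray(G_k, dtype=float)
    # Center each row (gene) to zero mean and scale it to unit norm.
    centered = G_k - G_k.mean(axis=1, keepdims=True)
    norms = np.linalg.norm(centered, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # assumption: constant genes stay as zero rows
    G_std = centered / norms
    # Rows of Vt are the columns of V, so Vt[0] is the first column v_1.
    _, _, Vt = np.linalg.svd(G_std, full_matrices=False)
    v1 = Vt[0]
    # Flip the sign if most genes project negatively onto v_1.
    if np.sum(np.sign(G_k @ v1)) < 0:
        v1 = -v1
    return v1

# Toy module: three genes following the same trend and one opposing it.
base = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
module = np.vstack([base, 2 * base + 1, 3 * base, -base])
w = eigengene(module)
```

In the toy module, three of the four genes increase across the cells, so the sign rule orients the eigengene `w` to increase as well.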

For the reported modules, enrichment analyses are carried out using NIH DAVID [19] and ToppGene [20].

2.3. Identify Eigengenes with Bimodal or Long Tail Distribution. Before exploring pairwise relationships between gene modules with eigengenes, we identify and keep eigengenes which are "informative," that is, eigengenes whose distribution is bimodal or long tailed. Eigengenes with a unimodal distribution, especially those with a narrow, sharp peak, are therefore filtered out. To differentiate unimodal distributions from bimodal or long tail distributions, metrics such as Kurtosis, second central differences, and likelihood ratio have been adopted [21-23]. Specifically, Kurtosis is a measure of the "tailedness" of the probability distribution of a real-valued random variable [24]. Here we use Kurtosis to detect whether the histogram of a given eigengene has a very narrow, sharp peak. For each eigengene vector $w_k$, first the histogram of the vector is computed and then the Kurtosis of the histogram distribution is computed as

$$ \mathrm{Kurt}(w_k) = \frac{E\left[(w_k - \mu)^4\right]}{\left(E\left[(w_k - \mu)^2\right]\right)^2}, \tag{4} $$

where $\mu$ is the mean of $w_k$. According to [24], Kurtosis values between 3 and 9 indicate peakedness of the distribution, while higher values imply a sharper peak-shaped distribution. In this paper, we set the threshold for Kurtosis as a user-defined parameter. If the Kurtosis value of the histogram for a given eigengene is smaller than the given threshold, the eigengene is kept.
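
A minimal sketch of this filter is given below. As a simplifying assumption, the Kurtosis in (4) is computed directly from the eigengene values rather than from a binned histogram; the function names `kurtosis` and `informative_eigengenes` are hypothetical.

```python
import numpy as np

def kurtosis(w):
    """Kurtosis as in (4): fourth central moment over squared variance."""
    w = np.asarray(w, dtype=float)
    d = w - w.mean()
    return np.mean(d ** 4) / np.mean(d ** 2) ** 2

def informative_eigengenes(eigengenes, threshold):
    """Indices of eigengenes whose Kurtosis is below the threshold."""
    return [k for k, w in enumerate(eigengenes) if kurtosis(w) < threshold]

# A cleanly bimodal vector and a sharply peaked one with a single outlier.
bimodal = np.array([-1.0, -1.0, 1.0, 1.0])
peaked = np.array([0.0] * 99 + [1.0])
kept = informative_eigengenes([bimodal, peaked], threshold=20)
```

With this definition a Gaussian distribution has a Kurtosis of 3 (no excess correction is applied), a two-point bimodal distribution attains the minimum value of 1, and a sharply peaked distribution with rare outliers yields a large value and is filtered out.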

2.4. Spatial Statistical Analysis of the 2D Scatterplot Using the Nearest Neighbor Distribution. In order to find the relationship between two coexpressed gene modules, we generate pairwise scatter plots for all pairs of eigengene vectors in a 2D space. For two given eigengene vectors $e_i = [e_{i1}, e_{i2}, \ldots, e_{iN}]$ and $e_j = [e_{j1}, e_{j2}, \ldots, e_{jN}]$, the scatter plot consists of the points with coordinates $(e_{i1}, e_{j1}), (e_{i2}, e_{j2}), \ldots, (e_{iN}, e_{jN})$ in the 2D space. We then use the nearest neighbor distance (NND) to analyze the pattern. The NND of a data point is the distance to its closest neighbor; it is a spatial statistical parameter that has been used effectively for detecting cell patterns in space [25, 26]. Define $\bar{d}_0$ as the mean NND over all the points. We then run 100 random simulations; in each, the same number of points is generated in the same region covering $(e_{i1}, e_{j1}), (e_{i2}, e_{j2}), \ldots, (e_{iN}, e_{jN})$, and the mean NND is calculated. Letting $\bar{d}_E$ be the mean of the 100 simulated mean NNDs and $\bar{\sigma}$ their standard deviation, the z-score is calculated as

$$ z = \frac{\bar{d}_0 - \bar{d}_E}{\bar{\sigma}}. \tag{5} $$

We call this z-score the clustering index of a scatter plot.
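
The clustering index can be sketched as follows. Two assumptions are made that the text does not specify: the simulated points are drawn uniformly from the axis-aligned bounding box of the observed points, and the sample standard deviation is used. The function names `mean_nnd` and `clustering_index` and the `seed` parameter are hypothetical. The brute-force distance matrix is adequate for a few thousand cells; a spatial index would be preferable for larger data.

```python
import numpy as np

def mean_nnd(points):
    """Mean nearest neighbor distance of an (N, 2) array of points."""
    diff = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)  # a point is not its own neighbor
    return dist.min(axis=1).mean()

def clustering_index(e_i, e_j, n_sim=100, seed=0):
    """z-score of the observed mean NND against random simulations."""
    pts = np.column_stack([e_i, e_j]).astype(float)
    d_obs = mean_nnd(pts)
    rng = np.random.default_rng(seed)
    # Assumption: uniform points in the bounding box of the observed data.
    lo, hi = pts.min(axis=0), pts.max(axis=0)
    sims = np.array([mean_nnd(rng.uniform(lo, hi, size=pts.shape))
                     for _ in range(n_sim)])
    return (d_obs - sims.mean()) / sims.std(ddof=1)

# Toy example: two tight clumps of 50 points each.
rng = np.random.default_rng(1)
clumps = np.vstack([rng.normal(0.0, 0.01, (50, 2)),
                    rng.normal(1.0, 0.01, (50, 2))])
z = clustering_index(clumps[:, 0], clumps[:, 1])
```

A strongly negative index means that the points lie much closer to their nearest neighbors than expected under spatial randomness, that is, the scatter plot is clustered; values near zero indicate no detectable pattern.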

2.5. Layout for Visualization. Once the eigengenes with long tail or bimodal distributions are detected, the SPLOM is generated. Each scatterplot is then colored using a color scale based on the clustering index. The user can then select plots with interesting patterns for further visualization and analysis.

3. Results

3.1. Datasets and Preprocessing. We applied the above analysis to two large single-cell gene expression datasets. One dataset is RNA sequencing data of single cells isolated from the mouse dorsal lateral geniculate nucleus (dLGN) of the thalamus, which was downloaded from the Allen Brain Atlas (ABA) website. This dataset includes 1,772 single cells collected from the dLGN in adult mouse and transcriptionally profiled with RNA sequencing. The dataset contains transcription readings for 45,772 genes and transcripts. However, since many of the genes have zero readings in most cells, these genes were filtered out; specifically, we removed genes with zeros in more than half of the cells. In addition, genes whose mean values are among the lowest 20% and variances are among the lowest 50% were removed. This way, 20,000 genes were retained for further analysis.
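
The filtering steps can be sketched as below. The text leaves the exact combination of the mean and variance criteria open; this sketch assumes that, among the genes passing the zero filter, a gene is removed if its mean is in the lowest 20% or its variance is in the lowest 50%. The function name `filter_genes` and its parameters are hypothetical.

```python
import numpy as np

def filter_genes(X, max_zero_frac=0.5, mean_pct=20, var_pct=50):
    """Return the row indices of genes kept from a genes x cells matrix X."""
    X = np.asarray(X, dtype=float)
    # Step 1: drop genes that are zero in more than half of the cells.
    idx = np.flatnonzero((X == 0).mean(axis=1) <= max_zero_frac)
    sub = X[idx]
    # Step 2 (assumed reading): drop genes in the lowest 20% by mean
    # or in the lowest 50% by variance among the remaining genes.
    means, variances = sub.mean(axis=1), sub.var(axis=1)
    ok = ((means > np.percentile(means, mean_pct)) &
          (variances > np.percentile(variances, var_pct)))
    return idx[ok]

# Toy example: 5 genes measured over 4 cells.
X = np.array([[0, 0, 0, 5],
              [1, 1, 1, 1],
              [2, 4, 2, 4],
              [1, 9, 1, 9],
              [10, 20, 10, 20]])
kept = filter_genes(X)
```

In the toy matrix, the first gene fails the zero filter, the second has the lowest mean and zero variance, and the third falls below the median variance, so only the last two genes are kept.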

The other dataset is from a single-cell study on human glioblastoma. The dataset was downloaded from the NCBI Gene Expression Omnibus (GEO) with accession number GSE57872. It contains transcriptomes from 430 single glioblastoma cells isolated from 5 individual tumors and 102 single cells from gliomasphere cell lines generated using SMART-seq [27]. Using the same preprocessing procedure, 5,948 genes were kept for further analysis in this dataset.

3.2. Analysis of the ABA Mouse Brain scRNA-Seq Data. Using the lmQCM algorithm (with $\gamma$ = 0.75), 60 coexpressed gene modules with at least five genes are identified. Using a threshold of 20 for the Kurtosis metric, seven eigengenes are selected. Table 1 summarizes the information for the seven modules. Figure 3 shows the colored SPLOM for the seven eigengenes.

The color scheme in Figure 3 allows us to further inspect scatterplots with interesting patterns. In order to determine if these patterns are associated with specific annotations, for selected scatter plots, we further overlay the annotation information using different colors. Figure 4 shows examples when the broad subtype information about the neurons is overlaid on the scatter plots as points with different colors. It is apparent that none of the gene modules can thoroughly separate the cells based on the subtypes. Instead, some of them can separate specific subtypes. For instance, as in Figure 4(a), the cells are separated into two major clusters based on the "clustering index" as defined in the previous section, which does not fully reflect the subtypes as the blue and yellow points are not separated. Instead, the blue and yellow points are segmented in Figure 4(b) and even further away in Figure 4(c).

As shown in Figure 4(b), the groups of yellow cells and cyan cells are clearly separated from the remaining groups based on eigengene #4, which is enriched with genes that are important to bladder/pelvic ganglion development and may be involved in gender development too. At the same time, it can be noted that the red group differs from the blue, cyan, and yellow groups based on eigengene #2, which is closely associated with synapse formation. In addition, according to Figure 4(c), the blue and yellow groups are separated when both eigengenes #3 and #4 are involved; eigengene #3 is closely connected with glutamate metabolism and inhibitory synapse development. These neural functions are critical for the interpretation of the cell population clustering.

It is important to notice that the visual outcome is very different from traditional PCA-based visualization. As shown in Figure 5(a), if all the genes are used for visualization of the cells using traditional PCA, there is no clear separation of the cells except for a small group. If we limit the gene features for PCA to those in the gene modules listed in Table 1, we can clearly see three major groups (Figure 5(b)). As a control, we marked the three groups of cells in Figure 5(b) with three different colors, and we can see that these cells are not clearly separated in Figure 5(a). However, without explicit functional grouping, it is difficult to determine which biological processes and functions are involved in such separation.

3.3. Analysis of the Human Glioblastoma Patients' Brain scRNA-Seq Data. Using the lmQCM algorithm (with $\gamma$ = 0.2), 18 coexpressed gene modules with at least five genes are identified. Using a threshold of 5 for the Kurtosis metric, 16 eigengenes are selected.

Figure 6 is the SPLOM for the long tail eigengenes from the brain tumor study.

From the SPLOM, it is notable that the fourth gene module not only has an eigengene with a bimodal distribution but also is involved in effective separation of the cells. When the cells are labeled by patient and sample IDs, it is clear that some of the separation cases are closely related to the differences between tumor samples, as shown in Figure 7. In particular, eigengene #4 is key in separating the cells in the green group from the rest, while other eigengenes can separate other groups (e.g., eigengene #6 separates the yellow cell group from the rest, while eigengene #11 separates the red cell group). Interestingly, enrichment analysis shows that the gene module for eigengene #4 is highly enriched with extracellular matrix genes (14 genes out of 36, p = 6.304e-8) and the cell migration process (10 genes, p = 9.145e-5), suggesting a particular property of the cells in the green group. This is important because the extracellular matrix and the cell migration process are considered critical to the invasion of glioblastoma [28, 29].

4. Discussion and Conclusion

In this work, we presented a workflow for detecting distribution patterns in cell populations based on single-cell transcriptome studies. With the fast adoption of single-cell analysis, a challenge to researchers is how to effectively extract gene features to meaningfully separate the cell population. However, this often ends up in a chicken-and-egg situation, as the separation of the cells often depends on the choice of gene features, yet without a clear pattern it is difficult to determine which gene features are effective. Our workflow uses the well-developed gene coexpression network analysis to take advantage of the fact that coexpressed genes are often functionally or structurally related and that the number of coexpressed modules is much smaller than the number of genes. Thus, when the coexpressed modules are summarized into eigengenes, not only can we quickly explore the distribution of cells interactively, but we can also promptly interpret the gene features and generate new hypotheses.

Since the cells are separated based on different choices of the gene features, we dub the workflow "functional virtual flow cytometry," which achieves separation of the cells based on salient gene features. The separation of cells leads to new hypotheses, such as the involvement of glutamate metabolism in the separation of the brain cells in the Allen Brain scRNA-seq data and the specific glioblastoma sample with a unique cell migration related signature. While for the latter it is unclear whether this observation is indeed biological or due to a batch effect, our workflow quickly pointed out the pattern for deeper examination by researchers.

With the interactive visualization, additional advanced analysis can be carried out. For instance, in both Figures 4(b) and 7(b), an interesting observation is that the x- and y-axes cannot both have low values, suggesting interesting Boolean relationships between the gene groups [30]. Therefore, as our ongoing work, these analytic tools along with the workflow are being implemented in an online single-cell analytics portal.

Abbreviations

SPLOM:      Scatter plot matrix
scRNA-seq:  Single-cell RNA sequencing
PCA:        Principal component analysis
MDS:        Multidimensional scaling
FVFC:       Functional virtual flow cytometry
GCNA:       Gene coexpression network analysis
dLGN:       Dorsal lateral geniculate nucleus
ABA:        Allen Brain Atlas
GEO:        Gene Expression Omnibus.

Conflicts of Interest

The authors declare that they have no conflicts of interest.


Acknowledgments

This work is partially supported by the Human Frontier Science Program (to Kun Huang), the NCI ITCR U01CA188547 (to Kun Huang), and the National Natural Science Foundation of China (61572265 to Zhi Han). The Ohio Supercomputer Center provided computing support.


References

[1] M. Setty, M. D. Tadmor, S. Reich-Zeliger et al., "Wishbone identifies bifurcating developmental trajectories from single-cell data," Nature Biotechnology, vol. 34, no. 6, pp. 637-645, 2016.

[2] S. Krishnaswamy, M. H. Spitzer, M. Mingueneau et al., "Conditional density-based analysis of T cell signaling in single-cell data," Science, vol. 346, no. 6213, Article ID 1250689, 2014.

[3] A. Scialdone, K. N. Natarajan, L. R. Saraiva et al., "Computational assignment of cell-cycle stage from single-cell transcriptome data," Methods, vol. 85, pp. 54-61, 2015.

[4] C. Shao and T. Hofer, "Robust classification of single-cell transcriptome data by nonnegative matrix factorization," Bioinformatics, vol. 33, no. 2, Article ID btw607, pp. 235-242, 2017.

[5] B. Anchang, T. D. P. Hart, S. C. Bendall et al., "Visualization and cellular hierarchy inference of single-cell data using SPADE," Nature Protocols, vol. 11, no. 7, pp. 1264-1279, 2016.

[6] P. Qiu, E. F. Simonds, S. C. Bendall et al., "Extracting a cellular hierarchy from high-dimensional cytometry data with SPADE," Nature Biotechnology, vol. 29, no. 10, pp. 886-893, 2011.

[7] F. Buettner, K. N. Natarajan, F. P. Casale et al., "Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells," Nature Biotechnology, vol. 33, no. 2, pp. 155-160, 2015.

[8] L. Haghverdi, F. Buettner, and F. J. Theis, "Diffusion maps for high-dimensional single-cell analysis of differentiation data," Bioinformatics, vol. 31, no. 18, pp. 2989-2998, 2015.

[9] P. Angerer, L. Haghverdi, M. Büttner, F. J. Theis, C. Marr, and F. Buettner, "Destiny: diffusion maps for large-scale single-cell data in R," Bioinformatics, vol. 32, no. 8, pp. 1241-1243, 2016.

[10] J. A. DiGiuseppe, M. D. Tadmor, and D. Pe'er, "Detection of minimal residual disease in B lymphoblastic leukemia using viSNE," Cytometry Part B - Clinical Cytometry, vol. 88, no. 5, pp. 294-304, 2015.

[11] B. Zhang and S. Horvath, "A general framework for weighted gene co-expression network analysis," Statistical Applications in Genetics and Molecular Biology, vol. 4, article 17, 2005.

[12] A. M. Yip and S. Horvath, "Gene network interconnectedness and the generalized topological overlap measure," BMC Bioinformatics, vol. 8, article 22, 2007.

[13] J. Zhang, K. Lu, Y. Xiang et al., "Weighted frequent gene co-expression network mining to identify genes involved in genome stability," PLoS Computational Biology, vol. 8, no. 8, Article ID e1002656, 2012.

[14] Z. Han, J. Zhang, G. Sun, G. Liu, and K. Huang, "A matrix rank based concordance index for evaluating and detecting conditional specific co-expressed gene modules," BMC Genomics, vol. 17, article no. 519, 2016.

[15] J. Zhang and K. Huang, "Normalized lmQCM: an algorithm for detecting weak quasi-cliques in weighted graph with applications in gene co-expression module discovery in cancers," Cancer Informatics, vol. 1, article 1, p. 137.

[16] Y. Xiang, J. Zhang, and K. Huang, "Mining the tissue-tissue gene co-expression network for tumor microenvironment study and biomarker prediction," BMC Genomics, vol. 14, supplement 5, article S4, 2013.

[17] P. Langfelder and S. Horvath, "Eigengene networks for studying the relationships between co-expression modules," BMC Systems Biology, vol. 1, article 54, 2007.

[18] P. Langfelder and S. Horvath, "WGCNA: an R package for weighted correlation network analysis," BMC Bioinformatics, vol. 9, article 559, 2008.

[19] D. W. Huang, B. T. Sherman, and R. A. Lempicki, "Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources," Nature Protocols, vol. 4, no. 1, pp. 44-57, 2009.

[20] J. Chen, E. E. Bardes, B. J. Aronow, and A. G. Jegga, "ToppGene Suite for gene list enrichment analysis and candidate gene prioritization," Nucleic Acids Research, vol. 37, no. 2, pp. W305-W311, 2009.

[21] A. L. Muratov and O. Y. Gnedin, "Modeling the metallicity distribution of globular clusters," Astrophysical Journal, vol. 718, no. 2, pp. 1266-1288, 2010.

[22] J. B. Haldane, "Simple tests for bimodality and bitangentiality," Annals of Eugenics, vol. 16, no. 1, pp. 359-364, 1951.

[23] H. Holzmann and S. Vollmer, "A likelihood ratio test for bimodality in two-component mixtures with application to regional income distribution in the EU," AStA. Advances in Statistical Analysis, vol. 92, no. 1, pp. 57-69, 2008.

[24] L. T. DeCarlo, "On the meaning and use of kurtosis," Psychological Methods, vol. 2, no. 3, pp. 292-307, 1997.

[25] K. N. Brown, S. Chen, Z. Han et al., "Clonal production and organization of inhibitory interneurons in the neocortex," Science, vol. 334, no. 6055, pp. 480-486, 2011.

[26] H.-T. Xu, Z. Han, P. Gao et al., "Distinct lineage-dependent structural and functional organization of the hippocampus," Cell, vol. 157, no. 7, pp. 1552-1564, 2014.

[27] A. P. Patel, I. Tirosh, J. J. Trombetta et al., "Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma," Science, vol. 344, no. 6190, pp. 1396-1401, 2014.

[28] E. T. Sayegh, G. Kaur, O. Bloch, and A. T. Parsa, "Systematic review of protein biomarkers of invasive behavior in glioblastoma," Molecular Neurobiology, vol. 49, no. 3, pp. 1212-1244, 2014.

[29] S. M. Turaga and J. D. Lathia, "Adhering towards tumorigenicity: altered adhesion mechanisms in glioblastoma cancer stem cells," CNS Oncology, vol. 5, no. 4, pp. 251-259, 2016.

[30] D. Sahoo, D. L. Dill, A. J. Gentles, R. Tibshirani, and S. K. Plevritis, "Boolean implication networks derived from large scale, whole genome microarray datasets," Genome Biology, vol. 9, no. 10, article no. R157, 2008.

Zhi Han, (1,2) Travis Johnson, (2) Jie Zhang, (2,3) Xuan Zhang, (1) and Kun Huang (2)

(1) College of Software, Nankai University, Tianjin, China

(2) Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA

(3) The CCC Biomedical Informatics Shared Resource, The Ohio State University, Columbus, OH, USA

Correspondence should be addressed to Kun Huang;

Received 3 March 2017; Accepted 22 May 2017; Published 17 July 2017

Academic Editor: Ansgar Poetsch

Caption: Figure 1: The workflow of the functional virtual flow cytometry system.

Caption: Figure 2: Workflow for weighted GCNA and eigengene calculation.

Caption: Figure 3: Colored SPLOM for the seven long tail eigengenes from the Allen Brain scRNA-seq data. The subplot in the ith row, jth column of the matrix is a scatter plot of the ith eigengene against the jth eigengene. Along the diagonal are histogram plots of each eigengene.

Caption: Figure 4: Four example scatterplots with broad classes annotated in different colors.

Caption: Figure 5: (a) The 3D plot for the first three principal components using all genes for the cells. (b) The 3D plot for the first three principal components using genes in the gene modules in Table 1 for the cells.

Caption: Figure 6: Colored SPLOM for the long tail eigengenes from the brain tumor study. The subplot in the ith row, jth column of the matrix is a scatter plot of the ith eigengene against the jth eigengene. Along the diagonal are histogram plots of each eigengene.

Caption: Figure 7: (a) The scatter plot between eigengene #4 (x-axis) and eigengene #11 (y-axis). (b) The scatter plot between eigengenes #4 (x-axis) and #6 (y-axis).
Table 1: The seven gene modules whose eigengenes show long tail

Eigengene #   Index   Size   Kurtosis   Enrichment/notes
1             3       38     10.7844    32 predicted genes: three genes are immunoglobulins and two are T cell receptors, acute lymphocytic leukemia (p = 3.157e-7)
2             6       35      5.0379    Ion transport (p = 3.341e-7), synapse (p = 2.590e-7)
3             12      18      8.5550    Glutamate decarboxylation to succinate (p = 7.715e-7), inhibitory synapse (p = 7.843e-7)
4             13      17     19.9492    Development of lower uro neuro e15.5 BladdPelvicGanglion Sox10 top-relative-expression-ranked 1000 (p = 1.227e-7), six genes on chromosome X
5             28      11      4.9068    Hydrogen ion transmembrane transport (p = 4.859e-20), mitochondrial inner membrane (p = 1.533e-16)
6             48      6       3.8686    NADH metabolic process (p = 2.960e-13), myelin sheath (p = 1.643e-3), gluconeogenesis (p = 5.401e-14), genes upregulated in hippocampus at late postnatal stages (p = 9.341e-10)
7             60      5      12.5680    Mostly predicted genes
COPYRIGHT 2017 Hindawi Limited

Title Annotation:Research Article
Author:Han, Zhi; Johnson, Travis; Zhang, Jie; Zhang, Xuan; Huang, Kun
Publication:BioMed Research International
Article Type:Report
Date:Jan 1, 2017