Text Mining Gene Selection to Understand Pathological Phenotype Using Biological Big Data

Christophe Desterke; Hans-Kristian Lorenzo; Jean-Jacques  Candelier

doi:10.36255/exonpublications.bioinformatics.2021.ch1

PDF HTML XML

Published: Mar 20, 2021

DOI: https://doi.org/10.36255/exonpublications.bioinformatics.2021.ch1

Keywords:

focal segmental glomerulosclerosis, hydatidiform mole, text mining, transcriptome, web interface

Christophe Desterke

University Paris-Saclay, UFR Medicine, France

Hans-Kristian Lorenzo

University Paris-Saclay, UFR Medicine, France

Jean-Jacques Candelier

Hospital P.Brousse, bâtiment Lavoisier, 14 avenue P.V.Couturier, 94800 Villejuif, France

ABSTRACT

Whole transcriptome omics experiments allow for the study of gene regulation at the cellular level. During analysis and interpretation of omics data, false discovery can occur. To minimize false discovery and identify true significant cases, multi-test correction has been introduced to bioinformatics algorithms. The scientific literature offers a huge collection of information that can be parsed using a web Application Programming Interface. Gene selection by text mining can rank information according to its importance while taking into account the most recent updates in scientific literature. The integration of text mining selection in biological big data, such as transcriptome experiments including single cell transcriptome, can achieve an important dimensional reduction of the data without any statistical hypothesis. This avoids false discoveries regarding the molecules of interest. Hydatidiform moles and focal segmental glomerulosclerosis (FSGS) nephropathy are the two examples presented in this chapter, which demonstrate the considerable value of these analytical methods to prove the concept. The best FSGS markers expressed can be displayed by building an interactive online web interface as a web resource based on the glomerular cell transcriptome. This chapter shows the value of integrating text mining with omics data analysis to discover specific molecules and determine their locations and functions associated with complex diseases.

Downloads

Download data is not yet available.

Book

Bioinformatics

Section

Chapter 1

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Article Sidebar

Main Article Content

Downloads

Article Details