ALS Compute
Amyotrophic Lateral Sclerosis (ALS) is a fatal neurodegenerative disorder characterized by the progressive loss of brain and spinal cord motor neurons. Half of ALS patients display cognitive symptoms of frontotemporal dementia (FTD); reciprocally, about 40% of FTD patients show motor neuron deficits, and approximately 15% develop overt ALS. The clinical overlap between ALS and FTD means that the two conditions are thought to represent a disease spectrum (ALS/FTD).
In recent years, the identification of several genetic causes of ALS/FTD has contributed significantly to our understanding of disease pathogenesis. Unfortunately, one-third of the underlying genetic causes of familial cases and ~90% of sporadic cases of ALS/FTD remain unexplained. As such, there is a dire need to identify additional genetic factors contributing to ALS/FTD. Such studies require huge cohorts of harmonized whole genome sequencing (WGS) data sets from cases and controls.
Currently, there are several major ongoing sequencing efforts for ALS patients. Numerous centers lead to inefficiency, especially in terms of overall costs. At a minimum, this includes the cost of high-performance computing, the storage of large data files, and the duplication of effort between groups. This lack of data harmonization between groups precludes sharing of genetic information and weakens collaborative efforts. The cost and logistics are also a barrier to attracting talented investigators to the ALS/FTD field.
To overcome this unmet need, we have founded the ALS Compute project. We are centralizing the storage of ALS/FTD WGS data from every significant sequencing effort in the United States and beyond within a single Cloud environment. This approach will facilitate data harmonization and improve accessibility to the data. To accomplish this, we have made the data and the computational infrastructure available via the Terra platform hosted by NHGRI's Genomic Data Science Analysis, Visualization, and Informatics Lab-Space (AnVIL). This will allow researchers worldwide to access this wealth of data, develop new theories of the disease, and yield breakthroughs in our understanding of ALS/FTD.
- Type: Case-Control
- Archiver: The database of Genotypes and Phenotypes (dbGaP)
