Google Summer of Code 2021 | Work Report
--
- Student: Mohamed Soudy
- Github: https://github.com/MohmedSoudy
- Organization: National Resource for Network Biology (NRNB)
- Project Title: Combining Multi-Omics for optimization and finding novel peptides in cancer
- Mentor: Thuy Tien Bui, Dr. Mohamed Helmy
Overview
LC/MS is a hyphenated technique, which combines the separating power of High-Performance Liquid Chromatography (HPLC), with the detection power of mass spectrometry. Mass Spectrometry is a wide-ranging analytical technique, which involves the production and subsequent separation and identification of charged species.
The project is developed as a standalone shiny app that requires the Extracted Spectra as Mascot Generic Format (MGF) and the Database as FASTA file.
Graphical User Interface (GUI)
The application performs multiple functionalities as the identification or the regular search of the proteomics database using X! Tandem search engine, Furthermore, the application visualizes and report the results to the user in a handy way including the generation of identified spectra, peptides, proteins, and Post Translational Modifications (PTMs).
The application allows the user to enter a database and generates a database that contains the mutations of each sequence according to the six-frame translation.
Then the application performs an iterative database search to identify both regular peptides from the traditional search and mutated peptides from the search of mutated database that may lead to the identification of novel peptides.
Upload Page
The upload page is designed to retrieve the information regarding the Mass spectrometer output file ex: (MGF, RAW) and the database file in FASTA format the user can also change the search setting using the file default_param.xml with and customize the search parameters.
Search Page
The Search page reports the number of spectra in the input file, then applies the iterative search using the traditional and mutated database and returns the annotated results of spectra, peptides, proteins, and ptms.
Identified Spectra
Identified Peptides
Identified Proteins
Identified PTMs
Visualization Page
The Visualization page shows the annotated spectra of each identified peptide including the MS/MS results reporting the m/z ratio on the x-axis and the intensities on the y-axis.
Remove identification Page
The Remove identification page excludes the identified spectra from the regular database search and writes a new spectral file to be used in the iterative search process for other databases.