How to use Streamlit to convert misidentified date terms from Excel to updated gene names

A web tool, Gene Updater, made by Streamlit to help preserve scientific data integrity and reproducibility

Kuan Rong Chan, Ph.D.
Omics Diary

--

Gene Updater is a web tool made by Streamlit, which helps convert Excel auto-formatted terms back into updated gene names that are more resilient to autoconversion. For more details, users can read the publication in Scientific Reports

When gene expression data is opened in Excel, some of the gene symbols will be autoconverted into date terms. For instance, “SEPT1” (Septin 1) becomes “SEP-01” in Excel, which can affect downstream pathway analysis as many of these databases rely on gene symbols to detect for pathway enrichment. To circumvent this limitation, we used Streamlit to create a web tool called Gene Updater that allows users to convert the date terms to the updated gene terms recommended by HUGO, which are more resilient to autoconversion by Excel. The webpage is hosted at: https://share.streamlit.io/kuanrongchan/date-to-gene-converter/main/date_gene_tool.py.

Users may visit our GitHub address to download the date_gene_tool.py file to understand the Python codes that allows construction of such a web tool. To run the web tool locally, users can download all the required Python packages and files at the designated GitHub address or at https://zenodo.org/record/6845701#.YtvCYiX0rDs. After directing the terminal commands to the location where the downloaded files are, users can simply type the following command in…

--

--

Kuan Rong Chan, Ph.D.
Omics Diary

Kuan Rong Chan, PhD, Senior Principal Research Scientist in Duke-NUS Medical School. Virologist | Data Scientist | Loves mahjong | Website: kuanrongchan.com