Deep Learning / NLP techniques in Healthcare for decision making

Kamal Jain
Analytics Vidhya
Published in
9 min readSep 10, 2020

--

Introduction

The ubiquitous adoption of electronic health records in hospitals and other healthcare facilities generates vast real-world information, which is very valuable for conducting clinical research. Over the past many years, electronic health records (EHR) systems are been widely adopted across clinics and hospitals. Analysis of this huge data is the foundation to provide improved healthcare to the patients However, manual review of this vast amount of data generated from multiple sources is costly and very time-consuming. It brings huge challenges while attempting to review this data meaningfully. Hence, the role of artificial intelligence (AI) techniques are becoming important in enhancing clinical research and care. As a large number of EHR are locked in clinical narratives, Natural Language Processing (NLP) and Deep Learning (DL) techniques have been leveraged to extract information. While computer vision techniques can be used for medical imaging, NLP can be used for analysing unstructured information in the EHR, reinforcement learning techniques can be used in the context of robotics-assisted operations. NLP algorithms can be used to identify clinically relevant phenotypes while analysing text and determining the grammatical relationships between phrases. Rule-based NLP techniques can be used to get high sensitivity (identification of a large proportion of true cases) and high positive predictive value in clinical records.

Healthcare is one of the domains where computer science is becoming very supportive of varied tasks. AI is increasingly being adopted across the healthcare industry ranging from basic level practices to specialization, and lots of the foremost exciting AI applications leverage language processing (NLP). The capabilities of these AI techniques potentially enable the identification of distinctive clinical characteristics among patients which further helps in clinical care and minimizing methodological heterogeneity in medical research concerning various health diseases.

Abstract

Day by day the health care domain is generating millions of records of patients in a structured and unstructured way. By applying deep learning techniques, it can be converted into a well-structured form i.e. Electronic Health Record (EHR). Decision making is one of the key parts in the Healthcare domain. In the decision making process doctors must refer to many data like laboratory reports, diagnosis reports, medical images, demographic information about the patient, clinical notes, and map it collectively with the concepts of medical science. Here AI especially Natural Language Processing and deep learning can be helpful in many ways.

The objective of this article is to depict the importance of Deep Learning and Natural Language processing EHRs. Based on EHR, the doctor can take quick decisions in case of an emergency. Apart from that, it can also be effective in clinical predictions, detect disease at an earlier stage, forecasting future need of regular check-ups, predictions of hospitalization soon if required. The provision of an enormous amount of clinical information particularly EHR has stimulated the expansion of deep learning techniques that assist within the rapid analysis of patient data.

What is Natural Language Processing?

Natural language processing (NLP) focuses on analyzing text and speech to infer meaning from words. Recurrent Neural Networks (RNNs) — deep learning algorithms play a key role in processing sequential inputs like language, speech, and time-series data. (Andre Esteva et al., 2019). Deep learning is a subset of machine learning having the capacity of learning unsupervised data from unstructured or unlabeled data. On the other side deep learning will be able to learn optimal features from available data without human intervention.

Natural language processing is used to describe the process of using computer algorithms to identify key elements in everyday language and extract meaning from unstructured spoken or written input. NLP requires skills in artificial intelligence, computational linguistics, and other machine learning disciplines. (Jennifer Bresnick et al., 2019)

There are two approaches to NLP.

1) A rule-based approach where computer follows predefined rules in the program

2) Machine Learning-based approach where we can have supervised and unsupervised learning methods. In supervised learning, the computer learns latent rules through human guidance called annotation while in unsupervised learning there is no human interaction.

NLP algorithms, first extract information or concepts from EHRs, then process extracted information, and finally classify patients into a subgroup as per rules and learners. Procedures for NLP are complex because it consists of multiple techniques together. NLP will map phrases or words to concepts of interest, and it needs careful pre-processing of text and it will be converted to document from the natural form. (Young Juhn et al., 2019)

A few examples of low-level NLP tasks (text pre-processing) include the following:

(1) Sentence boundary detection, which is usually defined by a period

(2) Tokenization (breaking a sentence into individual tokens)

(3) Stemming (reducing word into a root form)

(4) Lemmatization (process mapping a token)

A few examples of higher-level NLP tasks include the following:

(1) Named entity recognition

(2) Setting up negation rules

The below figure depicts the text preprocessing and classification of an asthma patient.

NLP Algorithms for document-level text processing and patient-level classification (Young Juhn et al., 2019)

Role of Deep Learning/NLP in Healthcare

Basically, in terms of users, we can categorize the healthcare information coming from below four sources -

1) Doctors

2) Patient

3) Paramedical staff

4) Pharmaceuticals

Stages of healthcare information

Every post-process depends on the diagnosis of a disease. If the disease is identified properly then one can get proper treatment. Sometimes the situation of patients gets serious due to delays in decision making.

Below figure represents, deep learning is an approach where there is a large scale network that is going to accept a variety of input data types like text, image, audio, time-series data, etc. for each data type learns a useful features in its lower level towers. The data from each pillar is then merged and flows through higher levels, allowing the Deep Neural Network to reach to conclusion based on reasoning and evidence across data types.

Deep learning can detect features and learn from a variety of data types (Andre Esteva et al., 2019)

Natural language processing can help healthcare in information extraction, unstructured data to structured data conversion, Document categorization, and summarization. Ultimately it will reduce administrative cost by efficient billing and accurate prior authorization approval. It also going to create Medical Value by providing support for ineffective clinical decisions and streamlined medical policy assessment etc. (Suresh Rangasamy et al., 2018). Apart from that it will help in improving patient interactions with the provider and the EHR, increase patient health awareness, improve care quality, and identify patients with critical care needed.

In healthcare, sequential deep learning and Natural language technologies boosts applications within domains like electronic health records (EHRs). (Andre Esteva et al., 2019)

An outsized medical organization can capture 10 million patients’ medical transactions in a decade. One hospital alone generates 150,000 pieces of data. Aggregation of all EHRs of this scale exemplifies 100 million years of patient data and 200,000 years of doctors’ practice, and it covers excess of rare conditions and diseases. (Andre Esteva et al., 2019)

The below figure depicts various stages of processing

A. Unstructured EHR data. Medical records are stored in heterogeneous formats as per the working mechanism and data structures of a specific hospital. So, it will vary from one hospital to another.

B. Data standardization. Based on FHIR, data will be standardized into the same or we can say homogeneous format by mapping data from multiple sources

C. Sequencing. By temporally sequencing all data into a patient timeline, time-based deep-learning techniques can be applied to the entirety of EHR datasets for making predictions about single patients.

Making predictions using EHRs (Andre Esteva et al., 2019)

NLP can be helpful in the neurology domain too. The below figure depicts how the iterative process of the NLP algorithm which can be helpful in identifying 24 brain imaging phenotypes in two areas of NHS Scotland which had excellent positive predictive value for cerebrovascular and neuro degenerative phenotypes.

The NLP system, Edinburgh Information Extraction for Radiology reports (EdIE-R), is a staged pipeline process as depicted in the below figure, with XML rule-based text mining software at its core.

EdIE-R System Architecture (Emily Wheater et al. 2019)

Conclusion

Deep Learning and NLP will play a significant role in accelerating the decision-making process in the healthcare area. However, the real rewards of developing good algorithms will depend heavily on the quality of the data that they acquire and maintain. The faster decision-making process will allow physicians to focus on the value-added care of patients. NLP with Deep Learning and Computer Vision can process a variety of data together to take precise decisions. Collaborative research can lead to a higher level of treatment in Healthcare. Considering the impact of AI techniques, these systems need to be designed and built very carefully in a larger socio-ecological context of clinical care settings to provide better healthcare to society.

Bibliography

1. What Is the Role of Natural Language Processing in Healthcare? By Jennifer Bresnick https://healthitanalytics.com/features/what-is-the-role-of-natural-language-processing-in-healthcare

2. A guide to deep learning in healthcare, Andre Esteva, Alexandre Robicquet, Bharath Ramsundar, Volodymyr Kuleshov, Mark DePristo, Katherine Chou, Claire Cui, Greg Corrado, Sebastian Thrun and Jeff Dean, Nature Medicine, VOL 25, JANUARY 2019, PP 24–29

3. Natural language processing in healthcare, Suresh Rangasamy, Rosanne Nadenichek, Mahi Rayasam, and Alex Sozdatelev, December 6, 2018

https://www.mckinsey.com/industries/healthcare-systems-and-services/our-insights/natural-language-processing-in-healthcare#:~:text=Artificial%20intelligence%20(AI)%20is%20increasingly,generated%20spoken%20or%20written%20data.

4. Top 12 Use Cases of Natural Language Processing in Healthcare

https://marutitech.com/use-cases-of-natural-language-processing-in-healthcare/

5. A validated natural language processing algorithm for brain imaging phenotypes from radiology reports in UK electronic health records, Emily Wheater, Grant Mair, Cathie Sudlow, Beatrice Alex, Claire Grover and William Whiteley, Wheater et al. BMC Medical Informatics and Decision Making, PP 1–11

6. Deep Learning Techniques for Electronic Health Record (EHR) Analysis, T. Poongodi, D. Sumathi, P. Suresh, and Balamurugan Balusamy, Springer Nature Singapore Pte Ltd. 2021, A. K. Bhoi et al. (eds.), Bio-inspired Neurocomputing, Studies in Computational Intelligence 903, https://doi.org/10.1007/978-981-15-5495-7_5

7. Artificial intelligence approaches using natural language processing to advance EHR-based clinical research, Young Juhn, MD, MPH and Hongfang Liu, Ph.D., American Academy of Allergy, Asthma & Immunology, https://doi.org/10.1016/j.jaci.2019.12.897

Authors

Kamal Jain — He is a dynamic, result-oriented digital technocrat with over 19 years of experience and a proven track record of building futuristic products using digital technologies — Artificial Intelligence (AI), neural networks and deep learning, Machine Learning (ML), Natural Language Processing (NLP), Virtual Desktop Infrastructure (VDI), ‘Bring Your Own Device’ (BYOD), Cloud based Software as a Service (SaaS) solution stack. He has rich experience in building complex technical product solutions in matrix organizations with international experience across different countries — US, the UK, Spain, South Korea, UAE, and India. As a part of giving back to the society and tech sector, he has been voluntary mentoring and providing knowledge sessions at various universities, tech start-ups in the areas of Artificial Intelligence, Machine Learning, Deep Learning & Neural Networks, Cloud based technologies, etc.

https://www.linkedin.com/in/kjainiimb/

Vishal Prajapati — He is a young and enthusiastic person with sound teaching-learning and soft skills, working as an Assistant Professor in the Department of Information Technology, G H Patel College of Engineering & Technology, V. V. Nagar, Anand, Gujarat. He has more than 9 years of teaching experience. He obtained a post-graduate degree in Computer Engineering from Birla Vishwakarma Mahavidyalaya (BVM), GTU. He has published 9 Research papers in various National/International Conferences and Journals. His research interests include Artificial Intelligence, Machine Learning, Deep Learning, Data Science, SQL.

https://www.linkedin.com/in/vishal-prajapati-b3242812/

--

--

Kamal Jain
Analytics Vidhya

Data Science Evangelist | Mentor | Digital Transformation using AI | ML | Deep Learning | Computer Vision | Cloud Computing