This document discusses the need for structured reporting in cardiac catheterization laboratories. It recognizes barriers to clinician adoption but identifies benefits such as improved communication, accurate documentation, and reuse of data for multiple purposes. Structured reporting involves integrating data capture into clinical workflows and using standards to facilitate data interchange and interoperability. It can reduce costs by decreasing documentation time and enabling risk stratification to guide prevention. Widespread adoption requires aligning health IT systems and workflows with clinical models of structured reporting.
The document discusses patient care and monitoring systems. It provides details on the HELP system at LDS Hospital, which was one of the first and most successful clinical information systems. The HELP system evolved from initially providing decision support during care to also supporting nursing care decisions and aggregating data for research. It has been in continuous operation since 1967 and integrated into multiple hospitals. Evaluations found that the HELP system was widely accepted, demonstrated the feasibility of computerized clinical decision support, and provided improvements in patient care and more cost-effective care.
This document discusses using ontologies to simplify semantic solutions for biomedical applications. It provides examples of how ontologies can be used to integrate medical expertise and knowledge from different sources. It also describes challenges in representing biomedical information with ontologies and introduces MedMaP, a medical management portal that aims to simplify access to ontology-based reasoning and analytics using graphical visualizations and self-service tools. MedMaP allows users to customize their experience and gain insights from subject matter experts.
This NLP Project Presentation explores how Natural Language Processing (NLP) and Data Science are revolutionizing the prediction of heart disease. Discover how cutting-edge techniques are being used to analyze textual data, such as patient records and medical reports, to predict the likelihood of heart disease with unprecedented accuracy. For more details on data science Visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Dive into an extensive analysis of heart disease classification, exploring key factors, trends, and predictive models for improved diagnosis and treatment strategies. Visit, https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/ for more
This document discusses hospital information systems (HIS). It begins by defining HIS and explaining their importance in managing clinical, administrative, and financial aspects of hospitals. It then outlines the objectives and components of HIS, including clinical information systems, financial systems, and more. Examples of specific HIS are provided, like electronic medical records and remote patient monitoring. Advantages of HIS include improved data access and efficiency. Challenges to implementation include user acceptance and costs. The document concludes by discussing the life cycle and training involved with HIS.
The document describes a hospital database management system that digitizes patient registration, disease details, doctor information, and the billing system. It assigns a unique ID to each patient and staff member and allows searching by ID. The system stores patient and staff details automatically and allows searching the current status of rooms. It maintains records for indoor and outdoor patients, test and exam details, prescriptions, bills, and more. Administrators and data entry operators can add, view, edit, and manage data in tables for patients, doctors, labs, rooms, and more through a secure database.
The document describes a hospital database management system that digitizes patient registration, disease details, doctor information, and the billing system. It assigns a unique ID to each patient and staff member and allows searching by ID. The system records patient and staff details, stores test results and prescribed treatments, generates bills, and allows administrators and users to enter, view, edit, and delete data. It aims to computerize hospital operations and record-keeping to streamline work and reduce errors.
In this full-day tutorial, you will learn basic overview of electronic medical records systems, health data management and how you can use the OpenMRS system for data and information management. We will cover basics of installation, user management, location management, patient dashboards and some interesting features that are provided by different modules. You can see how OpenMRS can be customized with different modules that are suitable for different contexts. This tutorial is helpful for new users and developers who would like to know the features of OpenMRS. Individuals who would like to evaluate and try to see if OpenMRS fits their healthcare needs will also benefit from this tutorial.
This document provides an overview of hospital information system architectures and strategies. It begins by defining the types of data processed in hospitals, including patient, resource, administrative, and management data. It then describes the main hospital functions like patient care, supply management, administration, and management. Under patient care, it outlines functions like admission, treatment planning, order entry, care delivery, and discharge. It provides details on what each function involves and what data is used. The document aims to explain the components of a hospital information system and how they can be integrated to support clinical and business operations.
The Dual Nature of InformaticsInformatics can be used for impr.docxhe45mcurnow
Ìý
The document discusses the dual nature of informatics, which refers to how informatics can improve health outcomes for individual patients through tools like EHRs and CDS at the point of care, and also for groups of patients through data warehousing and mining of patient information in EHRs. It provides a scenario about a patient, Mrs. Jones, presenting with dizzy spells and nausea, and prompts the reader to consider what information should be collected from Mrs. Jones and how it could help her care and be aggregated to help other similar patients.
This document provides an overview of health informatics and the role of librarians. It defines key terms like electronic health records, health information technology, and meaningful use. It discusses stages of meaningful use and how health informatics tools can improve care delivery and outcomes. The document also explores potential roles for librarians in areas like patient education, training, and research support within the health informatics field.
Health Care Processes and Decision Making_lecture 1_slidesCMDLearning
Ìý
The document discusses the classic paradigm of the clinical process. It describes the elements of the classic paradigm, which assumes a single patient interacts with a single clinician to address a single problem during a single visit. It also examines different types of information clinicians use and how this information is organized. The document outlines the steps in the classic clinical process, including gathering data, analyzing findings, making a diagnosis, and communicating the treatment plan.
Babithas Notes on unit-3 Health/Nursing Informatics TechnologyBabitha Devu
Ìý
This document provides an overview of health information systems. It begins by defining a health information system as a system designed to manage healthcare data, including electronic medical records, hospital operations, and systems supporting healthcare policy decisions. It notes that health information systems commonly access, process, or maintain large volumes of sensitive data, so security is a primary concern. The document then discusses definitions of health care information systems and their components. It provides an example of how data might be recorded and where during a patient's visit. Finally, it outlines different architectures for health information systems, including stand-alone, centralized, decentralized, and federated systems.
A healthcare information system enables the collection, storage, management and analysis of patient treatment histories and other key data. It has several potential benefits including more efficient administration, improved monitoring of drug usage, reduced errors and increased information integrity. There are different types of healthcare information systems such as electronic medical records, practice management software, patient portals and clinical decision support systems. Each system serves an important role like storing patient records, managing daily operations, enabling patient access to health data and assisting healthcare providers in clinical decision making. Overall, healthcare information systems can improve quality of care, reduce costs and improve coordination across the healthcare system.
What happens when cardiologists have had enough with general EHRs that know nothing about cardiology? They formulate a plan to treat those issues, and here is how they did it with a system they designed from the ground up.
Discover the Cardiovascular Suite, including Cardiology EHR & Diagnostics, developed by the heart specialists at Objective Medical Systems.
Game of documentation, Winter is coming Surviving ICD10Nick van Terheyden
Ìý
The document provides an overview of ICD-10 implementation and the impact of clinical documentation on outcomes and reimbursement. It discusses how accurate documentation is important for determining severity of illness and risk adjustment, which drives hospital reimbursement and quality metrics. It emphasizes that physicians need to fully document their clinical decision making to avoid issues like payment denials, penalties, or inaccurate performance assessments that could arise from incomplete records.
Classifying Readmissions of Diabetic Patient EncountersMayur Srinivasan
Ìý
Readmission rates in hospitals are a key indicator on quality of patient care and a clear indication of total cost or inconvenience related to the treatment. Patients with serious medical
conditions such as diabetes mellitus are key drivers of readmission rates owing to the complexity of their illness. Therefore, being able to predict based on certain features whether or not a patient
will need readmission can help doctors and hospitals provide better care initially and not get financially penalized under Obamacare’s readmission policy
This document discusses developing an effective clinical information system. It recommends understanding information needs, conceptualizing problems at the patient, service and research levels. An example system in Wales integrates data from multiple sources using common standards like SNOMED-CT. The document outlines a vision of seamless integration between systems focused on the patient rather than organizations. It emphasizes using examples to understand core informatics requirements and taking an iterative approach to development. Examples provided show how the system supports clinical decision making, research, and justifying service needs with aggregated data.
This document discusses a project that aims to predict re-admission of diabetes patients using machine learning. The project aims to help both patients and hospitals by providing a model to predict re-admission cases so hospitals can better prepare. The dataset contains over 10,000 observations on diabetes patients over 10 years. Several features like weight, payer code, and medical specialty will be dropped due to missing data. Other features like age, admission type, and discharge disposition will be consolidated. Feature engineering will also add a total number of visits feature and preprocess the data. The goal is to build a model that can help hospitals better manage resources and reduce costs and improve patient care.
The document discusses health management information systems and hospital information systems. It defines key terms like system, health system, information, and health information system. It explains that the goal of a health information system is to improve actions and decision making at all levels of the health system by generating relevant information. It outlines some common issues with current health MIS like irrelevant data, poor quality, and lack of timely reporting. It also discusses important components and characteristics of an effective health information system.
This document outlines an agenda and case studies for a healthcare analytics bootcamp. The bootcamp will use healthcare data to develop machine learning solutions to predict heart disease and identify high-risk patients. Case Study 1 will involve exploratory data analysis of tuberculosis data to analyze global trends, hotspots, and mortality rates. Case Study 2 will use a heart disease screening dataset and logistic regression to build a model to predict heart disease risk and develop treatment plans for high-risk patients. The document discusses the types of structured and unstructured healthcare data, sources of data, and applications of machine learning in healthcare analytics.
An overview of clinical healthcare data analytics from the perspective of an interventional cardiology registry. This was initially presented as part of a workshop at the University of Illinois College of Computer Science on April 20, 2017.
My Top 5 articles from 2015-16 about Informatics and Digital Health in Physio...Samantha Plumb
Ìý
The document discusses how digital health and informatics can influence physiotherapy management and outcomes for stroke patients. It reviews 5 articles on this topic. The articles found that electronic medical records (EMRs) can help implement evidence-based guidelines, standardize care, improve documentation and coordination, and enable monitoring of patient progress. EMRs also allow identifying patients for research studies and individualizing rehabilitation. The document recommends that EMRs for stroke patients include clinical pathways, order sets, structured physiotherapy forms, and an evidence-based early mobilization strategy. Overall, incorporating digital health can help translate knowledge into practice and improve stroke care when implementing an EMR.
The document describes a hospital database management system that digitizes patient registration, disease details, doctor information, and the billing system. It assigns a unique ID to each patient and staff member and allows searching by ID. The system records patient and staff details, stores test results and prescribed treatments, generates bills, and allows administrators and users to enter, view, edit, and delete data. It aims to computerize hospital operations and record-keeping to streamline work and reduce errors.
In this full-day tutorial, you will learn basic overview of electronic medical records systems, health data management and how you can use the OpenMRS system for data and information management. We will cover basics of installation, user management, location management, patient dashboards and some interesting features that are provided by different modules. You can see how OpenMRS can be customized with different modules that are suitable for different contexts. This tutorial is helpful for new users and developers who would like to know the features of OpenMRS. Individuals who would like to evaluate and try to see if OpenMRS fits their healthcare needs will also benefit from this tutorial.
This document provides an overview of hospital information system architectures and strategies. It begins by defining the types of data processed in hospitals, including patient, resource, administrative, and management data. It then describes the main hospital functions like patient care, supply management, administration, and management. Under patient care, it outlines functions like admission, treatment planning, order entry, care delivery, and discharge. It provides details on what each function involves and what data is used. The document aims to explain the components of a hospital information system and how they can be integrated to support clinical and business operations.
The Dual Nature of InformaticsInformatics can be used for impr.docxhe45mcurnow
Ìý
The document discusses the dual nature of informatics, which refers to how informatics can improve health outcomes for individual patients through tools like EHRs and CDS at the point of care, and also for groups of patients through data warehousing and mining of patient information in EHRs. It provides a scenario about a patient, Mrs. Jones, presenting with dizzy spells and nausea, and prompts the reader to consider what information should be collected from Mrs. Jones and how it could help her care and be aggregated to help other similar patients.
This document provides an overview of health informatics and the role of librarians. It defines key terms like electronic health records, health information technology, and meaningful use. It discusses stages of meaningful use and how health informatics tools can improve care delivery and outcomes. The document also explores potential roles for librarians in areas like patient education, training, and research support within the health informatics field.
Health Care Processes and Decision Making_lecture 1_slidesCMDLearning
Ìý
The document discusses the classic paradigm of the clinical process. It describes the elements of the classic paradigm, which assumes a single patient interacts with a single clinician to address a single problem during a single visit. It also examines different types of information clinicians use and how this information is organized. The document outlines the steps in the classic clinical process, including gathering data, analyzing findings, making a diagnosis, and communicating the treatment plan.
Babithas Notes on unit-3 Health/Nursing Informatics TechnologyBabitha Devu
Ìý
This document provides an overview of health information systems. It begins by defining a health information system as a system designed to manage healthcare data, including electronic medical records, hospital operations, and systems supporting healthcare policy decisions. It notes that health information systems commonly access, process, or maintain large volumes of sensitive data, so security is a primary concern. The document then discusses definitions of health care information systems and their components. It provides an example of how data might be recorded and where during a patient's visit. Finally, it outlines different architectures for health information systems, including stand-alone, centralized, decentralized, and federated systems.
A healthcare information system enables the collection, storage, management and analysis of patient treatment histories and other key data. It has several potential benefits including more efficient administration, improved monitoring of drug usage, reduced errors and increased information integrity. There are different types of healthcare information systems such as electronic medical records, practice management software, patient portals and clinical decision support systems. Each system serves an important role like storing patient records, managing daily operations, enabling patient access to health data and assisting healthcare providers in clinical decision making. Overall, healthcare information systems can improve quality of care, reduce costs and improve coordination across the healthcare system.
What happens when cardiologists have had enough with general EHRs that know nothing about cardiology? They formulate a plan to treat those issues, and here is how they did it with a system they designed from the ground up.
Discover the Cardiovascular Suite, including Cardiology EHR & Diagnostics, developed by the heart specialists at Objective Medical Systems.
Game of documentation, Winter is coming Surviving ICD10Nick van Terheyden
Ìý
The document provides an overview of ICD-10 implementation and the impact of clinical documentation on outcomes and reimbursement. It discusses how accurate documentation is important for determining severity of illness and risk adjustment, which drives hospital reimbursement and quality metrics. It emphasizes that physicians need to fully document their clinical decision making to avoid issues like payment denials, penalties, or inaccurate performance assessments that could arise from incomplete records.
Classifying Readmissions of Diabetic Patient EncountersMayur Srinivasan
Ìý
Readmission rates in hospitals are a key indicator on quality of patient care and a clear indication of total cost or inconvenience related to the treatment. Patients with serious medical
conditions such as diabetes mellitus are key drivers of readmission rates owing to the complexity of their illness. Therefore, being able to predict based on certain features whether or not a patient
will need readmission can help doctors and hospitals provide better care initially and not get financially penalized under Obamacare’s readmission policy
This document discusses developing an effective clinical information system. It recommends understanding information needs, conceptualizing problems at the patient, service and research levels. An example system in Wales integrates data from multiple sources using common standards like SNOMED-CT. The document outlines a vision of seamless integration between systems focused on the patient rather than organizations. It emphasizes using examples to understand core informatics requirements and taking an iterative approach to development. Examples provided show how the system supports clinical decision making, research, and justifying service needs with aggregated data.
This document discusses a project that aims to predict re-admission of diabetes patients using machine learning. The project aims to help both patients and hospitals by providing a model to predict re-admission cases so hospitals can better prepare. The dataset contains over 10,000 observations on diabetes patients over 10 years. Several features like weight, payer code, and medical specialty will be dropped due to missing data. Other features like age, admission type, and discharge disposition will be consolidated. Feature engineering will also add a total number of visits feature and preprocess the data. The goal is to build a model that can help hospitals better manage resources and reduce costs and improve patient care.
The document discusses health management information systems and hospital information systems. It defines key terms like system, health system, information, and health information system. It explains that the goal of a health information system is to improve actions and decision making at all levels of the health system by generating relevant information. It outlines some common issues with current health MIS like irrelevant data, poor quality, and lack of timely reporting. It also discusses important components and characteristics of an effective health information system.
This document outlines an agenda and case studies for a healthcare analytics bootcamp. The bootcamp will use healthcare data to develop machine learning solutions to predict heart disease and identify high-risk patients. Case Study 1 will involve exploratory data analysis of tuberculosis data to analyze global trends, hotspots, and mortality rates. Case Study 2 will use a heart disease screening dataset and logistic regression to build a model to predict heart disease risk and develop treatment plans for high-risk patients. The document discusses the types of structured and unstructured healthcare data, sources of data, and applications of machine learning in healthcare analytics.
An overview of clinical healthcare data analytics from the perspective of an interventional cardiology registry. This was initially presented as part of a workshop at the University of Illinois College of Computer Science on April 20, 2017.
My Top 5 articles from 2015-16 about Informatics and Digital Health in Physio...Samantha Plumb
Ìý
The document discusses how digital health and informatics can influence physiotherapy management and outcomes for stroke patients. It reviews 5 articles on this topic. The articles found that electronic medical records (EMRs) can help implement evidence-based guidelines, standardize care, improve documentation and coordination, and enable monitoring of patient progress. EMRs also allow identifying patients for research studies and individualizing rehabilitation. The document recommends that EMRs for stroke patients include clinical pathways, order sets, structured physiotherapy forms, and an evidence-based early mobilization strategy. Overall, incorporating digital health can help translate knowledge into practice and improve stroke care when implementing an EMR.
Useful environment methods in Odoo 18 - Odoo ºÝºÝߣsCeline George
Ìý
In this slide we’ll discuss on the useful environment methods in Odoo 18. In Odoo 18, environment methods play a crucial role in simplifying model interactions and enhancing data processing within the ORM framework.
APM event hosted by the South Wales and West of England Network (SWWE Network)
Speaker: Aalok Sonawala
The SWWE Regional Network were very pleased to welcome Aalok Sonawala, Head of PMO, National Programmes, Rider Levett Bucknall on 26 February, to BAWA for our first face to face event of 2025. Aalok is a member of APM’s Thames Valley Regional Network and also speaks to members of APM’s PMO Interest Network, which aims to facilitate collaboration and learning, offer unbiased advice and guidance.
Tonight, Aalok planned to discuss the importance of a PMO within project-based organisations, the different types of PMO and their key elements, PMO governance and centres of excellence.
PMO’s within an organisation can be centralised, hub and spoke with a central PMO with satellite PMOs globally, or embedded within projects. The appropriate structure will be determined by the specific business needs of the organisation. The PMO sits above PM delivery and the supply chain delivery teams.
For further information about the event please click here.
Information Technology for class X CBSE skill SubjectVEENAKSHI PATHAK
Ìý
These questions are based on cbse booklet for 10th class information technology subject code 402. these questions are sufficient for exam for first lesion. This subject give benefit to students and good marks. if any student weak in one main subject it can replace with these marks.
QuickBooks Desktop to QuickBooks Online How to Make the MoveTechSoup
Ìý
If you use QuickBooks Desktop and are stressing about moving to QuickBooks Online, in this webinar, get your questions answered and learn tips and tricks to make the process easier for you.
Key Questions:
* When is the best time to make the shift to QuickBooks Online?
* Will my current version of QuickBooks Desktop stop working?
* I have a really old version of QuickBooks. What should I do?
* I run my payroll in QuickBooks Desktop now. How is that affected?
*Does it bring over all my historical data? Are there things that don't come over?
* What are the main differences between QuickBooks Desktop and QuickBooks Online?
* And more
How to Modify Existing Web Pages in Odoo 18Celine George
Ìý
In this slide, we’ll discuss on how to modify existing web pages in Odoo 18. Web pages in Odoo 18 can also gather user data through user-friendly forms, encourage interaction through engaging features.
Finals of Rass MELAI : a Music, Entertainment, Literature, Arts and Internet Culture Quiz organized by Conquiztadors, the Quiz society of Sri Venkateswara College under their annual quizzing fest El Dorado 2025.
Database population in Odoo 18 - Odoo slidesCeline George
Ìý
In this slide, we’ll discuss the database population in Odoo 18. In Odoo, performance analysis of the source code is more important. Database population is one of the methods used to analyze the performance of our code.
Computer Application in Business (commerce)Sudar Sudar
Ìý
The main objectives
1. To introduce the concept of computer and its various parts. 2. To explain the concept of data base management system and Management information system.
3. To provide insight about networking and basics of internet
Recall various terms of computer and its part
Understand the meaning of software, operating system, programming language and its features
Comparing Data Vs Information and its management system Understanding about various concepts of management information system
Explain about networking and elements based on internet
1. Recall the various concepts relating to computer and its various parts
2 Understand the meaning of software’s, operating system etc
3 Understanding the meaning and utility of database management system
4 Evaluate the various aspects of management information system
5 Generating more ideas regarding the use of internet for business purpose
How to Setup WhatsApp in Odoo 17 - Odoo ºÝºÝߣsCeline George
Ìý
Integrate WhatsApp into Odoo using the WhatsApp Business API or third-party modules to enhance communication. This integration enables automated messaging and customer interaction management within Odoo 17.
SOCIAL CHANGE(a change in the institutional and normative structure of societ...DrNidhiAgarwal
Ìý
This PPT is showing the effect of social changes in human life and it is very understandable to the students with easy language.in this contents are Itroduction, definition,Factors affecting social changes ,Main technological factors, Social change and stress , what is eustress and how social changes give impact of the human's life.
Blind spots in AI and Formulation Science, IFPAC 2025.pdfAjaz Hussain
Ìý
The intersection of AI and pharmaceutical formulation science highlights significant blind spots—systemic gaps in pharmaceutical development, regulatory oversight, quality assurance, and the ethical use of AI—that could jeopardize patient safety and undermine public trust. To move forward effectively, we must address these normalized blind spots, which may arise from outdated assumptions, errors, gaps in previous knowledge, and biases in language or regulatory inertia. This is essential to ensure that AI and formulation science are developed as tools for patient-centered and ethical healthcare.
2. • What is a database?
• A database is essentially a collection of data that has been arranged into a
structure that allows it to be easily read, edited, added or deleted.
• What is a database management system?
• Database Management System is a system designed to manage the
automatic and orderly database. The Database Management System is an
automatic system helping the user to control information, create, update
and maintain the database.
• What do databases do in healthcare?
• Healthcare databases help individual medical organizations understand their
daily activities and their place within the larger healthcare industry. This
allows healthcare professionals to make decisions about how they run their
businesses, the work they do, and the systems they use to manage their
operations. With the rapid advancement of healthcare technology and post-
COVID-19 changes in healthcare operations, it is more vital than ever for
healthcare databases to be organized, well-maintained, and simple to use.
3. • Why is database management important?
• Healthcare databases have been an essential component of understanding and
improving critical care worldwide. The importance of database technology in healthcare
cannot be overstated. The Healthcare database system generates data that requires
delicate handling and is developed for the purpose of assessing the quality of healthcare,
often for a specific disease or within a specific healthcare delivery system. databases in
healthcare can promote:
• 1. Assessing the quality of healthcare
• Hospitals, health departments, local, state and federal agencies, to name a few, all
contribute information to healthcare databases. Healthcare specialists can gain a
thorough understanding of the quality of current healthcare operations by analyzing
aspects such as; financing available to healthcare facilities, service availability and
affordability, use of innovation, and barriers to seeking care.
• 2. Tracking and Monitoring
• Medical workers can utilize reporting and logging tools to keep track of operations with
the help of healthcare databases. these aids healthcare providers in monitoring and
improving the quality of patient treatment by providing an important tool for tracking
healthcare use. Healthcare providers, for instance, need to know whether people are
getting their vaccination or not.
4. Healthcare DBMS
• A database can be defined as a collection of related data (Elmasri and
Navathe, 2010).
• Databases are often subcategorized as SQL databases or NoSQL databases.
• In SQL databases, data is recorded in tables and consists of rows and
columns. The related data may be distributed across several tables in a
trade-off between efficient storage and convenience.
• The database management system (DBMS) is a piece of software that
enables the database to serve several functions. For one thing, it allows for
the retrieval of data using the SQL language (for SQL databases). Another
function is to update the data when needed, also using SQL. Additional
functions of a DBMS include protecting and securing the data.
5. Data engineering with SQL – an example case
• For this chapter, let's pretend you secured a predictive analytics assignment
with a cardiology practice located in the United States. The practice wants
you to predict which patients are at risk of dying within 6 months of their
visit to the clinic. They make their data available to you in the form of a
database that includes six tables.
• For simplicity, we truncate the database to include the information for five
patients only. Our task is to manipulate the data using the SQL language to
consolidate it into a single table so that it can be used for machine learning.
We will first go over the patients in the database and the database structure.
Then, we will introduce basic SQL concepts for engineering and manipulate
the data into a form amenable to machine learning.
6. Case details – predicting mortality for a cardiology practice
• The cardiology practice you are working with has two physicians on
staff: Dr. Johnson and Dr. Wu. While the practice has many patients,
they are interested in identifying which patients who visit are at high
risk of all-cause mortality within the next 6 months.
• Now that we've reviewed the details of the modeling assignment,
let's take a look at the five patients in the database. The preliminary
data sent to you by the cardiology practice includes information on
five patients, distributed across six tables.
7. Case details – predicting mortality for a cardiology practice
The following is the information about the patients:
• Patient ID-1: Patient #1 in the database is a 65-year-old male who has congestive heart failure (CHF), a chronic
condition in which the heart is unable to pump blood properly to the rest of the body. He also has hypertension
(high blood pressure), which is a risk factor for CHF. He visited his cardiologist, Dr. Johnson, on 9/1/2016 and
17/1/2016. On his January 9th visit, he was found to have an elevated BP (154/94) and an elevated B-natriuretic
peptide (BNP) lab value of 350. BNP is a marker of CHF severity. He was subsequently placed on lisinopril and
furosemide, which are first-line treatments for CHF and hypertension. Unfortunately, he passed away on May
15th, 2016.
• Patient ID-2: Patient #2 is a 39-year-old female with a history of angina pectoris (cardiovascular-related chest
pain upon exercising) and diabetes mellitus. Diabetes mellitus is a risk factor for myocardial infarction (heart
attack; a late, often fatal manifestation of atherosclerotic heart disease), and angina pectoris can be seen as an
early manifestation of atherosclerotic heart disease. She visited her cardiologist, Dr. Wu, on January 15th, 2016, at
which time she was found to have an elevated blood glucose level of 225, a sign of uncontrolled diabetes. She
was started on metformin for her diabetes, as well as nitroglycerin, aspirin, and metoprolol for her angina.
• Patient ID-3: Patient #3 is a 32-year-old female who sees Dr. Johnson for management of her hypertension.
During her visit on February 1st, 2016 her blood pressure was elevated at 161/100. She was started on
valsartan/hydrochlorothiazide, an anti-hypertensive combination.
8. • Patient ID: 4: Patient #4 is a 51-year-old male who has severe CHF with
pulmonary hypertension. He saw Dr. Wu on February 27th, 2016. During
that visit, his weight was 211 lbs and his blood pressure was slightly
elevated at 143/84. His BNP level was highly elevated at 1,000. He was
given lisinopril and furosemide for his CHF as well as diltiazem for his
pulmonary hypertension. Unfortunately, he passed away on June 8th, 2016.
• Patient ID-5: The last patient in our database, patient #5, is a 58-year-old
male who presented to Dr. Wu on March 1st, 2016 with a history of CHF
and diabetes mellitus Type 2. During the visit, his glucose was elevated at
318 and BNP was moderately elevated at 400. He was started on lisinopril
and furosemide for his CHF and metformin for his diabetes.
9. The clinical database
• Now that we've gotten to know the five patients whose information is contained in our database,
we can describe the table structure and fields contained in the database, for six mock tables:
PATIENT, VISIT, MEDICATIONS, LABS, VITALS, and MORT.
• Although every clinical database is different, I've tried to use a structure that is commonly seen in
healthcare. Typically, tables are presented by clinical domains (for an example of a research study
that received tables in such a distributed format, see Basole et al., 2015). For example, there is
often one table that contains demographic and personal information, one table for lab results,
one for medications, and so on, so that is how we constructed the database in this example. They
tend to be tied together by a common identifier, which in our case is the Pid field.
• As we describe the tables, we must keep our end-goal of the data engineering phase in mind–to
combine the relevant information from the six tables into a single table, whose columns include
the target variable (mortality in this case) in addition to predictor variables, which should be
useful for predicting the target variable. This will enable us to make a machine learning model
with popular packages such as Python's scikit-learn. With this in mind, we will highlight selected
fields that will be useful for our assignment.
10. The PATIENT table
• In our example, the PATIENT table, which we can see in the following
screenshot, contains the demographic and identifying information of
our patients–their names, contact information, birthdays, and
biological sex.
• In this example, there are only five observations and 11 columns; in
real practice, this table would contain all of the patients affiliated with
the healthcare organization. The number of rows in this table might
range from hundreds to hundreds of thousands, while the table could
potentially include dozens of columns containing detailed
demographic information:
11. • In the database, every unique patient is assigned to an identifier (the
field labeled as Pid), which in our case is simply numbered 1 - 5. The
Pid column allows us to keep track of the patients across different
tables. Also, notice that there is one and only one entry for each
distinct patient ID.
12. The PATIENT table
• After identifying the indispensable identifer column, the focus should be on which variables to
keep and which to discard. Certainly, age and sex are important demographic predictors of
mortality. If race were in this table, that would be another important demographic variable.
• In the database, every unique patient is assigned to an identifier (the field labeled as Pid), which in
our case is simply numbered 1 - 5. The Pid column allows us to keep track of the patients across
different tables. Also, notice that there is one and only one entry for each distinct patient ID.
• Another notable variable in this table is the zip code. Increasingly, socioeconomic data is being
used in machine learning analyses. The zip code can potentially be tied to publicly available
census data; that data can then be joined to the data in this table on the zip code and could
potentially provide information on the average education level, income, and healthcare coverage
for each patient's zip code. There are even organizations who sell household-level information;
however, with that data comes a great responsibility for privacy protection and data security. For
this example, we will omit the zip code to keep our final table simple.
• Information we'll leave out from our final table includes names, street addresses, and phone
numbers. As long as we have the patient ID, these fields shouldn't have much of a predictive
impact on our target variable.
13. The VISIT table
• While the PATIENT table contains basic administrative information
about each patient, our assignment is to predict the mortality risk on
the basis of each visit. The VISIT table contains one observation for
each patient visit, along with some clinical information about each visit:
• Notice that the patient ID is no longer the primary identifier of this
table, since Patient #1 had two visits; instead, there is a Visit_id field
that is numbered from 10001 to 10006 in this example, with one
distinct ID per visit.
14. The VISIT table
• This table also contains Visit_date. Since the cardiology practice
indicated they want to know the mortality risk within 6 months of the
patient visit, we will have to use this field later when we compute the
target variable.
• Two of the fields in this table contain ICD (diagnosis) codes. Actual
tables may contain dozens of codes for each visit. For each coded
field, there is a corresponding name field that contains the name of
the condition that the code represents.
15. The MEDICATIONS table
• The MEDICATIONS table contains one entry for every medication
being taken by our five patients. In this example, there is no single
column that serves as a primary key for this table. As we can see in
the following screenshot, this table includes information about the
medication name, dose, frequency, route, prescribing physician, and
prescription date. The NDC code of each medication is also included;
we covered NDC codes in Chapter 2, Healthcare Foundations:
16. The MEDICATIONS table
• Including medications in our final table will not be straightforward. For example, the
information in the tables does not indicate the class of each medication. The NDC
code is present, but the NDC code is even more granular than the medication name
since it includes the route of administration and dosage in making each unique code;
therefore, multiple forms of lisinopril could have different NDC codes. In order to
make a column for each medication, we could potentially separately make a table for
each medication, which contains all of the medications that compose it, and then
merge that information into our table.
• If we choose to include dosage information, that field will require some cleaning.
Notice that Patient #3 is receiving an anti-hypertensive combination drug–the
valsartan component has a dosage of 160 mg, while the hydrochlorothiazide
component has a dosage of 12.5 mg. This could possibly be coded as two separate
drugs, but creating a script that splits combination drugs into two rows is not trivial.
17. The LABS table
• Laboratory information is an important part of clinical diagnostics, and many laboratory test results make for good predictor
variables (Donze et al., 2013; Sahni et al., 2018). The LABS table includes fields that describe the laboratory test name,
abbreviation, LOINC code, and result:
• There are some different approaches to including lab information in the final table. One way would be to include the raw lab result
as a continuous variable. However, this leads to a problem because the result would be NULL for most labs. We could potentially
navigate around this issue by imputing a value in the normal range when it is missing. Another approach would be to have a binary
variable for a lab test result that is in the abnormal range. This solves the missing data problem, since if the result is missing it
would be zero. However, a BNP value of 1,000 (which indicates severe CHF) would be no different than a BNP value of 350
(which indicates mild CHF) with this method. We will demonstrate both approaches in this chapter.
18. The LABS table
• Also note that the Lab_value field sometimes contains special
characters, for example in the troponin result. These will need to be
removed and the lab values interpreted accordingly. Culture results
(not included in this example) are completely textual, often naming
specific bacterial strains instead of numbers.
• Again, we repeat that this is a simplified example and that many of
the common labs that would be drawn for these patients (for
example, WBC count, hemoglobin, sodium, potassium, and so on) are
excluded here.
19. The VITALS table
• Vital signs are important indicators of a patient's health status and can be good predictors in
healthcare machine learning models (Sahni et al., 2018). Vital signs are typically taken at every
patient visit, so they can easily be included in their raw (numerical) form to preserve granularity.
• In the following screenshot of the table, we notice that while height and weight are present, the
body mass index (BMI) is missing. We will demonstrate the calculation of the BMI in Chapter 5,
Computing Foundations – Introduction to Python. Second, Visit #10004 is missing a temperature
reading. This is common in healthcare and may be caused by an oversight in care:
20. The MORT table
• Finally, we come to the table that contains the target variable. The
MORT table contains just two fields, the patient identifier, and the
date the patient passed away. Patients not listed in this table can be
assumed to be living:
21. Starting an SQLite session
The database engine we will use to transform our database is SQLite. It should be mentioned that SQL
comes in many variants, and the SQL specific to SQLite has minor differences to that specific to MySQL
or SQL Server databases. However, the underlying principles remain constant across all SQL dialects.
At this time, do the following:
• Navigate to the directory containing the sqlite3.exe program in your shell or command prompt (using
the cd command).
• Type sqlite3 mortality.db and press Enter. You should see a prompt that looks like the following:
sqlite>. This prompt indicates that you are in the SQLite program.
• Throughout the remainder of this chapter, we are going to create some tables and execute some SQLite
commands on them in the SQLite program.
• To exit the session at any time, type .exit and press Enter.
22. Data engineering, one table at a time with SQL
sqlite> CREATE TABLE PATIENT(Pid VARCHAR(30) NOT NULL,
Fname VARCHAR(30) NOT NULL,Minit CHAR,
Lname VARCHAR(30) NOT NULL,Bdate TEXT NOT NULL,
Street VARCHAR(50),City VARCHAR(30),State VARCHAR(2),
Zip VARCHAR(5),Phone VARCHAR(10) NOT NULL,Sex CHAR,
PRIMARY KEY (Pid)
);
23. sqlite> INSERT INTO PATIENT (Pid, Fname, Minit, Lname, Bdate, Street, City, State, Zip, Phone, Sex)
VALUES ('1','John','A','Smith','1952-01-01','1206 Fox Hollow
Rd.','Pittsburgh','PA','15213','6789871234','M');
sqlite> INSERT INTO PATIENT (Pid, Fname, Minit, Lname, Bdate, Street, City, State, Zip, Phone, Sex)
VALUES ('2','Candice','P','Jones','1978-02-03','1429 Orlyn Dr.','Los
Angeles','CA','90024','3107381419','F');
sqlite> INSERT INTO PATIENT (Pid, Fname, Minit, Lname, Bdate, Street, City, State, Zip, Phone, Sex)
VALUES ('3','Regina','H','Wilson','1985-04-23','765 Chestnut
Ln.','Albany','NY','12065','5184590206','F');
sqlite> INSERT INTO PATIENT (Pid, Fname, Minit, Lname, Bdate, Street, City, State, Zip, Phone, Sex)
VALUES ('4','Harold','','Lee','1966-11-15','2928 Policy
St.','Providence','RI','02912','6593482691','M');
sqlite> INSERT INTO PATIENT (Pid, Fname, Minit, Lname, Bdate, Street, City, State, Zip, Phone, Sex)
VALUES ('5','Stan','P','Davis','1958-12-30','4271 12th St.','Atlanta','GA','30339','4049814933','M');
31. sqlite> CREATE TABLE MORT(
Pid VARCHAR(30) NOT NULL,
Mortality_date DATE NOT NULL,
PRIMARY KEY (Pid)
);
sqlite> INSERT INTO MORT (Pid,
Mortality_date)
VALUES ('1', '2016-05-15');
sqlite> INSERT INTO MORT (Pid,
Mortality_date)