
Data Cleaning and Inspection with Python

This guide provides an introduction to techniques used to clean data in Python.

The Data File - Data Dictionary

The data file consists of the following columns:

Provider ID - a unique identifier for each record

Hospital - Name of hospital

Address - the official physical address of the hospital

Zip Code - the official zip code for the hospital's physical address

County - county in which the hospital is located

Number - hospital phone number

Hospital Type - type of service provided at the hospital

(Acute Care Hospitals, Critical Access Hospitals, Childrens)

Hospital Ownership - owner of the hospital

(Government - Hospital District or Authority, Proprietary, Voluntary non-profit - Private, Government- Federal, Voluntary non-profit - Other, Government - Local, Government - State, Physician)

Emergency Services - whether the hospital provides emergency services to patients

(TRUE, FALSE)

Overall Rating - success rating given to each hospital

(1-5)

Mortality - mortality rating compared to the national average

(Below the National Average, Same as the National Average, Above the National Average)

Patient Experience - patient experience score compared to the national average

(Below the National Average, Same as the National Average, Above the National Average)

Effectiveness - hospital effectiveness as compared to the national average

(Below the National Average, Same as the National Average, Above the National Average)

Num of Patients - number of patients in the hospital at the time of data collection
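
The allowed values listed in the data dictionary can be used as a first inspection step. The following is a minimal sketch, not the guide's own method: it uses only the Python standard library, checks a subset of the columns above, and treats empty cells as invalid. The function name `find_invalid_values`, the `ALLOWED` mapping, and the two sample rows are hypothetical and were invented for illustration; they do not come from the real data file.

```python
import csv
import io

# Allowed values copied from the data dictionary above.
RATING_LEVELS = {
    "Below the National Average",
    "Same as the National Average",
    "Above the National Average",
}
ALLOWED = {
    "Hospital Type": {"Acute Care Hospitals", "Critical Access Hospitals", "Childrens"},
    "Emergency Services": {"TRUE", "FALSE"},
    "Overall Rating": {"1", "2", "3", "4", "5"},
    "Mortality": RATING_LEVELS,
    "Patient Experience": RATING_LEVELS,
    "Effectiveness": RATING_LEVELS,
}


def find_invalid_values(rows):
    """Return (row_number, column, value) for each value not in the dictionary."""
    problems = []
    for row_number, row in enumerate(rows, start=1):
        for column, allowed in ALLOWED.items():
            # Missing or empty cells become "" and are reported as invalid.
            value = (row.get(column) or "").strip()
            if value not in allowed:
                problems.append((row_number, column, value))
    return problems


# Hypothetical sample rows, invented for illustration (not from the real file).
SAMPLE_CSV = """Provider ID,Hospital Type,Emergency Services,Overall Rating,Mortality,Patient Experience,Effectiveness
1,Acute Care Hospitals,TRUE,4,Same as the National Average,Above the National Average,Same as the National Average
2,Acute Care Hospital,true,6,Same as the National Average,,Below the National Average
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
problems = find_invalid_values(rows)
for problem in problems:
    print(problem)
```

To run the same check on the downloaded file, the `io.StringIO(SAMPLE_CSV)` argument could be replaced with a file object opened on the CSV, assuming its header names match the column names in the data dictionary.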

Download the Data File

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.