Skip to main content

Data cleanings? Describe the steps of Data cleaning.


Data cleaning refers to the process of identifying and correcting (or removing) errors and inconsistencies in a dataset so that it can be analyzed and used effectively. This may involve removing duplicates, handling missing values, converting data into a consistent format, and more. The goal of data cleaning is to make sure that the data is accurate, complete, and trustworthy.

The steps in the data-cleaning process typically include:
  1. Inspection: Examine the data to identify any errors or inconsistencies.
  2. Data type conversion: Convert the data into a consistent format, such as converting strings to numbers or dates to a standard format.
  3. Handling missing values: Impute or remove missing values as appropriate.
  4. Outlier detection and treatment: Identify and correct outliers that may impact analysis.
  5. Duplicate removal: Remove duplicate records from the data
  6. Validation: Verify the accuracy and consistency of the data after cleaning.
  7. Saving the cleaned data: Save the cleaned data in a format that can be used for analysis.
Note: The specific steps involved in data cleaning may vary depending on the type of data and the intended use of the data.

Comments

Popular posts from this blog

HOUSING PROBLEMS FOR INDUSTRIAL WORKERS IN KHULNA CITY:A CASE STUDY ON SELECTED INDUSTRIES LOCATED IN WARD 8, 11 AND 13

1.1 Background of the study Housing is one of the basic needs of man after food and clothing. It provides shelter, safety and a sense of belonging to the owner. It also provides privacy, promotes health and comforts, and provides a basis for employment and income generation. More over a planned and well-designed house provides a favorable environment for human resource development. Housing means not only a structure but also a combination of both structure and infrastructure and services needed for living. Today, there is an acute housing crisis in the country, in the rural as well as in the urban areas. From the very beginning of human civilization people used to build shelter, which later turned today’s residence. Modern human civilizations justify residence in different points of view, such as the location, design, orientation, accessibility, environmental feasibility, services facilities etc. Khulna is well known as an industrial as well as divisional city of Banglade...

Assessment of Selected NGO Participation in Community Based Solid Waste Management:A case study of Sonadanga Residential Area

Chapter one 1.1 Background of the study             When a useful material good reaches the end of its life cycle, it losses its economic value and turn into waste. Nearly half of the world’s growing population lives in urban areas, causing enormous pressure on the local environment. Particularly in the large agglomerations of the developing countries, inadequate waste management is causes of serious urban pollution and hazard. Industrialized economies are facing an ever-increasing load of waste and declining landfill space to dispose of these materials. Sustainable management of waste with the overall goal of minimizing its impact on the in an economically and socially acceptable way is a challenge for the coming decades.     Khulna is a typical city in Bangladesh faced with growing urban environmental problems. Global concerns for urban environmental pollution are increasing day by day. The rapid g...

Introduction to Project Management Tools

 Save the Children’s Project Management Methodology (PMM) includes a set of tools that help us prepare, design and implement our projects with quality and time efficiency.  You will use some of these tools in the PRIME system. These tools have been co-designed with staff across the organisation, looking at our current ways of working, best practice and what our peer organisations are doing. The following tools are fundamental to good project management: Needs Assessment  Logframe Detailed Implementation Plan HR Plan MEAL Plan* (and MEAL PIRS) Budget  Procurement Plan  IPTT(within Logframe) Action Tracker Project Design Tool Problem and Objective Trees Work Breakdown Structure (WBS) Project Org Chart  Project Charter Stakeholder Power Map  Stakeholder Register and Engagement Plan Sustainability and Exit Strategy Authority Matrix  Proposal & Award Risk Tool (PART)