Skip to main content

Data cleanings? Describe the steps of Data cleaning.


Data cleaning refers to the process of identifying and correcting (or removing) errors and inconsistencies in a dataset so that it can be analyzed and used effectively. This may involve removing duplicates, handling missing values, converting data into a consistent format, and more. The goal of data cleaning is to make sure that the data is accurate, complete, and trustworthy.

The steps in the data-cleaning process typically include:
  1. Inspection: Examine the data to identify any errors or inconsistencies.
  2. Data type conversion: Convert the data into a consistent format, such as converting strings to numbers or dates to a standard format.
  3. Handling missing values: Impute or remove missing values as appropriate.
  4. Outlier detection and treatment: Identify and correct outliers that may impact analysis.
  5. Duplicate removal: Remove duplicate records from the data
  6. Validation: Verify the accuracy and consistency of the data after cleaning.
  7. Saving the cleaned data: Save the cleaned data in a format that can be used for analysis.
Note: The specific steps involved in data cleaning may vary depending on the type of data and the intended use of the data.

Comments

Popular posts from this blog

Guidelines for Data Quality Assessment (DQA)

                                                                                                                                                          Guidelines for  Data Quality Assessment (DQA) What is Data Quality Assessment (DQA)? DQA stands for Data Quality Assessment or Data Quality Audit. It is a systematic process of evaluating the quality of data that is being collected, processed, stored, and used in a program or project. The objective of DQA is to identify and address any issues or challenges related to data quality that may affect the validity, reliability, and usefulness of the data....

HOUSING PROBLEMS FOR INDUSTRIAL WORKERS IN KHULNA CITY:A CASE STUDY ON SELECTED INDUSTRIES LOCATED IN WARD 8, 11 AND 13

1.1 Background of the study Housing is one of the basic needs of man after food and clothing. It provides shelter, safety and a sense of belonging to the owner. It also provides privacy, promotes health and comforts, and provides a basis for employment and income generation. More over a planned and well-designed house provides a favorable environment for human resource development. Housing means not only a structure but also a combination of both structure and infrastructure and services needed for living. Today, there is an acute housing crisis in the country, in the rural as well as in the urban areas. From the very beginning of human civilization people used to build shelter, which later turned todayā€™s residence. Modern human civilizations justify residence in different points of view, such as the location, design, orientation, accessibility, environmental feasibility, services facilities etc. Khulna is well known as an industrial as well as divisional city of Banglade...

Child Labour and Protection of Human Rights : A Study on the Notun Bazaar, Shekpara and Railway Station of Khulna city.

ABSTRACT This study has done on the child labours of some selected points of the Khulna city. Child labour is a complex problem. It is controversial and emotional issues for the world, but the scenario is acute in the developing countries. The number of child labourers from 5 to 14 years is 250 million in the world today and of them 61% belongs in Asia . Bangladesh is contested terrain in this context and contained 6.5 million child labourers who constitute 16.6% of the total labour force of the country. From the Constitution of the Peoples Republic of Bangladesh and the existing laws it is shown that the human rights of the child labourers are being violated. The study aims to identify existing situation of child labourers in the study area. It also identifies the condition of human rights in respect of child labours and show how they are being violated. The Shekpara, Notun Bazaar and Railway Station of Khulna city have been selected as the study area. Data has been ...