Introduction to Data Science

What is Data Science?

Data Science, as the name suggests, is a field of science that deals with data. It combines the power of computers and mathematics to analyze data, extract important information from it, and process that information into a useful output.

How can we use Data Science?

There are two ways in which we can use data science:

Finding a solution to a problem by analyzing the data.
Analyzing the data to come up with new ideas that can be implemented, or with new problems that the data can help solve.

Classifications of Data Science

Data Science can be classified into the following:

Data Collection
Data Analysis
Data Visualization

We will take a brief look at each of these three…

Data Collection

In philosophy, data are the things that are known or assumed as facts and that form the basis of reasoning and calculation. Collecting data is one of the most common things humans have done for ages.

Our ancestors recorded data on rocks and stones to remember the number of their cattle, to preserve memories of their lives, or to pass on the knowledge they had gained to the next generation.

In the modern world, the basic purpose of collecting data is to use it to find solutions to existing problems.

We collect data mainly in these forms:

Sound data
Visual data
Text data

Types of data

The two main types of data are:

Structured data

Structured data is information that is organized in a fixed layout. For example, a data set that contains names and roll numbers in two different columns.

Unstructured data

Unstructured data is information that has not been processed into a fixed layout. Examples are IoT sensor data, emails, chats, etc.
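To make the difference concrete, here is a minimal Python sketch. The student names, roll numbers and the chat message are made up for illustration. It shows that a fact can be looked up directly in structured data, while the same fact has to be searched for in unstructured text.

```python
import re

# Structured data: every record has the same named fields (columns).
# The names and roll numbers below are made up for illustration.
students = [
    {"name": "Asha", "roll_number": 1},
    {"name": "Ravi", "roll_number": 2},
    {"name": "Meera", "roll_number": 3},
]

# Because the layout is known, looking something up is a one-liner.
roll_by_name = {row["name"]: row["roll_number"] for row in students}

# Unstructured data: free text with no fixed layout (a made-up chat message).
message = "Hi, this is Ravi. My roll number is 2, please update the list."

# To get the same fact out of free text, we first have to search for it.
match = re.search(r"roll number is (\d+)", message)
roll_from_text = int(match.group(1)) if match else None

print(roll_by_name["Ravi"])  # 2
print(roll_from_text)        # 2
```

The regular expression only works for this one phrasing, which is exactly why unstructured data usually needs more processing before it can be analyzed.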

Data Analysis

Now that we have collected the data, we need to analyze it to find the solution to our problem.

Data analysis is the process of analyzing data using tools like R, Python, MATLAB, etc. We can use the libraries available in these programming languages to analyze data, for example by plotting graphs or charts.

For example, consider the problem of housing price prediction. Imagine we have a dataset containing the prices of houses over the past 10 years. We would like to use this data to predict the price of a house in the coming year.

One way we could do this is by plotting a graph with the years on the x-axis and the house prices on the y-axis. When we plot the data like that, we can see the pattern in which house prices are increasing or decreasing over time.

Using this trend, we can then predict the likely change in the price of a house in the coming years.
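The idea of following the trend can be sketched in a few lines of Python. The prices below are idealized, made-up numbers, and fitting a straight line by least squares is only one simple way to capture a trend. The helper name `fit_line` is our own, not part of any library.

```python
# Hypothetical average house prices (in thousands) for the past 10 years.
years = list(range(2011, 2021))
prices = [200, 210, 220, 230, 240, 250, 260, 270, 280, 290]


def fit_line(xs, ys):
    """Least-squares straight line: y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(years, prices)

# Extend the fitted line one year past the data to get a prediction.
predicted_2021 = slope * 2021 + intercept
print(f"Yearly change: {slope:.1f}, predicted price for 2021: {predicted_2021:.1f}")
```

Real price data is rarely this regular, so in practice we would also check how well the line fits before trusting the prediction.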

Data Visualization

Data visualization is a tool used to explain data through graphical representations. It helps the data analyst understand patterns, outliers and trends in the data.

The data analyst can also use visualization techniques to present findings to the customer in the form of graphs, charts and maps.

Some of the libraries in Python for data visualization are:

Plotly
Seaborn
Ggplot
Altair
Matplotlib
Bokeh
Folium

If we are not using a programming language for visualization, we can use the tools below:

Google charts
Tableau
Xplenty
Hubspot
Whatagraph

Data visualization example

We shall see an example of visualizing data about five machines A, B, C, D and E for the period 01-10-2020 to 07-10-2020, done in the Python programming language using the Plotly library.
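As a rough sketch of what such a visualization could look like, the code below describes a line chart as a plain dictionary in the data/layout form that Plotly figures use. The daily output numbers are placeholders we made up, not the article's dataset. We also assume the period means 1 to 7 October 2020 and that the measured quantity is units produced per day. The helper names `build_figure` and `show_figure` are our own.

```python
# Made-up daily output (units) for five machines, 01-10-2020 to 07-10-2020.
# These numbers are placeholders, not the article's dataset.
dates = [f"{day:02d}-10-2020" for day in range(1, 8)]
output_by_machine = {
    "A": [50, 52, 49, 55, 53, 51, 54],
    "B": [40, 42, 41, 39, 43, 44, 42],
    "C": [60, 58, 61, 62, 59, 60, 63],
    "D": [30, 35, 33, 31, 34, 36, 32],
    "E": [45, 44, 46, 47, 45, 48, 46],
}


def build_figure(dates, series):
    """Describe a line chart as a dict with 'data' (traces) and 'layout'."""
    traces = [
        {
            "type": "scatter",
            "mode": "lines+markers",
            "name": f"Machine {name}",
            "x": dates,
            "y": values,
        }
        for name, values in series.items()
    ]
    layout = {
        "title": {"text": "Daily output per machine (illustrative data)"},
        "xaxis": {"title": {"text": "Date"}},
        "yaxis": {"title": {"text": "Units produced"}},
    }
    return {"data": traces, "layout": layout}


def show_figure(figure):
    """Render the chart with Plotly if it is installed."""
    try:
        import plotly.graph_objects as go
    except ImportError:
        print("Plotly is not installed; skipping the chart.")
        return
    go.Figure(figure).show()


figure = build_figure(dates, output_by_machine)
print(len(figure["data"]), "traces prepared")
```

Calling `show_figure(figure)` opens the interactive chart when Plotly is available. Keeping the figure as a plain dictionary makes it easy to inspect what will be plotted before rendering it.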

Subsets of Data Science

Artificial Intelligence

Artificial Intelligence (AI) is the intelligence that enables machines to think like a human and find solutions to problems with little or no human intervention. There are mainly 3 types of AI:

Artificial Narrow Intelligence (ANI)

Narrow AI is the most common form of AI that machines have these days. ANI allows a machine to be automated and to do a particular task, or a small set of tasks, on its own with very little or no human intervention.

It doesn’t have emotions, feelings or consciousness. It cannot do a wide variety of tasks unless it is programmed for them.

Examples:

Self-driving cars
Autopilot
Spam Filters
Chatbots

Artificial General Intelligence (AGI)

This type of AI would exhibit human-level intelligence, and so far it can only be seen in sci-fi movies. It would be hard to distinguish from a normal human and would be able to show emotional intelligence.

It could think like a human being and would be able to solve problems based on the situation rather than just system needs. In other words, if a particular solution to a problem might be harmful to someone else, the machine might choose another solution.

Artificial Super Intelligence (ASI)

This type of AI would have a level of intelligence far superior to that of humans and would be able to think much faster than us. It would have greater problem-solving skills and could update itself, with each version more brilliant than the one before, all in just a matter of days.

It would have the ability to evolve quickly into better versions of itself. This type of intelligence could even be a threat to our existence.

Machine Learning

Machine learning is the process of teaching a machine to accept inputs and do calculations, based on algorithms built upon statistics and probability, to come up with an output that is close or equal to the expected output.

We can see machine learning in our day-to-day life. For example, the recommendation systems behind YouTube or Instagram ads are based on machine learning: data about what the user clicks and likes the most is fed into a system, the system learns about the user’s interests, and it suggests the content that the user is most likely to be interested in.
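A deliberately simplified Python sketch of this idea is shown below. It only counts which categories a user clicks most often. Real recommendation systems use far more elaborate models, and the click history and the function name `top_interests` are made up for illustration.

```python
from collections import Counter

# Hypothetical click history: the category of each video a user clicked.
clicks = ["cooking", "football", "cooking", "travel", "cooking", "football"]


def top_interests(click_history, how_many=2):
    """Return the categories the user clicked most often."""
    counts = Counter(click_history)
    return [category for category, _ in counts.most_common(how_many)]


print(top_interests(clicks))  # ['cooking', 'football']
```

The more clicks are fed in, the better the counts reflect the user's interests, which is the learning from data part of the description above.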

Machine learning is classified mainly into 3 types of learning:

1. Supervised Learning

Let’s say we want our machine to pick out images of apples from a set of other images. In supervised learning, we initially provide the ML model with input images and with labels that give the name of the fruit in each image.

An ML model is a set of algorithms that learns different features from input data and gives an output.

The model compares each image with its label and learns the features that map a particular image to a particular label.

When we then give the model a new image, it can identify the same features that it saw in the training data and map the image to the matching label.

Common supervised learning problems:

Classification: Classification groups the output into categories that were previously given to the model as labels. For example 0, 1, cat, mouse, apple, mango, etc.
Regression: Regression is used to predict a continuous quantity. For example, predicting the current temperature in a room. Stock market price prediction is another example.
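Classification from labelled examples can be sketched with a nearest-neighbour rule: a new item gets the label of the most similar training example. This is only one simple technique among many. The fruit measurements, the two features (weight and redness) and the scaling constant are assumptions made up for this sketch.

```python
import math

# Made-up training data: (weight in grams, redness from 0 to 1) and a label.
training_data = [
    ((150, 0.90), "apple"),
    ((170, 0.80), "apple"),
    ((160, 0.85), "apple"),
    ((300, 0.20), "mango"),
    ((320, 0.30), "mango"),
    ((280, 0.25), "mango"),
]

MAX_WEIGHT = 400.0  # assumed upper bound, used only to scale the weight


def scale(features):
    """Bring both features to a similar range so weight does not dominate."""
    weight, redness = features
    return (weight / MAX_WEIGHT, redness)


def classify(features):
    """Return the label of the closest labelled example (1-nearest neighbour)."""
    query = scale(features)
    nearest = min(training_data, key=lambda item: math.dist(scale(item[0]), query))
    return nearest[1]


print(classify((155, 0.88)))  # apple
print(classify((310, 0.22)))  # mango
```

The labels in `training_data` are what makes this supervised: the rule can only answer with categories it was given in advance.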

2. Unsupervised Learning

In unsupervised learning, the model is provided with input data without any labels. The model would categorize the data into different groups based on similar features. Unsupervised learning is mainly used for two types of problems:

Clustering: Clustering identifies features that are similar across data points and groups the data according to these similarities. The model itself groups the input data. For example, clustering people into different groups based on the spread of COVID-19 in their area.
Association: Association finds items that tend to occur together. For example, associating a particular product with a buyer based on another product they bought recently (mapping).
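Clustering can be sketched with the k-means technique, here on single numbers to keep it short. The weekly case counts per area are made up, and choosing two groups that start at the smallest and largest value is an assumption of this sketch. The function name `k_means_1d` is our own.

```python
# Made-up weekly case counts for eight areas (no labels attached).
cases = [3, 5, 4, 6, 48, 52, 50, 47]


def k_means_1d(values, centroids, iterations=10):
    """Plain k-means on single numbers, starting from the given centroids."""
    centroids = list(centroids)
    groups = []
    for _ in range(iterations):
        # Step 1: assign every value to its nearest centroid.
        groups = [[] for _ in centroids]
        for value in values:
            nearest = min(
                range(len(centroids)), key=lambda i: abs(value - centroids[i])
            )
            groups[nearest].append(value)
        # Step 2: move each centroid to the mean of its group.
        centroids = [
            sum(group) / len(group) if group else centroid
            for group, centroid in zip(groups, centroids)
        ]
    return centroids, groups


centroids, groups = k_means_1d(cases, centroids=[min(cases), max(cases)])
print(groups)     # [[3, 5, 4, 6], [48, 52, 50, 47]]
print(centroids)  # [4.5, 49.25]
```

Nobody told the code which areas are low or high. The two groups come purely from the similarity of the numbers.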

3. Reinforcement Learning

It’s like teaching a child what’s right and wrong. If the child does something right, we show our appreciation with chocolates, gifts, etc., and if the child does something wrong, we give feedback. The next time, the child knows whether an action is good or bad based on the feedback or rewards received for doing the same thing before.

So, reinforcement learning is a reward-based system in which an agent interacts with an environment by performing actions and learns from the rewards (either negative or positive) it obtains from an interpreter. There is no predefined data and no supervision. The agent follows a trial-and-error method for learning: it has to come up with an output by itself, and we only tell it whether the output is right or wrong.

Examples:

Self-driving cars, where the environment is the road and the interpreter (error signal generator) is a human in the driving seat. The human sends a signal based on the direction the car takes automatically, the lane changes it makes, or whether it follows the rules while parking.

An automated machine that sorts products into different groups based on their weight. The person who monitors the task generates a negative error signal if the machine classifies a product wrongly and a positive response if the machine classifies it correctly.
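The product-sorting example can be sketched as a small trial-and-error loop. This is a one-step simplification of reinforcement learning in which every decision is rewarded immediately, not a full algorithm. The bin names, the reward values of +1 and -1, and the exploration and learning-rate settings are all assumptions made for this sketch.

```python
import random

ACTIONS = ["light_bin", "heavy_bin"]
# The right answer is known only to the supervisor, never shown to the agent.
CORRECT_BIN = {"light": "light_bin", "heavy": "heavy_bin"}


def supervisor_feedback(state, action):
    """Play the supervisor: +1 for the right bin, -1 for the wrong one."""
    return 1.0 if CORRECT_BIN[state] == action else -1.0


def train(steps=200, epsilon=0.2, learning_rate=0.5, seed=0):
    rng = random.Random(seed)
    # Estimated value of each action for each kind of product, 0 at first.
    values = {state: {action: 0.0 for action in ACTIONS} for state in CORRECT_BIN}
    for step in range(steps):
        state = "light" if step % 2 == 0 else "heavy"
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)  # explore: try something at random
        else:
            # exploit: pick the action that looks best so far
            action = max(ACTIONS, key=lambda a: values[state][a])
        reward = supervisor_feedback(state, action)
        # Nudge the estimate for this choice towards the reward received.
        values[state][action] += learning_rate * (reward - values[state][action])
    return values


values = train()
policy = {state: max(ACTIONS, key=lambda a: values[state][a]) for state in values}
print(policy)
```

After training, `policy` maps each kind of product to the bin that earned positive feedback, even though the agent was never given the correct answers directly.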

In addition to these, there is another type of learning called semi-supervised learning, in which some of the data is labelled and the rest is unlabelled.

4. Deep Learning

Deep learning is a subset of machine learning in which we use artificial neural networks for supervised, unsupervised, and reinforcement learning tasks.

Artificial Neural Networks (ANNs) are inspired by the neurons in the human brain. In deep learning, we connect multiple layers of neurons. Each layer learns particular features from its input, and its output is passed through a function (usually called an activation function) that decides how strongly each feature is passed on as input to the next layer, and so on, until the final layer gives us an output.
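The flow from one layer to the next can be sketched as follows. The weights and biases are arbitrary numbers chosen for illustration, since a real network learns them during training. The sigmoid function is used here as one common choice of activation function.

```python
import math


def sigmoid(x):
    """Squash any number into the range 0 to 1."""
    return 1.0 / (1.0 + math.exp(-x))


def layer_forward(inputs, weights, biases):
    """One layer: each neuron sums its weighted inputs, adds a bias,
    and passes the result through the activation function."""
    return [
        sigmoid(sum(w * x for w, x in zip(neuron_weights, inputs)) + bias)
        for neuron_weights, bias in zip(weights, biases)
    ]


# Arbitrary illustrative values; a real network learns these from data.
hidden_weights = [[0.5, -0.2, 0.1], [0.3, 0.8, -0.5]]  # 2 neurons, 3 inputs each
hidden_biases = [0.0, 0.1]
output_weights = [[1.0, -1.0]]                          # 1 neuron, 2 inputs
output_biases = [0.0]

inputs = [1.0, 0.5, -1.0]
hidden = layer_forward(inputs, hidden_weights, hidden_biases)  # first layer
output = layer_forward(hidden, output_weights, output_biases)  # final layer
print(hidden, output)
```

Each call to `layer_forward` is one layer, and the output of the first call becomes the input of the second, which is the chaining described above.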

Benefits of using neural networks

A neural network can have many layers, each with a number of neurons. Because the work is spread across many neurons, a single neuron that isn’t performing well usually has little effect on the overall performance. Also, what the network learns from the data (the features it has identified) is stored in the network itself in the form of numbers, so we don’t need a separate database to store it.

Neural networks can also be adapted to different tasks. The same approach can be used for solving many kinds of problems, much as our brain can do lots of things by firing different sets of neurons.

The two main areas in which Deep Learning is used the most are:

Computer vision

Computer vision is a field of artificial intelligence that uses deep learning to learn about the visual world. An image is a collection of pixel values, and in the computer we represent these values as numbers in a matrix.

These numbers are fed into a neural network, which learns the features of the image and can then either classify the image or detect an object in it.

A type of neural network called a Convolutional Neural Network (CNN) is commonly used for this. Some of the most common applications of computer vision are:

Defect detection in manufacturing
Self-driving cars
Intruder detection
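To make the point about images being matrices of pixel values concrete, the sketch below slides a small filter over a tiny image, which is the basic operation inside a convolutional layer. The 4x4 image and the 2x2 filter are made up. In a real CNN the filter values are learned rather than written by hand.

```python
# A tiny 4x4 grayscale "image": 0 is black, 255 is white (made-up values).
image = [
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [0, 0, 255, 255],
]

# A 2x2 filter that responds where brightness increases from left to right.
kernel = [
    [-1, 1],
    [-1, 1],
]


def convolve(image, kernel):
    """Slide the kernel over the image and sum the element-wise products."""
    k_rows, k_cols = len(kernel), len(kernel[0])
    out_rows = len(image) - k_rows + 1
    out_cols = len(image[0]) - k_cols + 1
    return [
        [
            sum(
                image[r + i][c + j] * kernel[i][j]
                for i in range(k_rows)
                for j in range(k_cols)
            )
            for c in range(out_cols)
        ]
        for r in range(out_rows)
    ]


feature_map = convolve(image, kernel)
for row in feature_map:
    print(row)  # each row is [0, 510, 0]
```

The large value in the middle column marks the vertical edge between the dark and bright halves of the image, which is the kind of feature the network learns to detect.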

Natural Language Processing

Natural Language Processing (NLP) is a field of artificial intelligence that uses the power of neural networks to understand human language in a useful way. NLP can be used to read, understand, and create natural language. Some of the applications of NLP are:

Google Translate
MS Word, Grammarly – for grammar check or spellcheck
Siri, Alexa – Personal Voice Assistant

If you have decided to go ahead with data science, you can refer to our next article on data science.

Thank you

About the Author

Deepak Jose is a B-Tech CS student with a passion for Data Science. He loves learning about Data Science, coding, and science in general, and does data analysis and visualization as a hobby. Even though he is on the Computer Science path, he always finds time to learn about space, automobiles, geography, energy, architecture, arts, etc. He loves solving problems and learning about new inventions.
