wisemonkeys logo
FeedNotificationProfileManage Forms
FeedNotificationSearchSign in
wisemonkeys logo

Blogs

Types of Big Data

profile
Abrar Ahmed Khan
Mar 13, 2022
0 Likes
0 Discussions
116 Reads

Since the invention of computers, people have used the term data to refer to computer information, and this information was either transmitted or stored. But that is not the only data definition; there exist other types of data as well. So, what is the data? Data can be texts or numbers written on papers, or it can be bytes and bits inside the memory of electronic devices, or it could be facts that are stored inside a person’s mind. Now, if we talk about data mainly in the field of science, then the answer to “what is data” will be that data is different types of information that usually is formatted in a particular manner or the quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media.

Then comes the question “what is Big Data”? As per the definition on the Oracle website, “The definition of big data is data that contains greater variety, arriving in increasing volumes and with more velocity. This is also known as the three Vs, Volume, Velocity, Variety. Put simply, big data is larger, more complex data sets, especially from new data sources. These data sets are so voluminous that traditional data processing software just can’t manage them. But these massive volumes of data can be used to address business problems you wouldn’t have been able to tackle before.” Size is the first, and at times, the only dimension that leaps out at the mention of big data.

Now that I have addressed the elephant in the room, lets understand what are the different types of data.

Following are the types of Big Data:

  1. Structured
  2. Unstructured
  3. Semi-structured

Structured

Any data that can be stored, accessed and processed in the form of fixed format is termed as a ‘structured’ data. Over the period of time, talent in computer science has achieved greater success in developing techniques for working with such kind of data (where the format is well known in advance) and also deriving value out of it. However, nowadays, we are foreseeing issues when a size of such data grows to a huge extent, typical sizes are being in the rage of multiple zettabytes.

Examples Of Structured Data

An ‘Employee’ table in a database is an example of Structured Data

Employee_ID

Employee_Name

Gender

Department

Salary_In_lacs

2365 

Rajesh Kulkarni 

Male 

Finance

650000

3398 

Pratibha Joshi 

Female 

Admin 

650000

7465 

Shushil Roy 

Male 

Admin 

500000

7500 

Shubhojit Das 

Male 

Finance 

500000

7699 

Priya Sane 

Female 

Finance 

550000

 

 

Unstructured

Any data with unknown form or the structure is classified as unstructured data. In addition to the size being huge, un-structured data poses multiple challenges in terms of its processing for deriving value out of it. A typical example of unstructured data is a heterogeneous data source containing a combination of simple text files, images, videos etc. Now day organizations have wealth of data available with them but unfortunately, they don’t know how to derive value out of it since this data is in its raw form or unstructured format.

Examples Of Un-Structured Data

The output returned by ‘Google Search’

Screenshot of Google search

Example Of Un-Structured Data

 

Semi-structured

Semi-structured data can contain both the forms of data. We can see semi-structured data as a structured in form but it is actually not defined with e.g. a table definition in relational DBMS. Example of semi-structured data is a data represented in an XML file.

Examples Of Semi-Structured Data

Personal data stored in an XML file-

<rec><name>Mathew Thomas</name><sex>Male</sex><age>26</age></rec>

<rec><name>Abrar Khan</name><sex>Male</sex><age>26</age></rec>

<rec><name>Laura Whistler</name><sex>Female</sex><age>29</age></rec>

<rec><name>Jim Halpert Roy</name><sex>Male</sex><age>37</age></rec>

<rec><name>Pam Beasley</name><sex>Female</sex><age>35</age></rec>


Comments ()


Sign in

Read Next

BLOCKCHAIN MACHANISM

Blog banner

Smartsheet

Blog banner

Every body is beautiful

Blog banner

Multiple-Processor Scheduling in Operating System

Blog banner

Deadlock and Starvation

Blog banner

Smart Homes | Zigbee Alliance

Blog banner

Cyber Forensics in a Ransomware Attack Recovery

Blog banner

Service Strategy principles

Blog banner

Cycling

Blog banner

Theads

Blog banner

INTRANET

Blog banner

Logical and physical address

Blog banner

JIRA SOFTWARE

Blog banner

DNS Cache

Blog banner

MySQL

Blog banner

I/O Management and Disk Scheduling

Blog banner

VIDEO INTERVIEWS : A NEW ECOSYSTEM TO GET DREAM JOBS

Blog banner

Deadlock

Blog banner

MULTITHREADING:ENHANCEING PERFORMANCE AND EFFICIENCY IN COMPUTING

Blog banner

Linux Memory Management

Blog banner

Deadlock and Starvation in an Operating System

Blog banner

Fault tolerance

Blog banner

SmartData Collective: Data Science aur Analytics ki Duniya

Blog banner

Why is ITSM important in IT organization?

Blog banner

Dekkers Algorithm

Blog banner

Blog on Smartsheet.

Blog banner

Stop Racism

Blog banner

How return on investment is defined in IT services

Blog banner

Utilizing Data-Hiding and Retrieval Techniques in Cyber Forensics

Blog banner

Deadlock and Starvation

Blog banner

What is OS Fingerprinting?

Blog banner

BUSINESS MODELS OF E COMMERCE

Blog banner

Real-Time Operating Systems (RTOS) Deep Explanation

Blog banner

Digital Forensics Challenges and Tools

Blog banner

Social Media Sentiment Analysis

Blog banner

My Favorite Country

Blog banner

OPERATING SYSTEM

Blog banner

Nature’s Brush on Silk: The Secret Behind Patola Colours

Blog banner

Virtual Machine

Blog banner

AI and cyber Security

Blog banner

SIEM Empowering Security

Blog banner

Supervised and Unsupervised Learning

Blog banner