Learn Data @ Bash Shell


A simple course demonstrating the use of the Bash shell in processing real-world data sets

...


Know more about the course

As featured on the


...
...
...

How to do Data Science @ the command line?


Bash may not be the best way to handle every kind of data. But there often comes a time when all you have is a pure Bash environment, such as the one you get on common Linux-based supercomputers, and you just want an early result or a first view of the data before you dive into the real programming with Python, R, SQL, SPSS, and so on.


Learn more!
...
Features

Learn to use Bash in processing real-world data sets


...

The dark side of data-intensive languages

Expertise in the data-intensive languages comes at the price of spending a lot of time on them. In contrast, Bash scripting is simple, easy to learn and well suited to mining textual data, particularly if you deal with genomics, microarrays, social networks, life sciences, and so on. It can help you quickly sort, search, match, replace, clean and optimise various aspects of your data, without a steep learning curve.




Say hello to big data!

Many practical data mining workflows start by importing specific data resources into flat text files. Bash can then run different programs (grep, sort, sed, and so on) on those files to clean and optimise the data, and extract preliminary views of it (cut, csvlook, view, cat, head, etc.). Another part of data mining involves taking unstructured data and transforming it into a structured form (awk, shell). A scripting language like Bash can be very useful for that transformation. We strongly believe that learning and using Bash shell scripting should be the first step if you want to say, Hello Big Data!
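As a rough sketch of that flow, the snippet below previews a flat file, cleans it and counts values in one column. The sample file, its columns and the variable names are invented for illustration and are not taken from the course material.

```shell
#!/usr/bin/env bash
# Hypothetical sample data: file contents and column names are made up.
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
id,city,amount
1,Sydney,30
2,Perth,
3,Sydney,12
4,Darwin,7
EOF

# Preliminary view: header plus the first record.
head -n 2 "$tmp"

# Clean: drop the header line (sed) and any row whose last field is empty (grep -v).
cleaned=$(sed '1d' "$tmp" | grep -v ',$')

# Extract the city column, count rows per city, most frequent first,
# then use awk to normalise the output to "city,count".
city_counts=$(printf '%s\n' "$cleaned" \
  | cut -d',' -f2 \
  | sort \
  | uniq -c \
  | sort -rn \
  | awk '{ print $2 "," $1 }')

printf '%s\n' "$city_counts"
rm -f "$tmp"
```

Each stage reads plain text and writes plain text, so any step can be removed or swapped while you explore the data.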

...



...

Rocket start your data journey!

If you haven’t used Bash before, feel free to skip the projects and go straight to the tutorials. Read the tutorials and then come back to the projects. The tutorial section introduces you to Bash scripting, regular expressions, AWK, sed, grep and so on. Finally, it gives you a concise, beginner-friendly guide to the big data landscape, including an overview of critical Big Data tools such as HDFS, MapReduce, YARN, Flume, Hive and more. The book finishes with a near-complete list of references to all the relevant command-line and Big Data tools.

Learn Bash and take your first step into the Data Sciences!


What do you learn?

...
Project 1: University Ranking Data

In this project, using a dataset called ‘US News Universities Rankings 2017’, we will explore different features of the data and learn the Bash shell’s head, cut, grep and so on. See examples (towardsdatascience.com):

[Part I] [Part II] [Part III]
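To give a flavour of head, cut and grep on a rankings-style file, here is a minimal sketch. The rows, column names and institution names below are invented placeholders; the layout of the real dataset may differ.

```shell
#!/usr/bin/env bash
# Invented sample rows standing in for a rankings CSV (not the real dataset).
unis=$(mktemp)
cat > "$unis" <<'EOF'
Name,State,Rank
Alpha University,CA,1
Beta College,NY,2
Gamma Institute,CA,3
EOF

# head: peek at the header and the first record.
preview=$(head -n 2 "$unis")

# cut: keep only the Name column (field 1), skipping the header with tail.
names=$(tail -n +2 "$unis" | cut -d',' -f1)

# grep -c: count the rows whose State field is CA.
ca_count=$(grep -c ',CA,' "$unis")

printf '%s\n' "$preview" "$names" "CA rows: $ca_count"
rm -f "$unis"
```

Note that cut with a comma delimiter assumes no field contains a quoted comma, which real CSV exports do not always guarantee.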

Project 2: Facebook Data

The goal of this project is to find the most vibrant status message on a Facebook ‘page’ with just one Bash command. In the process you will learn Bash functions, awk and more. See examples:

[Part I] [Part II]

Project 3: AU Crime Data

In this project, by mining a historical dataset provided by the AFP, we will find different stats on crimes per Australian city (awk, sort, and so on).
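A per-city total of this kind can be sketched with an awk associative array followed by sort. The file layout and the numbers below are invented for illustration and do not come from the AFP dataset.

```shell
#!/usr/bin/env bash
# Made-up sample: city, offence type, number of incidents.
crimes=$(mktemp)
cat > "$crimes" <<'EOF'
city,offence,count
Sydney,theft,10
Perth,theft,4
Sydney,fraud,5
Perth,assault,3
Darwin,theft,2
EOF

# awk: skip the header (NR > 1), add column 3 into a per-city sum,
# and print "city,total" at the end.
# sort: order by the total (field 2), numerically, largest first.
totals=$(awk -F',' 'NR > 1 { sum[$1] += $3 }
                    END { for (c in sum) print c "," sum[c] }' "$crimes" \
  | sort -t',' -k2,2nr)

printf '%s\n' "$totals"
rm -f "$crimes"
```

awk does not promise any order when looping over an array, which is why the ordering is left to sort.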

Project 4: Text mining

In this project, we will use a dataset containing stats on plays and poems from the Shakespeare era to find Shakespeare’s most frequent words. Learn awk, Bash functions, and so on!
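One common way to count word frequencies on the command line is shown below as a minimal sketch. It is not necessarily the approach the course takes, and the short input string is only a stand-in for the project's dataset.

```shell
#!/usr/bin/env bash
# Stand-in input text; the project itself works on a much larger dataset.
text='To be, or not to be, that is the question. To die, to sleep.'

# 1. lowercase everything
# 2. turn every run of non-letters into a single newline (one word per line)
# 3. sort so identical words are adjacent, then count them with uniq -c
# 4. sort by count, largest first, and keep the top entry
# 5. awk normalises the output to "word count"
top_word=$(printf '%s\n' "$text" \
  | tr '[:upper:]' '[:lower:]' \
  | tr -cs 'a-z' '\n' \
  | sort \
  | uniq -c \
  | sort -rn \
  | head -n 1 \
  | awk '{ print $2 " " $1 }')

printf '%s\n' "$top_word"
```

Very common words such as "to" tend to dominate a raw count like this, so a real analysis may want to filter them out first.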

Tutorials: Bash, awk, regex and so on

If you haven’t used Bash before, don’t worry! The tutorial section introduces you to Bash scripting, regular expressions, AWK, sed, grep and so on.

Beyond the text files

Finally, the course gives you a concise, beginner-friendly guide to the big data landscape, including an overview of critical Big Data tools such as HDFS, MapReduce, etc.

Get the awesome Course!

Learn to Analyze Data in Bash Shell & Linux


Choose any of the following, available in 3 different formats

Leanpub


PDF


EBook
Dataset
Source code


Buy now!

Udemy


Video


Video lectures
Data sets + source
Documentation


Enroll now!

Educative


Interactive


Code playgrounds
Video lectures
Data sets + Documentation


Enroll now!

Supporting Materials

For the “Learn to analyze data in Bash Shell and Linux” project.

Sample chapters

Get it now!

Source code

Get it now!

Data sets

Get it now!

Have you seen our Learn to Use HPC and Supercomputers course?