Learn Data @ Bash Shell


A simple course demonstrate the use of Bash shell in processing real-world data sets

...

Learn Data @ Bash Shell


A simple course demonstrate the use of Bash shell in processing real-world data sets

How to do Data Science @ command line?


Students, engineers, data scientists, sysadmins — who want to parse some (giga to tera bytes of?) text data to get the relevant information out by removing noises, often ask these questions, “how can I do that? how can I quickly form and use a Regex? What’s Regex by the way?”.


Lear more!
...

Learn Bash and take your first step into the Data Sciences!


Proejct 1: University Ranking Data

In this project, using a dataset called ‘US News Universities Rankings 2017’ we will explore different features and learn Bash shell’s head, cut, grep and so on.

Project 2: Facebook Data

The goal of this project is to find the most vibrant status message on a FB page, with just one Bash command (learn Bash functions and more).

Project 3: AU Crime Data

In this project, mining a historical dataset provided by the AFP we will find different stats on crimes per Australian city (awk, sort, and so on)

Project 4: Text mining

In this project, we will use containing plays and poems stats from the Shakespeare-era and find Shakespeare’s most freq words. Learn awk, Bash functions, and so on !

Tutorials: Bash, awk, regex and so on

If you haven’t used Bash before, don’t worry! The tutorial section will introduce with bash scripting, regular expressions, AWK, sed, grep and so on.

Beyond the text files

Finally, it gives you a concise beginner friendly guide to the big data landscape including an overview of the critical Big Data tools such as HDFS, MapReduce, etc.

Learn to Analyze Data in Bash Shell – Course Formats


Choose any of the following formats that suits your need to get started with

EBook

This book starts with some practical bash-based flat file data mining projects involving: University ranking data, Facebook data, Crime Data Shakespeare-era plays and poems data. If you haven’t used Bash before, feel free to skip the projects and get to the tutorials part. Read the tutorials and then come back to the projects again. The tutorial section will introduce with bash scripting, regular expressions, AWK, sed, grep and so on. Finally, it gives you a concise beginner friendly guide to the big data landscape including an overview of the critical Big Data tools such as HDFS, MapReduce, YARN, Flume, Hive and more. The book finishes with a near-complete list of references to all the relevant command line and Big data tools.

Leanpub.com


Video Lectures

Animated video lectures professionally produced explaining three of the four projects: University ranking data, Facebook data, Crime Data with bash scripting, regular expressions, AWK, sed, grep and so on.

Udemy.com


Interactive

An innovative project-based data learning variant of the course (includes video lectures). It demonstrates the use of Bash shell (Bash, sed and awk including RegEx) in processing textual data. It can help to learn to sort, search, match, replace, clean and optimize various aspects of data with Bash Shell. The target audience (students, researchers, scientists, journalists, data miners, developers) didn’t have to go through any tough learning curve. This course also should have helped RedHat, SuSE and Ubuntu Linux learners and Data Science enthusiasts.

Educatve.io


Have you seen our Learn to Use HPC and Supercomputers course?