Search in the webpage

Learn Data with Bash Shell – Animated Videos + PDF 👍

Course Introduction

Image Alt

Learn Data with Bash Shell – Animated Videos + PDF 👍

  /  Linux  /  Learn Data with Bash Shell – Animated Videos + PDF 👍

Learn Data with Bash Shell – Animated Videos + PDF 👍


About this course

This beginner friendly course is specifically designed to show you how to use Bash commands and shell programming to handle textual data which can be a csv format data or systems log file. In this course you will  learn Bash by doing projects. However, you need to understand the fact that Bash may not the best way to handle all kinds of data! But there often comes a time when you are provided with a pure Bash environment, such as what you get in the common Linux based Super-computers and you just want an early result or view of the data before you drive into the real programming, using Python, R and SQL, SPSS, and so on. Expertise in these data-intensive languages also comes at the price of spending a lot of time on them.


In contrast, bash scripting is simple, easy to learn and perfect for mining textual data! Particularly if you deal with genomics, microarrays, social networks, life sciences, and so on. It can help you to quickly sort, search, match, replace, clean and optimise various aspect of your data, and you wouldn’t need to go through any tough learning curves. We strongly believe, learning and using Bash shell scripting should be the first step if you want to say, Hello Big Data!


This course starts with some practical bash-based flat file data mining projects involving:

  • University ranking data [Previews: Part I, Part II, Part III]
  • Facebook data [Previews: Part I, Part II]
  • AU crime data
  • Shakespeare era plays and poems data (text mining)

(Data sets and documentations are included)


If you haven’t used Bash before, feel free to skip the projects and get to the tutorials part (supporting materials: eBook). Read the tutorials and then come back to the projects again. The tutorial section will introduce with bash scripting, regular expressions, AWK, sed, grep and so on. Finally, it gives you a concise beginner friendly guide to the big data landscape including an overview of the critical Big Data tools such as HDFS, MapReduce, YARN, Flume, Hive and more. The course finishes with a near-complete list of references to all the relevant command line and Big data tools.


Authored by Ahmed Arefin, PhD and thankfully voiced by A. Collinwood (voice artist). This awesome course is a component of the Learn Scientific Computing project.

How to get the course?

You can buy the course via the button, it will appear in your Gumroad account’s dashboard > library  section.



Team - Learn Scientific Programming


Can I just enroll in a single course? I'm not interested in the entire Specialization?

Yes, you can!

What is the refund policy?

Full refund available for the paid courses.

What background knowledge is necessary?

Basic Linux knowledge.

Do i need to take the courses in a specific order?

No, however this course is a core component of the ‘Learn Scientific Programming‘ initiative.


Course Introduction 0/2


Learn to analyze data with Bash - course introduction

2 videos
1:59 minutes
1:27 minutes