Is SHELL needed to be a Data Scientist?
In short, it depends on the work. Most well-known Data Scientists have some knowledge of SHELL scripting.
What Is SHELL?
In computing, a shell is a user interface for access to an operating system’s services. In general, operating system shells use either a command-line interface or graphical user interface, depending on a computer’s role and particular operation.
How is SHELL Used?
The shell issues the prompt, $, which is called the command prompt. While the prompt is displayed, you can type a command. The shell reads your input after you press Enter and determines which command to execute by looking at the first word of your input. A word is an unbroken set of characters. Spaces and tabs separate words.
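The word-splitting rule can be seen in a small sketch. `count_words` is a hypothetical helper defined only for this illustration; it prints how many words the shell passed to it after the command name.

```shell
# count_words is a made-up helper for this sketch, not a standard command.
# "$#" expands to the number of arguments the shell passed to the function.
count_words() {
    echo "$#"
}

# The first word selects the command; the remaining words become its arguments.
count_words head -n 2 data.csv        # prints 4

# Extra spaces between words do not create extra words.
count_words head     -n    2          # prints 3

# Quotes keep spaces inside a single word.
count_words "head -n 2" data.csv      # prints 2
```

The last call shows why file names that contain spaces should be quoted: without the quotes, the shell would split them into several words.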
In Unix, there are two major types of shells −
- Bourne shell − If you are using a Bourne-type shell, the $ character is the default prompt.
- C shell − If you are using a C-type shell, the % character is the default prompt.
The Bourne-type shells include the following −
- Bourne shell (sh)
- Korn shell (ksh)
- Bourne Again shell (bash)
- POSIX shell (sh)
The different C-type shells follow −
- C shell (csh)
- TENEX/TOPS C shell (tcsh)
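To find out which family your own shell belongs to, something like the following sketch may help. `shell_family` is a hypothetical function written for this article; it only knows the shell names listed above, and it assumes the `SHELL` environment variable holds the path of your login shell.

```shell
# shell_family is a made-up helper: it maps a shell name to the families
# described above and reports "unknown" for anything not in those lists.
shell_family() {
    case "$1" in
        sh|ksh|bash) echo "Bourne-type" ;;
        csh|tcsh)    echo "C-type" ;;
        *)           echo "unknown" ;;
    esac
}

# SHELL usually holds a path such as /bin/bash; basename keeps the last part.
# If SHELL is not set, this sketch falls back to "sh".
shell_family "$(basename "${SHELL:-sh}")"
```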
How Do Data Scientists Use SHELL?
In the shell, Data Scientists often build small pipelines for their data. They write scripts for tasks that recur on the same kind of data. Suppose you regularly have to print the first and the last date from one or more files. In that case, you can put the commands in a single script ( .sh ) file and run the whole operation in about a second.
So, what will be the Script to print the first and the last date?
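    # Loop over every file name passed to the script.
    for filename in "$@"
    do
        # Line 1 is assumed to be a header, so line 2 holds the first record.
        head -n 2 "$filename" | tail -n 1
        # The last line holds the last record.
        tail -n 1 "$filename"
    done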
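For each file, this prints the second line (the first record, assuming the first line is a header) and the last line. If the rows are sorted by date, those two lines carry the first date and the last date of the dataset.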
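To try the idea end to end, here is a minimal sketch under a few assumptions: the sample file `sales.csv` and its contents are invented for illustration, the date is assumed to sit in the first comma-separated column, and `first_last_date` is a hypothetical function name. Unlike the loop above, this variant adds `cut` so that only the date is printed rather than the whole line.

```shell
#!/bin/sh
# Hypothetical sample data: a header row, then rows sorted by date.
tmpdir=$(mktemp -d)
cat > "$tmpdir/sales.csv" <<'EOF'
date,amount
2021-01-03,10
2021-02-14,25
2021-03-30,7
EOF

# first_last_date (a name invented for this sketch) prints the first and
# the last date of each file given as an argument.
first_last_date() {
    for filename in "$@"; do
        # Line 2 is the first record because line 1 is the header.
        head -n 2 "$filename" | tail -n 1 | cut -d , -f 1
        # The last line is the last record.
        tail -n 1 "$filename" | cut -d , -f 1
    done
}

first_last_date "$tmpdir/sales.csv"
# prints:
# 2021-01-03
# 2021-03-30

# Remove the temporary sample data.
rm -r "$tmpdir"
```

Saved as a script, the same function body could be called with several files at once, since `"$@"` expands to every argument while keeping names with spaces intact.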