Process

I used to have some experience with basic shell command before, therefore this exercise was not hard for me to get start with. It gave me a chance to refresh my knowledge about shell command as well. I also found the website http://www.thegeekstuff.com/2010/11/50-linux-commands/ very useful for shell command beginners and learned a lot from it.

Exercise

The command sort -n compare according to string numerical value, rather than character by character.

wc -l < mydata.dat：wc -l doesn't have any command line parameters, it reads from standard input and send the contents of mydata.dat to wc's standard input. wc -l mydata.dat: wc gets a command line parameter telling it to open mydata.dat and get the output result.

uniq is faster when working with large data sets, it removes only adjacent duplicated lines by comparing line by line. We can also use sort salmon.txt| uniq to remove all duplicated lines.

we use "head -5" to get the first 5 lines of the animals.txt:

2012-11-05,deer
2012-11-05,rabbit
2012-11-05,raccoon
2012-11-06,rabbit
2012-11-06,deer

Then use "tail -3" to get the last three lines of previous output.

2012-11-05,raccoon                                                                                                                                                                 
2012-11-06,rabbit                                                                                                                                                                  
2012-11-06,deer

"sort -r > final.txt" sort the previous outcome and save the result as "final.txt".

2012-11-06,rabbit
2012-11-06,deer
2012-11-05,raccoon

cut -d, -f 2 animals.txt | sort | uniq

SILShack Spring 2014

Lei's Pipes & Filters

Process

Exercise