steev's thoughts » Blog Archive » Quick frequency tables in unix

Quick frequency tables in unix

I found this here and thought it should be recorded.

If you have a single series of data (in my case, AS numbers) and you want a frequency count, how can you do that on the command line?
... pipe input here ... | sort | uniq -c | sort -r -n

This 1. sorts incoming data as required by uniq, 2. outputs the unique keys and their frequency of occurrence, sorted by the key in lexicographical order, and 3. resorts the output by the frequency of occurrence in descending order, leaving you with something like:

$ grep -P " 33363$" rib.20101113.txt | awk '{print $2}' | sort | uniq -c | sort -r -n

Where the first column is the frequency and the second column is the unique key in the source data stream.

This entry was posted on Thursday, November 18th, 2010 at 1:38 am and is filed under Geekery. You can follow any responses to this entry through the RSS 2.0 feed. Both comments and pings are currently closed.

Comments are closed.