grep vs AWK vs Ruby, and a uniq disappointment

Posted by eldersnake on Mar 26, 2017 8:13 PM EDT
The Linux Rain; By Bob Mesibov
Mail this story
Print this story

In my data-cleaning work I often make up tallies of selected individual characters from big, UTF-8-encoded data files. What's the best way to do this? As shown below, I've tried grep/sort/uniq, AWK and Ruby, and AWK's the fastest. The trials also revealed an unexpected problem with the uniq program in GNU coreutils.

Full Story

  Nav
» Read more about: Story Type: Editorial; Groups: Developer, GNU, Linux, Standards

« Return to the newswire homepage

This topic does not have any threads posted yet!

You cannot post until you login.