bug-coreutils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH] split: --chunks option


From: Pádraig Brady
Subject: Re: [PATCH] split: --chunks option
Date: Tue, 15 Dec 2009 10:16:36 +0000
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.5) Gecko/20091204 Thunderbird/3.0

On 15/12/09 08:12, Jim Meyering wrote:
Chen Guo wrote:
...
It's good to make long option names consistent between tools,
and to avoid long, common prefixes like "--number-".
Have you considered --bytes and --lines, like tail has?

Unfortunately split already uses the long options --bytes and --lines.

One possibility is to stick with the existing long option names,
but let an argument of the form "K/N" evoke the new semantics:

   --bytes=K/N  extract the K'th of N portions (byte-oriented)
   --lines=K/N  extract the K'th of N portions (line-oriented)

Since tail's -n means --lines, making "split -n 4" mean "bytes"
would be confusing.

Using a short option name like -n *may* be fine,
but you have to do a survey of all other vendor
versions of split to ensure that none of them
provide an -n option.

We're kinda stuck between a rock and a hard place. Originally
Padraig suggested -n at least in part to maintain compatibility with
BSD's split. On one hand, keeping it -n would be conflicting with
tail and head, but on the other hand not using -n would conflict with
BSD.

Actually, now that I think a little more about it, using "-n N" to mean
split on bytes makes sense, since split is primarily a byte-oriented tool,
unlike head and tail, which seem to be more line-oriented by default.

Right. Also doing -n lines:4 would allow one to specify
a distribution method which may be required. I.E. this
could be used to specify round robin distribution of lines
which might be required.

-n lines-rr:4

Also specifying other delimiters might be useful like:

-n nul:4

cheers,
Pádraig.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]