Added split operations to IsSequence, EqSequence #80

richsmith92 · 2015-08-20T15:03:43Z

No description provided.

snoyberg · 2015-08-20T16:25:07Z

src/Data/Sequences.hs

@@ -411,6 +410,12 @@ class (Monoid seq, MonoTraversable seq, SemiSequence seq, MonoPointed seq) => Is
    unsafeIndex :: seq -> Index seq -> Element seq
    unsafeIndex = indexEx

+
+    -- | 'splitWhen' splits a sequence into components delimited by separators, where the
+    -- predicate returns True for a separator element.


Would you mind adding a comment about whether the separators are retained or discarded? A concrete example would be useful. At least I personally get confused about this whenever I use split-style functions.

snoyberg · 2015-08-20T16:26:57Z

In addition to the documentation comments, would you be able to add a test case for this, to codify the semantics?

richsmith92 · 2015-08-20T16:46:01Z

I'll add following (from ByteString docs):

The resulting components do not contain the separators. Two adjacent separators result in an empty component in the output.

And will work on test case as well

gregwebs · 2015-08-20T17:01:54Z

I find the naming of these unintuitive (even if it may be consistent with some of the underlying libraries).
Our current naming scheme uses on to signify a function for groupOn (in that case it is returning an Ord). So splitOn and splitWhen would mean the same thing by that convention

I think split from the patch can be renamed to splitBy or splitAt or splitElem.
Perhaps splitOn from the patch can be renamed to splitSeq ?

MaxGabriel · 2015-08-20T18:08:48Z

IIRC the different splitting operators have slightly different edge case behavior (IIRC it was how they handle being given an empty string as the separator, or how splitting on a string consisting of just the separator works). I don't remember for sure but it might be worth double checking.

richsmith92 · 2015-08-20T22:11:06Z

Considering the naming, Data.List.Split, ByteString and Text provide three different naming schemes. I followed the first, except I added lacking split :: Element seq -> seq -> [seq]. I find myself using split most of time, especially for CSV/TSV parsing, so I judged it deserves the shortest name.

I think the most intuitive short name for seq -> seq -> [seq] must be splitSeq. Following the same logic, I could rename split to splitElem, but then nice short split name is unused.

To summarize:

splitWhen is good name already
splitOn should be renamed to splitSeq (better ideas?)
split should either stay same or be renamed to splitElem

gregwebs · 2015-08-20T22:22:08Z

@w3rs I agree with your explanation. Do you use classy-prelude? One option is to re-export splitElem as split in classy-prelude. I am still wondering if there is something more intuitive: splitAt captures it for me, except that at often refers to integer indexed.

richsmith92 · 2015-08-20T22:34:49Z

Do you use classy-prelude? One option is to re-export splitElem as split in classy-prelude.

@gregwebs yes, it makes sense.

splitAt captures it for me, except that at often refers to integer indexed.

This name is actually already taken with this purpose:

splitAt :: Index seq -> seq -> (seq, seq)

gregwebs · 2015-08-21T00:19:45Z

I am favoring splitElem, but I will go with consensus (or better ideas that come up).

Rename `split` to `splitElem`, `splitOn` to `splitSeq`. Split functions now follow `Data.List.Split` rules for edge cases. These rules are also encoded in test cases.

richsmith92 · 2015-08-22T13:20:14Z

I pushed the new commit with improvements. It also has new intercalate function for IsSequence (I needed it as inverse for splitSeq)

gregwebs · 2015-08-22T13:39:40Z

Looking good! If you want a shorter name, you could use splitEq instead of splitElem

richsmith92 · 2015-08-22T13:56:07Z

@gregwebs I think I can bear a couple extra chars after all.

snoyberg · 2015-08-23T11:05:23Z

Looks good to me, I'm happy to merge thanks for all the improvements.

Can you both confirm that this is ready to go?

richsmith92 · 2015-08-23T14:57:16Z

@snoyberg since we couldn't find much better names than splitElem and splitSeq, I'm fine with what we have.

kbillings · 2015-08-23T15:16:24Z

src/Data/Sequences.hs

+    --
+    -- 'splitElem' can be considered a special case of 'splitSeq'
+    --
+    -- > 'splitSeq' (singleton sep) === 'splitElem sep'


You cannot use '' with >. It produces invalid documentation. You must wrap the statement like:

@ 'splitSeq' (singleton sep) === 'splitElem' sep @

richsmith92 · 2015-08-23T15:26:12Z

@kbillings good catch! I had no success with stack haddock and forgot to ask others to review the docs. Will remove single quotes from lines with >

gregwebs · 2015-08-23T18:54:47Z

thanks a lot!

Added split operations to IsSequence, EqSequence

Added split operations to IsSequence, EqSequence

84250cf

snoyberg reviewed Aug 20, 2015
View reviewed changes

Rework split functions, add intercalate and tests.

26da588

Rename `split` to `splitElem`, `splitOn` to `splitSeq`. Split functions now follow `Data.List.Split` rules for edge cases. These rules are also encoded in test cases.

kbillings reviewed Aug 23, 2015
View reviewed changes

Fix for haddock

80b627d

gregwebs added a commit that referenced this pull request Aug 23, 2015

Merge pull request #80 from w3rs/master

de9149f

Added split operations to IsSequence, EqSequence

gregwebs merged commit de9149f into snoyberg:master Aug 23, 2015

snoyberg added a commit that referenced this pull request Aug 24, 2015

Version bump for #80

d715304

Added split operations to IsSequence, EqSequence #80

Added split operations to IsSequence, EqSequence #80

Uh oh!

Conversation

richsmith92 commented Aug 20, 2015

Uh oh!

snoyberg Aug 20, 2015

Choose a reason for hiding this comment

Uh oh!

snoyberg commented Aug 20, 2015

Uh oh!

richsmith92 commented Aug 20, 2015

Uh oh!

gregwebs commented Aug 20, 2015

Uh oh!

MaxGabriel commented Aug 20, 2015

Uh oh!

richsmith92 commented Aug 20, 2015

Uh oh!

gregwebs commented Aug 20, 2015

Uh oh!

richsmith92 commented Aug 20, 2015

Uh oh!

gregwebs commented Aug 21, 2015

Uh oh!

richsmith92 commented Aug 22, 2015

Uh oh!

gregwebs commented Aug 22, 2015

Uh oh!

richsmith92 commented Aug 22, 2015

Uh oh!

snoyberg commented Aug 23, 2015

Uh oh!

richsmith92 commented Aug 23, 2015

Uh oh!

kbillings Aug 23, 2015

Choose a reason for hiding this comment

Uh oh!

richsmith92 commented Aug 23, 2015

Uh oh!

gregwebs commented Aug 23, 2015

Uh oh!

Uh oh!