Consider Text instead of ByteString #54

tysonzero · 2019-08-28T17:47:15Z

URI's are explicitly declared to be a sequence of characters, and not a sequence of octets, as per the RFC.

Thus ByteString seems like a dangerous type to use for this purpose, as it represents a sequence of octets and not a sequence of characters.

This would also be more compatible with IRIs, as according to the RFC they are also a sequence of characters, and the characters do not fit within ASCII.

hasufell · 2023-12-29T10:04:00Z

I don't think this library clashes with the spec. The interpretation of the bytestrings is left to the caller. That actually seems like the right thing to do:

This specification does not mandate any particular character encoding for mapping between URI characters and the octets used to store or transmit those characters. When a URI appears in a protocol element, the character encoding is defined by that protocol; without such a definition, a URI is assumed to be in the same character encoding as the surrounding text.

The characters in the ABNF grammar are ASCII and as such we don't need to know the encoding to parse:

The ABNF notation defines its terminal values to be non-negative integers (codepoints) based on the US-ASCII coded character set [ASCII]. Because a URI is a sequence of characters, we must invert that relation in order to understand the URI syntax.

hasufell · 2023-12-29T10:24:42Z

That said, I'd actually say 'Text' is wrong and dangerous, because it makes the decoding choice for you (UTF-8), which is not what the spec says.

tysonzero changed the title ~~What is the reason for choosing bytestring over text~~ Consider Text instead of ByteString Aug 28, 2019

tysonzero mentioned this issue Sep 29, 2019

servant-client: replace BaseUrl with uri-bytestring haskell-servant/servant#949

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider Text instead of ByteString #54

Consider Text instead of ByteString #54

tysonzero commented Aug 28, 2019

hasufell commented Dec 29, 2023

hasufell commented Dec 29, 2023

Consider Text instead of ByteString #54

Consider Text instead of ByteString #54

Comments

tysonzero commented Aug 28, 2019

hasufell commented Dec 29, 2023

hasufell commented Dec 29, 2023