Handle Unicode emoji variants #8

colditzjb · 2017-09-29T17:47:12Z

Unicode variants of the \ufe0* type (variation selectors such as \ufe0f) are not being recoded in the parser (decode.py). We may be able to ignore these, as they are context-dependent and add little or no utility for classification purposes.

See this link:
https://stackoverflow.com/questions/38100329/some-emojis-e-g-have-two-unicode-u-u2601-and-u-u2601-ufe0f-what-does
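
If we do decide to ignore them, one option is to drop the variation selectors before the emoji lookup. A minimal sketch follows, assuming the text is already a decoded Python string; `strip_variation_selectors` is a hypothetical helper, not something that currently exists in decode.py:

```python
import re

# Variation selectors VS1 through VS16 occupy U+FE00 to U+FE0F.
# U+FE0F requests emoji presentation; U+FE0E requests text presentation.
VARIATION_SELECTORS = re.compile("[\ufe00-\ufe0f]")


def strip_variation_selectors(text: str) -> str:
    """Hypothetical helper: remove variation selectors so that
    '\u2601' and '\u2601\ufe0f' map to the same base character."""
    return VARIATION_SELECTORS.sub("", text)
```

With this in place, both forms of the cloud character from the linked question would reduce to the bare \u2601 and could share one entry in the emoji list.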

colditzjb · 2017-11-14T20:58:19Z

Check out emojitracker's list of known emoji: https://github.com/mroth/emoji_data.rb/blob/master/vendor/emoji-data/emoji.json
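
To compare that list against what we capture, the code points would need to be turned into actual characters. The sketch below assumes each entry stores its code points as a hyphen-separated hex string (for example "2601-FE0F"); that format is an assumption about the JSON file and should be checked against the file itself. `unified_to_text` is a hypothetical name.

```python
def unified_to_text(unified: str) -> str:
    """Hypothetical helper: convert a hyphen-separated hex code point
    string such as '2601-FE0F' into the corresponding Python string."""
    return "".join(chr(int(code_point, 16)) for code_point in unified.split("-"))
```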

colditzjb · 2018-08-25T00:26:01Z

After some group discussion, a few Unicode variants may be valuable for continued research (e.g., Fitzpatrick skin-tone variants, when available). How to handle Unicode variants remains an ongoing topic of discussion.
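
If the Fitzpatrick variants are kept, one possible approach is to separate the modifier from the base emoji so that both remain available for analysis. This is only a sketch; `split_skin_tone` and the label strings are hypothetical and not part of the current parser. The five modifier code points (U+1F3FB to U+1F3FF) come from the Unicode standard.

```python
# Emoji modifiers for Fitzpatrick skin types, U+1F3FB to U+1F3FF.
# The labels are illustrative, loosely following the Unicode character names.
FITZPATRICK_MODIFIERS = {
    "\U0001F3FB": "type-1-2",
    "\U0001F3FC": "type-3",
    "\U0001F3FD": "type-4",
    "\U0001F3FE": "type-5",
    "\U0001F3FF": "type-6",
}


def split_skin_tone(text: str) -> tuple[str, list[str]]:
    """Hypothetical helper: return the text with modifiers removed,
    plus the labels of any modifiers found, in order of appearance."""
    base_chars = []
    tones = []
    for char in text:
        if char in FITZPATRICK_MODIFIERS:
            tones.append(FITZPATRICK_MODIFIERS[char])
        else:
            base_chars.append(char)
    return "".join(base_chars), tones
```

Keeping the base emoji and the tone label as separate fields would let classification use the base emoji alone while still preserving the variant for later research.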

colditzjb · 2018-11-02T20:48:01Z

@sanyabt - I think this should just be a simple update to the emojilist.csv file. Should we ask one of our RAs to do this? If so, is there a list of important emoji or symbols that we're not currently capturing? (Don't worry about foreign-language Unicode characters though.)

colditzjb self-assigned this Sep 29, 2017

colditzjb assigned sanyabt Nov 2, 2018
