<?xml version="1.0" encoding="UTF-8"?><rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
> <channel><title>Comments on: Bugs in Twitter Text Libraries</title> <atom:link href="http://shkspr.mobi/blog/index.php/2010/03/bugs-in-twitter-text-libraries/feed/" rel="self" type="application/rss+xml" /><link>http://shkspr.mobi/blog/index.php/2010/03/bugs-in-twitter-text-libraries/</link> <description>Mobiles, Shakespeare, Politics, Usability.</description> <lastBuildDate>Tue, 07 Feb 2012 17:59:37 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <item><title>By: Matt Sanford</title><link>http://shkspr.mobi/blog/index.php/2010/03/bugs-in-twitter-text-libraries/#comment-6017</link> <dc:creator>Matt Sanford</dc:creator> <pubDate>Wed, 31 Mar 2010 15:36:20 +0000</pubDate> <guid
isPermaLink="false">http://shkspr.mobi/blog/?p=1924#comment-6017</guid> <description>It&#039;s great to have people checking over the auto-linking issues at last. The www...foo issue was reported at http://github.com/mzsanford/twitter-text-rb/issues/closed#issue/4 and has since been fixed in the Open Source library and is awaiting deploy on twitter.com. It turns out to be a harder problem than expected. The www...foo bug was introduced while adding support for some IDNs. I&#039;ll investigate the TLD stuff but the main worry is the gTLD process and what that will mean for the list of valid TLDs.</description> <content:encoded><![CDATA[<p>It&#8217;s great to have people checking over the auto-linking issues at last. The <a
href="http://www...foo" rel="nofollow">http://www&#8230;foo</a> issue was reported at <a
href="http://github.com/mzsanford/twitter-text-rb/issues/closed#issue/4" rel="nofollow">http://github.com/mzsanford/twitter-text-rb/issues/closed#issue/4</a> and has since been fixed in the Open Source library and is awaiting deploy on twitter.com.</p><p>It turns out to be a harder problem than expected. The <a
href="http://www...foo" rel="nofollow">http://www&#8230;foo</a> bug was introduced while adding support for some IDNs. I&#8217;ll investigate the TLD stuff but the main worry is the gTLD process and what that will mean for the list of valid TLDs.</p> ]]></content:encoded> </item> <item><title>By: Steven Pears</title><link>http://shkspr.mobi/blog/index.php/2010/03/bugs-in-twitter-text-libraries/#comment-6014</link> <dc:creator>Steven Pears</dc:creator> <pubDate>Wed, 31 Mar 2010 13:01:09 +0000</pubDate> <guid
isPermaLink="false">http://shkspr.mobi/blog/?p=1924#comment-6014</guid> <description>I think the regex is a good start, but that you need to make it the first in a two-pass process. Put a word boundary on the regex so that it doesn&#039;t allow for &quot;awwww...&quot;, then use regex groups to give you a quick and easy way to check a valid TLD. If your TLD list is out of date then you accept you may not catch 100%, but you&#039;re going to get a much more accurate set of results compared to regex alone, and in the long run people will appreciate it.</description> <content:encoded><![CDATA[<p>I think the regex is a good start, but that you need to make it the first in a two-pass process. Put a word boundary on the regex so that it doesn&#8217;t allow for &#8220;awwww&#8230;&#8221;, then use regex groups to give you a quick and easy way to check a valid TLD.</p><p>If your TLD list is out of date then you accept you may not catch 100%, but you&#8217;re going to get a much more accurate set of results compared to regex alone, and in the long run people will appreciate it.</p> ]]></content:encoded> </item> <item><title>By: David Carrington</title><link>http://shkspr.mobi/blog/index.php/2010/03/bugs-in-twitter-text-libraries/#comment-6013</link> <dc:creator>David Carrington</dc:creator> <pubDate>Wed, 31 Mar 2010 11:55:35 +0000</pubDate> <guid
isPermaLink="false">http://shkspr.mobi/blog/?p=1924#comment-6013</guid> <description>My point is that the list of TLDs changes, and I can&#039;t be arsed to subscribe to something which notifies me when I need to refresh the list in my apps. It will also make the regex even bigger than it is now, which I&#039;d personally prefer not to do. There is also the problem with clashes such as &quot;Hi john.in tomorrow?&quot; - is that www.john.in? I think not. It&#039;s those kinds of issues which I believe are not worth the effort of solving for the sake of 7 characters.</description> <content:encoded><![CDATA[<p>My point is that the list of TLDs changes, and I can&#8217;t be arsed to subscribe to something which notifies me when I need to refresh the list in my apps. It will also make the regex even bigger than it is now, which I&#8217;d personally prefer not to do.</p><p>There is also the problem with clashes such as &#8220;Hi john.in tomorrow?&#8221; &#8211; is that <a
href="http://www.john.in?" rel="nofollow">http://www.john.in?</a> I think not. It&#8217;s those kinds of issues which I believe are not worth the effort of solving for the sake of 7 characters.</p> ]]></content:encoded> </item> <item><title>By: Tweets that mention Bugs in Twitter Text Libraries – Terence Eden has a Blog -- Topsy.com</title><link>http://shkspr.mobi/blog/index.php/2010/03/bugs-in-twitter-text-libraries/#comment-6012</link> <dc:creator>Tweets that mention Bugs in Twitter Text Libraries – Terence Eden has a Blog -- Topsy.com</dc:creator> <pubDate>Wed, 31 Mar 2010 11:53:28 +0000</pubDate> <guid
isPermaLink="false">http://shkspr.mobi/blog/?p=1924#comment-6012</guid> <description>[...] This post was mentioned on Twitter by James Heaver. James Heaver said: RT @edent: Bugs in Twitter&#039;s text processing libraries - http://shkspr.mobi/blog?p=1924 (thoughts from devs welcome) [...]</description> <content:encoded><![CDATA[<p>[...] This post was mentioned on Twitter by James Heaver. James Heaver said: RT @edent: Bugs in Twitter&#39;s text processing libraries &#8211; <a
href="http://shkspr.mobi/blog?p=1924" rel="nofollow">http://shkspr.mobi/blog?p=1924</a> (thoughts from devs welcome) [...]</p> ]]></content:encoded> </item> <item><title>By: Owen Blacker</title><link>http://shkspr.mobi/blog/index.php/2010/03/bugs-in-twitter-text-libraries/#comment-6011</link> <dc:creator>Owen Blacker</dc:creator> <pubDate>Wed, 31 Mar 2010 11:32:05 +0000</pubDate> <guid
isPermaLink="false">http://shkspr.mobi/blog/?p=1924#comment-6011</guid> <description>I&#039;d disagree with David about adding support for the full canonical list of TLDs — it&#039;s software, it&#039;s not as if it&#039;s hard and a 5% character saving is gonna be useful to quite a few people. But you&#039;re both definitely right about consistency, that&#039;s just sloppy of them!</description> <content:encoded><![CDATA[<p>I&#8217;d disagree with David about adding support for the full canonical list of TLDs — it&#8217;s software, it&#8217;s not as if it&#8217;s hard and a 5% character saving is gonna be useful to quite a few people.</p><p>But you&#8217;re both definitely right about consistency, that&#8217;s just sloppy of them!</p> ]]></content:encoded> </item> <item><title>By: David Carrington</title><link>http://shkspr.mobi/blog/index.php/2010/03/bugs-in-twitter-text-libraries/#comment-6010</link> <dc:creator>David Carrington</dc:creator> <pubDate>Wed, 31 Mar 2010 10:48:10 +0000</pubDate> <guid
isPermaLink="false">http://shkspr.mobi/blog/?p=1924#comment-6010</guid> <description>Consistency I agree needs to be fixed. The www problem too. I&#039;m not convinced that adding handling for all TLDs is worth the few extra characters saved.</description> <content:encoded><![CDATA[<p>Consistency I agree needs to be fixed. The www problem too.</p><p>I&#8217;m not convinced that adding handling for all TLDs is worth the few extra characters saved.</p> ]]></content:encoded> </item> </channel> </rss>
