<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Passionately Opinionated &#187; perl</title>
	<atom:link href="http://www.benfallaize.co.uk/tag/perl/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.benfallaize.co.uk</link>
	<description>a blog by Ben Fallaize</description>
	<lastBuildDate>Sat, 19 Nov 2011 21:23:18 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
		<item>
		<title>Perl&#8217;s ability to get away with murder</title>
		<link>http://www.benfallaize.co.uk/2009/09/perl-and-its-ability-to-get-away-with-murder/</link>
		<comments>http://www.benfallaize.co.uk/2009/09/perl-and-its-ability-to-get-away-with-murder/#comments</comments>
		<pubDate>Mon, 14 Sep 2009 19:19:13 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Programming]]></category>
		<category><![CDATA[perl]]></category>
		<category><![CDATA[spider]]></category>

		<guid isPermaLink="false">http://www.benfallaize.co.uk/?p=39</guid>
		<description><![CDATA[Recently I&#8217;ve been using Perl to write a sort of search engine.  Basically, it gets given a web address, it goes off, has a good mooch through the given web address, gathers the information I tell it to, has a little sleep, and comes back to me, all the while fooling the chosen site into &#8230; <a href="http://www.benfallaize.co.uk/2009/09/perl-and-its-ability-to-get-away-with-murder/">Read more <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Recently I&#8217;ve been using Perl to write a sort of search engine.  Basically, it gets given a web address, it goes off, has a good mooch through the given web address, gathers the information I tell it to, has a little sleep, and comes back to me, all the while fooling the chosen site into thinking it&#8217;s a perfectly normal web user.</p>
<p>Now I am using this technology for perfectly genuine reasons, but it strikes me that it would be very easy to abuse.  Also, for your inexperienced programmer, could easily get out of control.  What also strikes me, is that these two scenarios almost overlap if you aren&#8217;t careful.</p>
<p>Say for instance, you write a basic spider to read a site, pick up any links, read them too, and record all the information in a text file.  Say also, you forget to put in a checker for duplicate links.  Already you have a spider that will never stop.  In other words, you&#8217;ve put it into an infinite loop.  &#8221;Oops&#8221; you might say, but consider the website you just spidered.  What you have essentially just done is a DoS attack.  Sure, it may not have done any damage, but it has the potential.</p>
<p>However, chuck in a few random commands, and your basic terrible script can suddenly emulate a real web user, give it a user-agent, and voila, spider away, steal what you want, the possibilities are endless and frankly, scary&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.benfallaize.co.uk/2009/09/perl-and-its-ability-to-get-away-with-murder/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

