<?xml version="1.0" encoding="UTF-8"?><!-- generator="WordPress/2.5.1" -->
<rss version="0.92">
<channel>
	<title>Digital Digressions by Stuart Sierra</title>
	<link>http://stuartsierra.com</link>
	<description>From programming to everything else</description>
	<lastBuildDate>Wed, 14 May 2008 15:58:26 +0000</lastBuildDate>
	<docs>http://backend.userland.com/rss092</docs>
	<language>en</language>
	
	<item>
		<title>EC2 Authorizations for Hadoop</title>
		<description>I just did my first test-run of a Hadoop cluster on Amazon EC2.  It's not as tricky as it appears, although I ran into some snags, which I'll document here.  I also found these pages helpful: EC2 on Hadoop Wiki and manAmplified.

First, make sure the EC2 API tools ...</description>
		<link>http://stuartsierra.com/2008/05/14/ec2-authorizations-for-hadoop</link>
			</item>
	<item>
		<title>Stop Your Java SAX Parser from Downloading DTDs</title>
		<description>Back in February, in a slightly plaintive post, the W3 sysadmins asked that people stop hammering their servers with requests for XHTML DTDs.  Everyone said yes, this is a stupid problem that wouldn't have happened if a) the XML spec were less dumb, or b) XML libraries were less ...</description>
		<link>http://stuartsierra.com/2008/05/08/stop-your-java-sax-parser-from-downloading-dtds</link>
			</item>
	<item>
		<title>We Don&#8217;t Know How We Program</title>
		<description>Paul Johnson, in the U.K., wrote a piece about how there is no known "process" for programming.  At some point, all the theory and methodology goes out the window and someone has to sit down, think about the problem, and write some code.

I'm sure I won't be the only one ...</description>
		<link>http://stuartsierra.com/2008/05/08/we-dont-know-how-we-program</link>
			</item>
	<item>
		<title>Calling Java Constructors with this()</title>
		<description>The things I don't know about Java... could fill a book.  Here's a new one, from the Hadoop sources:


public ArrayWritable(Class valueClass) {
    // ...
}

public ArrayWritable(Class valueClass, Writable[] values) {
  this(valueClass);
  this.values = values;
}


The second constructor uses the syntax this(arg) to call a different constructor, ...</description>
		<link>http://stuartsierra.com/2008/05/05/calling-java-constructors</link>
			</item>
	<item>
		<title>Astronauts Without Mission Control</title>
		<description>Joel Spolsky complains that architecture astronauts are taking over at big, rich companies like Google and Microsoft, pushing out elaborate architectural systems that don't solve actual problems.

He's right in that smart, technical people like to take on any large, abstract problem that is, as he puts it, "a fun programming ...</description>
		<link>http://stuartsierra.com/2008/05/01/astronauts-without-mission-control</link>
			</item>
	<item>
		<title>A Million Little Files</title>
		<description>My PC-oriented brain says it's easier to work with a million small files than one gigantic file.  Hadoop says the opposite -- big files are stored contiguously on disk, so they can be read/written efficiently.  UNIX tar files work on the same principle, but Hadoop can't read them ...</description>
		<link>http://stuartsierra.com/2008/04/24/a-million-little-files</link>
			</item>
	<item>
		<title>The Great Database Rewrite</title>
		<description>I just discovered the paper The End of an Architectural Era (It's Time for a Complete Rewrite), about re-designing database software from the ground up.  It contains some unsurprising predictions -- "the next decade will bring domination by shared-nothing computer systems, often called grid computing" -- and some interesting ideas:

	Any ...</description>
		<link>http://stuartsierra.com/2008/04/23/the-great-database-rewrite</link>
			</item>
	<item>
		<title>Power At Your Fingertips</title>
		<description>I just ran my first Amazon EC2 instance.  Kind of a heady feeling, having nearly unlimited computing power just a few keystrokes away.  I got the same feeling the first time I logged in as root on a dedicated web server.

I gotta say, though, that the ticking meter -- even ...</description>
		<link>http://stuartsierra.com/2008/04/22/power-at-your-fingertips</link>
			</item>
	<item>
		<title>There Is No Database</title>
		<description>I think I'm starting to get a handle on how Hadoop is supposed to work. The MapReduce model isn't what troubles me.  The mind-bending part is that there is no database.  Everything happens by scanning big files from beginning to end.  It's like everything I learned about data ...</description>
		<link>http://stuartsierra.com/2008/04/21/there-is-no-database</link>
			</item>
	<item>
		<title>Disk is the New Tape</title>
		<description>An interesting scenario from Doug Cutting: Say you have a terabyte of data, on a disk with 10ms seek time and 100MB/s max throughput.  You want to update 1% of the records.  If you do it with random-access seeks, it takes 35 days to finish.  On the ...</description>
		<link>http://stuartsierra.com/2008/04/17/disk-is-the-new-tape</link>
			</item>
</channel>
</rss>
