Yearly Archives: 2008
Antidenormalizationism
When storing any large collection of data, one of the most critical decisions one has to make is when to normalize and when to denormalize. Normalized data is good for flexibility — you can write queries to recombine things in … Continue reading
Fragmentation and the Failure of the Web
What makes an on-line community? In the past two weeks I have received announcements of three new “communities” all interested in using open-source software to retrieve, share, and analyze data from or about governments. Most of these announcements say the … Continue reading
Whither RDF?
RDF is seductive. I can’t get away from it. Something about the ability to represent anything and everything in one consistent model just tugs at my engineer’s heartstrings. The problem with RDF, as I’ve discovered through painful experience, is that … Continue reading
Clojure for the Semantic Web
I dropped in to hear Rich Hickey talk about Clojure at the New York Semantic Web meetup group. Some highlights: • Some programs, like compilers or theorem provers, are themselves functions. They take input and produce output. Purely functional languages … Continue reading
The Document-Blob Model
Update September 22, 2008: I have abandoned this model. I’m still using Hadoop, but with a much simpler data model. I’ll post about it at some point. … Gosh darn, it’s hard to get this right. In my most recent … Continue reading
Thrift vs. Protocol Buffers
Google recently released its Protocol Buffers as open source. About a year ago, Facebook released a similar product called Thrift. I’ve been comparing them; here’s what I’ve found: Thrift Protocol Buffers Backers Facebook, Apache (accepted for incubation) Google Bindings C++, … Continue reading
Moving the ‘C’ in MVC
I’m sure I’m not the first to suggest this, but here goes. Ever since somebody first thought of applying the Model-View-Controller paradigm to the web, we’ve had this: The View is a conflation of HTML and JavaScript. JavaScript is an … Continue reading
Lawyers and Engineers
Part of what I hope to do with the Program on Law & Technology at Columbia is bridge the communication gap between lawyers and engineers. The two groups think completely differently. To take a recent example, a new P2P file-sharing … Continue reading
Strange Referrers
The web is a strange beast. Server logs reveal just how strange. Someone’s crawling AltLaw.org, sending an HTTP Referrer of “http://www.nero.com/enu/downloads-nero8-trial.php” with a User-Agent identified as “MSIE 5.00; Windows 98″. What the heck?
AI Lives!
Just when you thought the A.I. Winter would last forever, up pops Brainhat with an open-source inference engine that uses natural language as its primary interface. Cool!