In Summary

By Joseph Kibe on 3 July 2009 9:52 PM

While I'm not ready to put any code online, I thought I would make a brief note about the latest phase in the development of my online note taking application.

At some point in the last week, I decided it would be useful to give my software the ability to algorithmically generate short summaries of long notes. From the point of view of the user interface, it's quite a bit easier to browse notes and find something when the software doesn't display too little information — just a title or some statistics — or too much information, such as the whole ten paragraph note. (Think how much scrolling users would have to do!)

I've spent the last two days working on the algorithm, and I'm really quite pleased with how well it works, especially since it's a relatively simple algorithm, of less than 300 lines of code.

For instance, I just ran an article from CNN about Sarah Palin's recent decision to step down as Governor of Alaska through the algorithm. The original article is a little more than 5800 characters. My algorithm produced this 54 character summary:

She will not seek election to a second gubernatorial term in 2010. " Palin was elected governor in 2006. Palin said she was transferring authority to Lt. You certainly don't do this as a stepping stone to run for president. "So, no decision yet on either 2010 or let alone 2012. " Blitzer asked. "No decision that I'd want to announce today," Palin responded.

It has a number of flaws, like the stray quote marks, incorrect punctuation of questions and a pronoun as the first word, but it's not half bad either.

No TrackBacks

TrackBack URL: http://www.kibeland.com/cms/mt-tb.cgi/289

Leave a comment