Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
A Python Script to Automatically Extract Excerpts From Articles (davidziegler.net)
41 points by thomaspaine on June 12, 2009 | hide | past | favorite | 2 comments


Looks like he took the extraction approach [1] - copy the information deemed most important by the system to the summary. The second approach, abstraction, involves paraphrasing sections of the source document. It seems much harder.

There are some interesting summarizers accessible on the web : http://search.iiit.ac.in/~jags/summarizer/index.cgi

Machine learning, as always, is uber-cool.

[1] http://en.wikipedia.org/wiki/Automatic_summarization


Somewhat related, see this recent article (6/10/09):

"Extracting Meaning from Millions of Pages: University of Washington software pulls facts from 500 million Web pages."

http://beta.technologyreview.com/computing/22773/




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: