public marks

PUBLIC MARKS from jdrsantos with tags python & scraper

22 March 2007 11:00

twill: a simple scripting language for Web browsing

by 2 others
twill is a simple language that allows users to browse the Web from a command-line interface. With twill, you can navigate through Web sites that use forms, cookies, and most standard Web features.

22 March 2007 10:15

mechanize

by 2 others
Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize. mechanize.Browser is a subclass of mechanize.UserAgentBase, which is, in turn, a subclass of urllib2.OpenerDirector (in fact, of mechanize.OpenerDirect
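
A rough illustration of what "stateful" means here, using only the Python 3 standard library rather than mechanize itself: urllib2.OpenerDirector lives on as urllib.request.OpenerDirector, and attaching a cookie jar to one makes later requests carry cookies set by earlier ones. The function name make_stateful_opener and the URL in the comment are made up for this sketch and are not part of mechanize.

```python
import http.cookiejar
import urllib.request


def make_stateful_opener():
    """Build an OpenerDirector that remembers cookies between requests.

    Hypothetical helper for illustration only; mechanize.Browser layers
    much more (forms, links, history) on top of the same base class.
    """
    jar = http.cookiejar.CookieJar()
    # build_opener returns an OpenerDirector with the default handlers
    # plus the cookie processor we pass in.
    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(jar)
    )
    return opener, jar


opener, jar = make_stateful_opener()

# Usage (not executed here, needs network access; URL is a placeholder):
#   response = opener.open("http://example.com/login")
#   # cookies set by the response are now in `jar` and will be sent
#   # automatically on the next opener.open(...) call.
```

The opener and the jar are returned together so the caller can inspect or clear the stored cookies between requests.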

22 March 2007 10:00

Beautiful Soup: We called him Tortoise because he taught us.

by 7 others
Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Three features make it powerful: