streaker

Java-based web crawler

og/le :: streaker :: group members :: source :: license

streaker is a web crawling and indexing agent written in Java.  streaker interfaces with a MySQL database to store relevant web page information, including the full text of pages.  streaker is designed to index a local web or intranet, as in a corporate or academic environment.  In order to avoid overloading web servers with frequent requests, streaker includes a self-throttling mechanism that is implemented on a per-server basis.