Web Forge Spider is a web crawler written in Ruby that explores GForge-based forges (for example, sourceforge.net).
It retrieves the following information for each project:
- Project name.
- URL of the project's repository.
- Type of repository (CVS or SVN).
- URL of the mailing list where the project stores its messages.
- URLs of the distribution files (for example, .tar.gz archives) released by the project.
All results are stored in a MySQL database for further study.
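
As a rough illustration of the data listed above, the sketch below models one crawled project as a Ruby Struct and builds a parameterized INSERT statement for it. Everything here is an assumption made for the example: the names ProjectRecord, guess_repository_type and insert_statement, the "projects" table and its column names are hypothetical and are not taken from the spider's actual code or schema. The sketch uses only the Ruby standard library and does not open a database connection.

```ruby
# Hypothetical record for one crawled project. The field names are
# assumptions for illustration, not the spider's real schema.
ProjectRecord = Struct.new(:name, :repository_url, :repository_type,
                           :mailing_list_url, :download_urls,
                           keyword_init: true)

# Naive heuristic (an assumption): guess the repository type from the URL.
# Returns "SVN", "CVS", or nil when neither marker appears in the URL.
def guess_repository_type(url)
  case url.to_s.downcase
  when /svn/ then "SVN"
  when /cvs/ then "CVS"
  end
end

# Build a parameterized INSERT statement and its bind values.
# The table and column names are hypothetical. Release file URLs are left
# out here; they would likely need a table of their own (also an assumption).
def insert_statement(record, table = "projects")
  columns = %i[name repository_url repository_type mailing_list_url]
  placeholders = (["?"] * columns.size).join(", ")
  sql = "INSERT INTO #{table} (#{columns.join(', ')}) VALUES (#{placeholders})"
  [sql, columns.map { |column| record[column] }]
end
```

A record could then be built with ProjectRecord.new(name: ..., repository_url: ..., ...) and the returned SQL string and bind values handed to whichever MySQL client library is in use, for example through a prepared statement.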