lacerate

Open-source Ruby app that scrapes Google search results and generates reports


README



Lacerate is an open-source Rails application released under the MIT License. It takes keywords from a CSV file, scrapes Google for each of them, and extracts useful information to generate analytics and reports for business and SEO analysis.

Info we extract from Google

Core Features

Technical challenges

Preventing IP bans

Google lets ordinary users run as many searches as they like, but it does not like bots hitting the site. Suspicious activity can get our IP blacklisted from future access, so we must not let that happen. Imitating human search patterns is the only way to stay unnoticed by Google’s bot detection algorithm. Things we are going to try:
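As an illustration of human-like pacing, the sketch below pauses for a random interval between consecutive queries instead of firing them at a fixed rate. It is a minimal example under stated assumptions, not necessarily what Lacerate does: the helper names `human_like_delay` and `each_with_pauses`, and the 5 to 20 second bounds, are hypothetical.

```ruby
# Hypothetical helpers for illustration; not taken from the Lacerate codebase.

# Returns a pause length in seconds, drawn uniformly from min..max.
# The default bounds are assumed values, not measured thresholds.
def human_like_delay(min: 5.0, max: 20.0, rng: Random.new)
  raise ArgumentError, "min must not exceed max" if min > max

  rng.rand(min..max)
end

# Yields each keyword in order, pausing a random interval between
# consecutive keywords (no pause before the first one).
# `sleeper` is injectable so the pacing can be exercised without waiting.
def each_with_pauses(keywords, min: 5.0, max: 20.0, sleeper: ->(s) { sleep(s) })
  keywords.each_with_index do |keyword, index|
    sleeper.call(human_like_delay(min: min, max: max)) unless index.zero?
    yield keyword
  end
end
```

A caller would wrap its search step in the block, for example `each_with_pauses(keywords) { |k| search(k) }`, where `search` stands for whatever performs the actual query.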

Redis Connection Limitation

The Redis instance we use has a limit of 20 connections, and we use 5 servers to process the data.
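To make the constraint concrete, the sketch below works out how many Redis connections each server can hold if the limit is shared evenly. The even split and the helper name `connections_per_server` are assumptions for illustration; how Lacerate actually allocates connections is not stated here.

```ruby
# Hypothetical helper: computes an even per-server connection budget.
# Assumes every server gets the same share of the Redis connection limit.
def connections_per_server(connection_limit:, servers:)
  raise ArgumentError, "servers must be positive" unless servers.positive?
  raise ArgumentError, "limit is smaller than the server count" if connection_limit < servers

  # Integer division rounds down, so the total never exceeds the limit.
  connection_limit / servers
end

connections_per_server(connection_limit: 20, servers: 5) # => 4
```

With the numbers above, each server can open at most 4 connections, so any per-process pool (web workers plus background workers) would need to fit inside that budget.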


Speeding up searches

When there are thousands of keywords to process, using a single server (one IP) to query with the necessary pauses is very time-consuming. So, the best way is to

For maintainers

Tools used

Known issues:

System dependencies

Configuration

API documentation

OAuth 2 Guidelines


Database creation

Database initialization

How to run the test suite

Services

Deployment instructions


Maintainer of this project: Shiva Bhusal