Skip to content

Latest commit

 

History

History
23 lines (15 loc) · 845 Bytes

README.md

File metadata and controls

23 lines (15 loc) · 845 Bytes

Cricket-Statistics

Statistics

This project is basically based on scraping the data from an internet site using bs4 in Python3.

Modules used are:

  1. Beautiful Soup
  2. Pandas
  3. re (Regular Expressions)

Methods Used and Code Overview:

  1. getData(): This method takes in a single parameter i.e. the url to gather/scrape data and store it in excel file after undergoing some cleaning operations using regular expressions.
  2. Then the above method is looped so as to gather data from all available pages.
  3. Now a deep cleaning of data is performed using pandas.
  4. Finally the cleaned data is stored as an excel file.

@author: Obed Junias