Skip to content

Rahul040202/randomwalk-ds-assessment

 
 

Repository files navigation

RandomWalk Data Science Assessment

Instructions:

  1. Fork the github repo into your personal Github account and take a clone into your local system.

    Guide to Forking Github Repo: https://docs.github.com/en/github-ae@latest/get-started/quickstart/fork-a-repo

    Guide for cloning Github Repo: https://docs.github.com/en/repositories/creating-and-managing-repositories/cloning-a-repository

  2. Intantiate a Jupyter Notebook instance in the local working directory and create a notebook which answers the following questions.

    Guide for installing Jupyter Notebook in Local system: https://test-jupyter.readthedocs.io/en/latest/install.html

  3. Save the notebook and push it into forked Github github repo.

    Guide to pushing code into Github Repo: https://docs.github.com/en/migrations/importing-source-code/using-the-command-line-to-import-source-code/adding-locally-hosted-code-to-github

  4. Record a screencast video recording demonstrating the solution in your system and upload the video into the forked github repo.

    Guide to record screen in

    a) MAC: https://support.apple.com/en-in/102618

    b) Windows: https://www.microsoft.com/en-us/windows/learning-center/how-to-record-screen-windows-11

    c) Ubuntu: https://askubuntu.com/questions/4428/how-can-i-record-my-screen

  5. Share the repository link into the Google Form: https://forms.gle/QCfjjyQQ5jnoegNe9

Questions:

Q1: How many rows and columns are there in books.csv dataset?

Q2: How many books do not have an original title?

Q3: How many unique books are present in the dataset ? Evaluate based on the 'book_id' after removing null values in the original_title column.

Q4: What is the average rating of all the books in the dataset based on ‘average_rating’?

Q5: Find the number of books published in the year ‘2000’ based on the ‘original_publication_year’.

Q6: Which book (title) has the maximum number of ratings based on ‘work_ratings_count’.

Q7: Bucket the average_rating of books into 11 buckets [0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0] with 0.5 decimal rounding (eg: average_rating 3.0 to 3.49 will fall in bucket 3.0). Plot bar graph to show total number of books in each rating bucket.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%