This topic explains how to implement web scraping and python web development. Web scraping topics such as scrapy, beautiful soup, and others will be covered. A case study based on a Malaysian website.

drshahizan drshahizan Last update: Feb 02, 2024

Stars Badge Forks Badge Pull Requests Badge Issues Badge GitHub contributors Visitors

Don't forget to hit the ⭐ if you like this repo.

About Us

The information on this Github is part of the materials for the subject High Performance Data Processing (SECP3133). This folder contains general big data information as well as big data case studies using Malaysian datasets. This case study was created by a Bachelor of Computer Science (Data Engineering), Universiti Teknologi Malaysia student.

📚 Course: High Performance Data Processing

Contents:

Web Scraping

Tutorial

Selenium

Beautiful Soup

Scrapy

Requests

Lxml

🌟 Case Study: Web Scraping

Team Library Website GitHub
Group 10 Beautiful soup StudyMalaysia.com Open in GitHub
High Five Beautiful soup EduSpiral Consultant Services Open in GitHub
QwQ Beautiful soup States and federal territories of Malaysia Open in GitHub
SDS Scrapy Book Depository Open in GitHub
BigMac Scrapy CompAsia.com Open in GitHub
SIX Scrapy bukukita.com Open in GitHub
AdMiPeQa Selenium Lazada Open in GitHub
SamVerse Selenium Malaysia General Election (GE-15) Open in GitHub
Group 9 Selenium Lazada Shopee Open in GitHub
No Name Requests Puma: sneakers Open in GitHub
Quad Lxml Jobstreet.com Open in GitHub

Contribution 🛠️

Please create an Issue for any improvements, suggestions or errors in the content.

You can also contact me using Linkedin for any other queries or feedback.

Visitors

Subscribe to our newsletter