top of page

EVOASTRA VENTURES

Public·1429 members

Web Scraping Project – Car Details Extraction from AckoDrive

Aim:

To automate the extraction of detailed car listings from the AckoDrive website using Python-based web scraping techniques for data analysis and exploration.


Tools & Libraries Used:

  • Python

  • requests – to fetch HTML pages

  • BeautifulSoup – for HTML parsing and data extraction

  • re (Regular Expressions) – for precise pattern matching

  • pandas & numpy – for data organization and handling missing values.


Approach:

  • Designed a web scraper to traverse through multiple pages (page/1 to page/19) of AckoDrive's car listing section.

  • Extracted multiple attributes for each car listed on the website using class selectors:


    • Brand & Model

    • Car Type (e.g., SUV, Sedan)

    • Fuel Type (e.g., Petrol, Diesel, CNG, Electric)

    • Transmission Type (Manual or Automatic)

    • Seating Capacity

    • Color Availability

    • Variant Details

    • Price

    • City-wise Location


  • Used regex to isolate and clean specific values (like number of seaters, fuel types, and variants).

  • Stored the extracted data into a structured pandas DataFrame, making it suitable for analysis, visualization, or export to CSV.


Results:

  • Successfully scraped and structured detailed information for hundreds of car variants across various cities in India.

  • Generated a clean, tabular dataset which can be used for further use cases such as price comparison, trend analysis, fuel preference insights, and location-based availability.

  • Built a reusable and extendable scraping script with modular data extraction logic.


Outcome: This project demonstrates the ability to extract structured data from semi-structured web pages using Python, effectively handling nested HTML elements, applying pattern recognition via regular expressions, and transforming raw data into an analyzable format.



55 Views
bottom of page