Webscraper for Bachelor-prosjekt 🦾
This repository has been archived on 2024-12-13. You can view files and clone it, but cannot push or open issues or pull requests.
Find a file
Sindre Kjelsrud 2fd2047a73
📝 update README
Co-authored-by: haraldnilsen <harald_998@hotmail.com>
Signed-off-by: Sindre Kjelsrud <kjelsrudsindre@gmail.com>
2024-01-08 17:54:15 +01:00
.gitignore 🙈 add .gitignore 2024-01-05 12:08:01 +01:00
config.py 🎨 add config file 2024-01-08 15:14:05 +01:00
LICENSE 📄 add LICENSE 2024-01-05 11:51:32 +01:00
main.py metadata also collected 2024-01-08 15:42:35 +01:00
README.md 📝 update README 2024-01-08 17:54:15 +01:00
requirements.txt add requirements for project 2024-01-08 17:52:14 +01:00

Webscraper needed for Helseveileder

Part of Bachelor-project V2024

📝 Info

This webscraper will retrieve questions and answers, as well as the category assigned to the question, from Studenterspør.no. This will be used in our Bachelor project.

📋 Prerequisites

  • Python 3.x
  • httpx ~ HTTP client
  • HTMLParser (from selectolax.parser) ~ a fast HTML5 parser with CSS selectors
  • re ~ regular expression matching operations

🛠️ How to run locally

  1. Create Python environment: python -m venv venv
  2. Activate environment: source venv/bin/activate
  3. Install requirements: pip install -r requirements.txt
  4. Run main.py to get a csv.file: python main.py