python scraper examples

Further development is going in https://github.com/Megaputer/inepta, see examples in `examples` folder

This repository contains examples of web scrapers used in Internet Source node to demonstrate their use cases and capabilities.

Installation

Install the newest version of python from https://python.org/downloads. Python 3.7+ is required.
Download this repository (here we placed it to D drive, so full path is D:\python-scraper-examples)
Open Command Prompt and navigate to the repository root folder
Create virtual environment

python -m venv env

Install scraper dependencies

env\Scripts\pip install -r requirements.txt

Download chromium browser for webapp_scraper:

env\Scripts\python -m playwright install chromium

Register web scrapers in PolyAnalyst:
- Navigate to Server settings in PolyAnalyst Administrative Tool
- Open Web scrapers context menu and click on Add item
- Enter the scraper name in the Name field. This name will be displayed in the drop-down Scraper menu in the Internet Source node wizard
- Enter a command in the Command field. For example,
```
D:\python-scraper-examples\env\Scripts\python.exe D:\python-scraper-examples\megaputer_blog.py
```
- Click Save changes to apply new settings

Usage

Add Internet Source node to workspace
Choose one of scrapers registered earlier in the drop-down Scraper menu
Set parameters if selected scraper supports them
Execute node

License

This project is licensed under the MIT License - see the LICENSE file for details

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
currency_exchange_rates.py		currency_exchange_rates.py
megaputer_blog.py		megaputer_blog.py
requirements.txt		requirements.txt
utils.py		utils.py
webapp_scraper.py		webapp_scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

python scraper examples

Further development is going in https://github.com/Megaputer/inepta, see examples in `examples` folder

Installation

Usage

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Megaputer/python-scraper-examples

Folders and files

Latest commit

History

Repository files navigation

python scraper examples

Further development is going in https://github.com/Megaputer/inepta, see examples in examples folder

Installation

Usage

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Further development is going in https://github.com/Megaputer/inepta, see examples in `examples` folder

Packages