Future of Work Data

This repository contains data and analysis tools for exploring job market data from ESCO (European Skills, Competences, Qualifications and Occupations) and O*NET (Occupational Information Network).

Data Sources

ESCO Dataset: European Skills, Competences, Qualifications and Occupations taxonomy, version 1.2.0 (website)
O*NET Dataset: Occupational Information Network database, version 29.2 (February 2025 Release) (website)

Project Structure

future-of-work-data/
│
├── data/
│   ├── raw/                  # Original CSV and Excel files
│   │   ├── esco/             # ESCO dataset files
│   │   │   └── 1.2.0/        # ESCO version 1.2.0 
│   │   └── onet/             # O*NET dataset files
│   │       └── 29.2/         # O*NET version 29.2
│   ├── duckdb/               # DuckDB databases
│   │   ├── esco_dataset_1.2.0.duckdb
│   │   └── onet_dataset_29.2.duckdb
│   └── derived/              # Derived datasets from SQL queries
│
├── src/                      # Source code
│   ├── etl/                  # ETL scripts for data processing
│   └── utils/                # Utility functions
│
└── sql/                      # SQL queries
    ├── esco/                 # ESCO-specific queries
    ├── onet/                 # O*NET-specific queries
    ├── crosswalk/            # Queries linking ESCO and O*NET
    └── views/                # Python scripts with SQL queries

Install for Development

Clone the repository:

git clone https://github.com/victoriano/future-of-work-data.git
cd future-of-work-data

Setup environment with uv:

uv sync --all-groups --all-extras

The --all-groups option will install development and docs dependencies (e.g. linters etc.), and the --all-extras option optional dependencies such as notebook support.

Optional: register the uv environment as a notebook kernel:

uv run ipython kernel install --user --env VIRTUAL_ENV $(pwd)/.venv --name=fow

This will let you select the kernel fow associated with this environment in Jupyter or VS Code notebooks. You can replace "fow" with a kernel name of your choice.

Data Processing

Converting to DuckDB

The raw data is converted to DuckDB databases for efficient querying:

# Convert ESCO dataset to DuckDB
python -m src.etl.convert_esco_to_duckdb

# Convert O*NET dataset to DuckDB
python -m src.etl.convert_onet_to_duckdb

Usage Examples

Query the data with SQL

import duckdb

# Connect to the databases
esco_con = duckdb.connect('data/duckdb/esco_dataset_1.2.0.duckdb')
onet_con = duckdb.connect('data/duckdb/onet_dataset_29.2.duckdb')

# Example ESCO query
esco_occupations = esco_con.execute("SELECT * FROM occupations_en LIMIT 10").fetchdf()

# Example O*NET query 
onet_occupations = onet_con.execute("SELECT * FROM occupation_data LIMIT 10").fetchdf()

License

This project uses data from:

ESCO dataset, licensed under their license terms
O*NET dataset, licensed under Creative Commons Attribution 4.0 International License

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
data		data
notebooks		notebooks
scripts		scripts
sql		sql
src/fow		src/fow
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
setup_repo.sh		setup_repo.sh
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Future of Work Data

Data Sources

Project Structure

Install for Development

Data Processing

Converting to DuckDB

Usage Examples

Query the data with SQL

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

victoriano/future-of-work-data

Folders and files

Latest commit

History

Repository files navigation

Future of Work Data

Data Sources

Project Structure

Install for Development

Data Processing

Converting to DuckDB

Usage Examples

Query the data with SQL

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages