Plant data scraped from the PFAF website.
The data and images are subject to the copyright laid out in the PFAF website.
main.py
: The main script to scrape all the data. Run with python main.py
.data.sqlite
: All the data about the plants.images/
: All the plant images, downloadd. The sources are located in the plant_images
table in the DB.We use Datasette to create a website to display the data. To start it locally run:
$ datasette data.sqlite -m metadata.json
You can also browse the dataset entirely in your browser using Datasette Lite
This project currently has support for setting up a local development environment using Conda. See the Conda install docs for how to install Conda.
It also uses Conda Lock to create a locked down version of the dependencies for reproducibility.
You are also welcome to use any other way you prefer for developing Python packages.
# To get get started:
$ conda create --name pfaf-data --file conda-[osx/linux/win]-64.lock
$ conda activate
# To regenerate the lockfiles:
$ conda-lock --kind explicit