Technology stack

The project catalogs two types of documents: road coordinate files in JSON and event files in CSV that record traffic detected under particular conditions.
The data read from these files is written as BSON documents to the Mongo database by the mongo-init container, whose sole purpose is to check whether the data is present and, if so, insert it as defined in its JavaScript files. The mongo-express container provides a user interface that makes it easier to browse the data in Mongo.
The data are then read by the Neo4j database (container neo4j), which uses ad hoc functions to create a node for each coordinate of the roads considered and adds attributes derived from the query run against the Mongo database. For example, attributes can be derived for a specific time slot and for specific roads.
The resulting data are displayed in charts and maps by Jupyter notebooks running in the notebook container.
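
The flow from Mongo to Neo4j can be sketched roughly as follows. This is a minimal illustration, not the project's code: the document fields (`road_id`, `coordinates`, `timestamp`), the `Coordinate` label, the `traffic` attribute, and both helper functions are hypothetical, and the real schema may differ. It builds a Mongo-style filter for a time slot and a set of roads, then turns one road document into the parameter rows that a Cypher `UNWIND ... MERGE` statement could consume.

```python
from datetime import datetime

# Hypothetical Cypher statement: one node per coordinate of a road.
CREATE_NODES = (
    "UNWIND $rows AS row "
    "MERGE (c:Coordinate {road_id: row.road_id, seq: row.seq}) "
    "SET c.lon = row.lon, c.lat = row.lat, c.traffic = row.traffic"
)


def build_event_filter(road_ids, start, end):
    """Mongo-style filter: events on the given roads within [start, end)."""
    return {
        "road_id": {"$in": list(road_ids)},
        "timestamp": {"$gte": start, "$lt": end},
    }


def road_to_rows(road, traffic):
    """One parameter row per coordinate, tagged with the road's traffic value."""
    return [
        {"road_id": road["road_id"], "seq": i, "lon": lon, "lat": lat, "traffic": traffic}
        for i, (lon, lat) in enumerate(road["coordinates"])
    ]


# Made-up sample data, for illustration only.
road = {"road_id": 42, "coordinates": [(4.30, 50.83), (4.31, 50.84)]}
event_filter = build_event_filter([42], datetime(2019, 1, 1, 8), datetime(2019, 1, 1, 9))
rows = road_to_rows(road, traffic=17)
```

In a real setup, `event_filter` would be passed to a Mongo query and `rows` would be sent as the `$rows` parameter of `CREATE_NODES` through a Neo4j driver session.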

How to install

To make the project easy to install and share, the technology stack is based on Docker.
From the project's main folder, run:

# only for testing or first boot
docker-compose up -d   
# to see the logs
docker container logs container_name
# to end containers
docker-compose down
# to test changes to images
docker-compose up -d --build --remove-orphans

On the first run only, the notebook web page asks for a token; enter the one found in the Dockerfile.notebook file.

How to use

To keep the repository small, the CSV files from which traffic data are read are not included; you can download these files.
Given the amount of data and the limited hardware resources available during the testing phase, the loading code considers only the data from the surveys on Anderlecht and Brussels carried out at 15-minute intervals.
To consider other CSV files, edit the init.py file.

To access the Docker containers, the following ports are exposed:

port 1000 to access Jupyter
port 7474 to access the Neo4j page
port 8081 to access the Mongo Express GUI
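
After starting the containers, a quick way to confirm that the services are reachable is to try a TCP connection to each port. The sketch below assumes the containers run on the local machine (`localhost`); the function names are illustrative and not part of the project.

```python
import socket

# Ports listed above; the host is an assumption (containers on the local machine).
SERVICES = {"Jupyter": 1000, "Neo4j": 7474, "Mongo Express": 8081}


def is_port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_services(host="localhost", services=SERVICES):
    """Map each service name to whether its port accepts connections."""
    return {name: is_port_open(host, port) for name, port in services.items()}
```

Calling `check_services()` returns a dictionary such as `{"Jupyter": True, ...}`, with `False` for any service whose port does not accept connections.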

For normal use, it is recommended to access port 1000 directly to populate the Neo4j database and to perform data analysis queries. Accessing the Jupyter container the first time will require the token found in the Dockerfile.notebook file. In later versions it will no longer be required.

Database data is saved in ad hoc volumes so that it persists even after image updates. For the remaining containers, important session data is saved in the current project folder as plain files.

For more information see docker-compose.yml.

Samples

When the start command is run, the Mongo database is automatically populated with the dataset.
To populate the Neo4j database and view some analyses, connect to the Jupyter notebook container and run the code blocks in the notebook. After running the Python file, a first analysis can be a map view of the first 100 streets in Anderlecht.
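
One possible way to prepare a map view of this kind is to convert the first N roads into a GeoJSON FeatureCollection, which most mapping libraries can render. This is a sketch under assumptions: each road is taken to be a dict with hypothetical `name` and `coordinates` fields (longitude, latitude pairs), and the notebook's actual code may work differently.

```python
import json


def roads_to_geojson(roads, limit=100):
    """Build a GeoJSON FeatureCollection with one LineString per road."""
    features = [
        {
            "type": "Feature",
            "properties": {"name": road["name"]},
            "geometry": {
                "type": "LineString",
                # GeoJSON positions are [longitude, latitude].
                "coordinates": [list(pair) for pair in road["coordinates"]],
            },
        }
        for road in roads[:limit]
    ]
    return {"type": "FeatureCollection", "features": features}


# Made-up sample data, for illustration only.
sample = [
    {"name": f"street {i}", "coordinates": [(4.30, 50.83), (4.31, 50.84)]}
    for i in range(150)
]
collection = roads_to_geojson(sample)
geojson_text = json.dumps(collection)
```

The `limit` parameter keeps only the first 100 roads by default, which keeps the map responsive on limited hardware.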
