
Flows Above Normal

Post-crisis analysis

In this worked example we assume the role of an analyst working in the aftermath of a significant crisis in Nepal. Our aim is to use FlowKit to investigate which administrative regions people have been displaced from and to during the crisis, following the methodology used in this paper.

The Jupyter notebook for this worked example can be downloaded here, or can be run using the quick start setup.

Introduction

We want to investigate changes in people's home locations caused by the crisis. To do this we will calculate an origin-destination matrix, containing the number of subscribers who lived in region A during a 'benchmark' period before the crisis, but had moved to region B during a 'focal' period after the start of the crisis, for each pair of regions A and B. We call this the 'flows' from the benchmark period to the focal period.

Some of these movements would have been made under normal circumstances, and were not caused by the crisis. To separate the crisis-driven movements from normal mobility patterns, we will also calculate a second origin-destination matrix, containing the flows from the same benchmark period to a 'comparison' period shortly before the crisis began. We can then subtract these 'normal' flows from the 'crisis' flows to get the flows above normal, which we assume to have been caused by the crisis. The diagram below shows the two flows that we will calculate.
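The subtraction step amounts to aligning the two origin-destination matrices on their region pairs and differencing the counts. A minimal sketch of that idea with pandas, using made-up region codes and counts (the column names pcod_from, pcod_to and value mirror the shape of FlowKit flows results, but are assumptions here):

```python
import pandas as pd

# Hypothetical flows results: one row per (origin, destination) pair.
# The data below is invented purely for illustration.
normal_flows = pd.DataFrame(
    {"pcod_from": ["A", "A", "B"], "pcod_to": ["A", "B", "B"], "value": [100, 10, 80]}
)
crisis_flows = pd.DataFrame(
    {"pcod_from": ["A", "A", "B"], "pcod_to": ["A", "B", "A"], "value": [60, 45, 20]}
)

# Align the two matrices on the (origin, destination) pair, treating pairs
# absent from one matrix as zero flow, then subtract to get flows above normal.
flows_above_normal = (
    crisis_flows.set_index(["pcod_from", "pcod_to"])["value"]
    .sub(normal_flows.set_index(["pcod_from", "pcod_to"])["value"], fill_value=0)
    .reset_index(name="flow_above_normal")
)
print(flows_above_normal)
```

Note the fill_value=0 in the subtraction: a region pair that appears in only one of the two matrices is treated as having zero flow in the other, so no pairs are silently dropped.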

Flows above normal

We will finish by using the mapboxgl library to display the total flows above normal in or out of each region on an interactive map, which will look like this:

Screenshot of inflows visualisation

Load FlowClient and connect to FlowAPI

We start by importing FlowClient. We also import geopandas and mapboxgl, which we will use later to visualise the data.

import flowclient as fc
import os
import numpy as np
import geopandas as gpd
import mapboxgl
from mapboxgl.utils import create_color_stops

%matplotlib inline

We must next generate a FlowAPI access token using FlowAuth. If you are running this notebook using the quick start setup, generating a token requires the following steps:

  1. Visit the FlowAuth login page at http://localhost:9091.
  2. Log in with username TEST_USER and password DUMMY_PASSWORD.
  3. Under "My Servers", select TEST_SERVER.
  4. Click the + button to create a new token.
  5. Give the new token a name, and click SAVE.
  6. Copy the token string using the COPY button.
  7. Paste the token in this notebook as TOKEN.

The steps are the same in a production setup, but the FlowAuth URL, login details and server name will differ.

Tokens are only valid for a fixed time, so once a token has expired you will need to create a new one.

Once we have a token, we can start a connection to the FlowAPI system. If you are connecting to FlowAPI over https and the system administrator has provided you with an SSL certificate file, you should provide the path to this file as the ssl_certificate argument to flowclient.connect() (in this example, you can set the path in the environment variable SSL_CERTIFICATE_FILE). If you are connecting over http, this argument is not required.

conn = fc.connect(
    url=os.getenv("FLOWAPI_URL", "http://localhost:9090"), # This is the URL for the interface flowclient uses to communicate with the FlowKit server
    token=TOKEN,                                           # Replace TOKEN with the token copied from FlowAuth
    ssl_certificate=os.getenv("SSL_CERTIFICATE_FILE"),     # Path to the SSL certificate file (required if FlowAPI uses an https connection)
)

Calculate origin-destination matrices

1. Create query specifications

We first need to choose the three time periods (benchmark, comparison and focal) between which we will calculate flows. The simulated crisis in our fake dataset begins on 10th February 2016, so both the benchmark and comparison periods should fall entirely before this date, and the focal period should be after this date.

We can use the flowclient get_available_dates function to check which dates have data available in FlowKit. We need to provide the connection we have just created as the connection argument; flowclient uses this to communicate with the FlowKit server.

fc.get_available_dates(connection=conn)
{'calls': ['2016-01-01',
           '2016-01-02',
           '2016-01-03',
           '2016-01-04',
           '2016-01-05',
           '2016-01-06',
           '2016-01-07',
           '2016-01-08',
           '2016-01-09',
           '2016-01-10',
           '2016-01-11',
           '2016-01-12',
           '2016-01-13',
           '2016-01-14',
           '2016-01-15',
           '2016-01-16',
           '2016-01-17',
           '2016-01-18',
           '2016-01-19',
           '2016-01-20',
           '2016-01-21',
           '2016-01-22',
           '2016-01-23',
           '2016-01-24',
           '2016-01-25',
           '2016-01-26',
           '2016-01-27',
           '2016-01-28',
           '2016-01-29',
           '2016-01-30',
           '2016-01-31',
           '2016-02-01',
           '2016-02-02',
           '2016-02-03',
           '2016-02-04',
           '2016-02-05',
           '2016-02-06',
           '2016-02-07',
           '2016-02-08',
           '2016-02-09',
           '2016-02-10',
           '2016-02-11',
           '2016-02-12',
           '2016-02-13',
           '2016-02-14',
           '2016-02-15',
           '2016-02-16',
           '2016-02-17',
           '2016-02-18',
           '2016-02-19',
           '2016-02-20',
           '2016-02-21',
           '2016-02-22',
           '2016-02-23',
           '2016-02-24',
           '2016-02-25',
           '2016-02-26',
           '2016-02-27',
           '2016-02-28',
           '2016-02-29'],
 'mds': ['2016-01-01',
         '2016-01-02',
         '2016-01-03',
         '2016-01-04',
         '2016-01-05',
         '2016-01-06',
         '2016-01-07',
         '2016-01-08',
         '2016-01-09',
         '2016-01-10',
         '2016-01-11',
         '2016-01-12',
         '2016-01-13',
         '2016-01-14',
         '2016-01-15',
         '2016-01-16',
         '2016-01-17',
         '2016-01-18',
         '2016-01-19',
         '2016-01-20',
         '2016-01-21',
         '2016-01-22',
         '2016-01-23',
         '2016-01-24',
         '2016-01-25',
         '2016-01-26',
         '2016-01-27',
         '2016-01-28',
         '2016-01-29',
         '2016-01-30',
         '2016-01-31',
         '2016-02-01',
         '2016-02-02',
         '2016-02-03',
         '2016-02-04',
         '2016-02-05',
         '2016-02-06',
         '2016-02-07',
         '2016-02-08',
         '2016-02-09',
         '2016-02-10',
         '2016-02-11',
         '2016-02-12',
         '2016-02-13',
         '2016-02-14',
         '2016-02-15',
         '2016-02-16',
         '2016-02-17',
         '2016-02-18',
         '2016-02-19',
         '2016-02-20',
         '2016-02-21',
         '2016-02-22',
         '2016-02-23',
         '2016-02-24',
         '2016-02-25',
         '2016-02-26',
         '2016-02-27',
         '2016-02-28',
         '2016-02-29'],
 'sms': ['2016-01-01',
         '2016-01-02',
         '2016-01-03',
         '2016-01-04',
         '2016-01-05',
         '2016-01-06',
         '2016-01-07',
         '2016-01-08',
         '2016-01-09',
         '2016-01-10',
         '2016-01-11',
         '2016-01-12',
         '2016-01-13',
         '2016-01-14',
         '2016-01-15',
         '2016-01-16',
         '2016-01-17',
         '2016-01-18',
         '2016-01-19',
         '2016-01-20',
         '2016-01-21',
         '2016-01-22',
         '2016-01-23',
         '2016-01-24',
         '2016-01-25',
         '2016-01-26',
         '2016-01-27',
         '2016-01-28',
         '2016-01-29',
         '2016-01-30',
         '2016-01-31',
         '2016-02-01',
         '2016-02-02',
         '2016-02-03',
         '2016-02-04',
         '2016-02-05',
         '2016-02-06',
         '2016-02-07',
         '2016-02-08',
         '2016-02-09',
         '2016-02-10',
         '2016-02-11',
         '2016-02-12',
         '2016-02-13',
         '2016-02-14',
         '2016-02-15',
         '2016-02-16',
         '2016-02-17',
         '2016-02-18',
         '2016-02-19',
         '2016-02-20',
         '2016-02-21',
         '2016-02-22',
         '2016-02-23',
         '2016-02-24',
         '2016-02-25',
         '2016-02-26',
         '2016-02-27',
         '2016-02-28',
         '2016-02-29']}

We can see that we have data for all days from 1st January to 29th February 2016, for each of three event types: 'calls', 'sms' and 'mds' (mobile data sessions). Let's define our three reference periods as:

  • 'benchmark' period: the first 21 days of available data,
  • 'comparison' period: the last 14 days before the crisis begins,
  • 'focal' period: the first 14 days from the start of the crisis.

To count changes in subscribers' home locations between these periods, we need to define what we mean by 'home' locations. Here we will calculate 'modal locations'; a subscriber's modal location is calculated by assigning them a single location each day, and then finding their most common daily location over the period of interest. FlowKit provides two methods for determining the daily locations: 'last' (i.e. the location from which they made the last call of the day) or 'most-common' (i.e. the location from which they most commonly made/received calls during the day). Here we will use the 'last' method.

We can use the flowclient modal_location_from_dates_spec function to create a modal location query specification for the benchmark period. (A query specification is a dictionary containing the parameters of a FlowKit query.) This takes four arguments:

  • start_date: the first date in the benchmark period
  • end_date: the day after the last date in the benchmark period
  • method: the method to use for calculating daily locations ("last" or "most-common")
  • aggregation_unit: the spatial aggregation level (here we will use "admin3", to use administrative level 3 regions as the subscriber locations)

Note: when specifying FlowKit queries, the end date is not included in the interval.
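Because end dates are exclusive, the period boundaries used in the query specifications below can be derived with simple date arithmetic: a period of N days starting on a given date ends on that date plus N days. A sketch (the crisis start date comes from the fake dataset described above):

```python
from datetime import date, timedelta

crisis_start = date(2016, 2, 10)  # Start of the simulated crisis in the fake dataset

# End dates are exclusive: a period of N days starting on `start` has
# end date `start + N days`.
benchmark_start = date(2016, 1, 1)
benchmark_end = benchmark_start + timedelta(days=21)  # First 21 days of data

comparison_end = crisis_start                          # Last 14 days before the crisis
comparison_start = comparison_end - timedelta(days=14)

focal_start = crisis_start                             # First 14 days of the crisis
focal_end = focal_start + timedelta(days=14)

print(benchmark_end.isoformat())    # End of benchmark period (exclusive)
print(comparison_start.isoformat()) # Start of comparison period
print(focal_end.isoformat())        # End of focal period (exclusive)
```

These are exactly the date strings passed to the specifications below ("2016-01-22", "2016-01-27" and "2016-02-24").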

# Create modal location query specification for benchmark period
# (Note: this is not the query result, just the parameters we will use to run the query)

benchmark_locations_spec = fc.modal_location_from_dates_spec(
    start_date="2016-01-01",    # First date in the period
    end_date="2016-01-22",      # Day after the last date in the period
    method="last",              # Use last event of the day to define daily locations
    aggregation_unit="admin3"   # Use administrative level 3 regions as the locations
)

We repeat this process to create two more modal location query specifications, for the comparison and focal periods:

# Create modal location query specifications for comparison and focal periods
# (Note: these are not the query results, just the parameters we will use to run the queries)

comparison_locations_spec = fc.modal_location_from_dates_spec(
    start_date="2016-01-27",    # First date in the period
    end_date="2016-02-10",      # Day after the last date in the period
    method="last",              # Use last event of the day to define daily locations
    aggregation_unit="admin3"   # Use administrative level 3 regions as the locations
)

focal_locations_spec = fc.modal_location_from_dates_spec(
    start_date="2016-02-10",    # First date in the period
    end_date="2016-02-24",      # Day after the last date in the period
    method="last",              # Use last event of the day to define daily locations
    aggregation_unit="admin3"   # Use administrative level 3 regions as the locations
)

Let's put these three query specifications into a dictionary, so we can more easily loop over them later (when running the queries and getting results):

# Put the three modal location query specifications in a dictionary
home_locations_specs = {
    "benchmark": benchmark_locations_spec,
    "comparison": comparison_locations_spec,
    "focal": focal_locations_spec,
}

Next, we use the flows_spec function to create specifications for the two flows queries. Each flows query takes two modal location query specifications as its from_location and to_location parameters; the two location queries must use the same aggregation unit.

# Create query specifications for the two flows

normal_flows_spec = fc.aggregates.flows_spec(
    from_location=home_locations_specs["benchmark"], # 'from' location is the benchmark modal locations
    to_location=home_locations_specs["comparison"],  # 'to' location is the comparison modal locations
)

crisis_flows_spec = fc.aggregates.flows_spec(
    from_location=home_locations_specs["benchmark"], # 'from' location is the benchmark modal locations
    to_location=home_locations_specs["focal"],       # 'to' location is the focal modal locations
)

2. Run the queries

Modal location queries return subscriber-level results, which cannot be accessed directly through FlowAPI. We therefore wrap each modal location query specification in a spatially-aggregated query specification, using the spatial_aggregate_spec function, and pass the resulting specifications to the run_query function to start running them.

Because some FlowKit queries may take a long time to run, the run_query function doesn't return the result of the query. Instead, it returns a query ID that can be used to check the status of the query run, and get the result when it is finished.

# Run spatial aggregates of the home locations queries

home_locations_ids = {
    period: fc.run_query(
        connection=conn,
        query_spec=fc.aggregates.spatial_aggregate_spec(locations=query_spec), # Wrap the query spec in a spatial aggregate query, so that we are allowed to get the result
    )
    for period, query_spec in home_locations_specs.items() # Loop over the three query specifications and run them all
}

home_locations_ids
{'benchmark': 'b3aa6bf17c298c6b3f98d61b70ec90e6',
 'comparison': '9b62b3d59817cddc765dac49ed10f9dc',
 'focal': 'f0994a138db73b08dca62cd3f2f340dc'}

Unlike the modal_location queries, flows queries return aggregated results, so we can directly run the two flows queries.

# Run the flows queries
normal_flows_id = fc.run_query(connection=conn, query_spec=normal_flows_spec)
crisis_flows_id = fc.run_query(connection=conn, query_spec=crisis_flows_spec)

print(normal_flows_id, crisis_flows_id)
6339197a5e88ed6be6c915455af14c4b d031b1fb9720215fab9985e37653fdbf

We can periodically check the status of the queries using the get_status function.

# Get status of flows queries
for flows_id in [normal_flows_id, crisis_flows_id]:
    print(fc.get_status(connection=conn, query_id=flows_id))
completed
completed
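If a query is still running, it can be convenient to poll get_status in a loop until it reports 'completed' before fetching results. A minimal sketch (the wait_until_complete helper and the simulated status sequence are assumptions for illustration; in a real session, get_status would be a wrapper around fc.get_status(connection=conn, query_id=query_id), and the exact set of status strings may vary between FlowKit versions):

```python
import time

def wait_until_complete(get_status, poll_interval=1.0, max_wait=600.0):
    """Poll a zero-argument status callable until it reports 'completed'."""
    waited = 0.0
    while waited <= max_wait:
        status = get_status()
        if status == "completed":
            return status
        if status in ("errored", "cancelled"):  # Assumed failure statuses
            raise RuntimeError(f"Query finished with status '{status}'")
        time.sleep(poll_interval)
        waited += poll_interval
    raise TimeoutError("Query did not complete within max_wait seconds")

# Simulated status sequence for illustration; a real run would poll FlowAPI
statuses = iter(["queued", "executing", "completed"])
result = wait_until_complete(lambda: next(statuses), poll_interval=0.01)
print(result)
```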

Visualise the distributions of locations

The flows queries may take some time to calculate. While we wait, let's visualise the results of the spatially-aggregated home locations queries, to see the distributions of home locations during the three periods.

1. Download geography data

If we want to display our query results on a map, we will need to combine them with geography data for the level 3 administrative regions. We can download the shapes of the spatial regions as a GeoJSON dictionary using the get_geography function.

# Download geography data as GeoJSON
# (aggregation_unit here needs to match the one we used for the modal locations and flows query specifications)
regions = fc.get_geography(connection=conn, aggregation_unit="admin3")

To make it easier to join the geography data to query results, we load the GeoJSON into a geopandas GeoDataFrame. The dataframe has three columns:

  • 'pcod': the P-code that identifies each admin3 region. We set this as the index.
  • 'geometry': the polygons that describe the shapes of the regions.
  • 'centroid': a point location in the centre of each region. We won't use the centroid data in this example.
# Create a geopandas GeoDataFrame from the GeoJSON
regions_geodataframe = gpd.GeoDataFrame.from_features(regions).set_index("pcod")

regions_geodataframe.head()


             geometry                                            centroid
pcod
NPL.1.1.1_1  MULTIPOLYGON (((85.43076324 27.62451744, 85.425...  {'type': 'Point', 'coordinates': [85.442905911,...
NPL.1.1.2_1  MULTIPOLYGON (((84.80230713 27.67124367, 84.799...  {'type': 'Point', 'coordinates': [84.960119929,...
NPL.1.1.3_1  MULTIPOLYGON (((85.24198914 27.58089828, 85.240...  {'type': 'Point', 'coordinates': [85.349621081,...
NPL.1.1.4_1  MULTIPOLYGON (((85.71025084999999 27.39423752, ...  {'type': 'Point', 'coordinates': [85.621936812,...
NPL.1.1.5_1  MULTIPOLYGON (((85.44604492000001 27.48185158, ...  {'type': 'Point', 'coordinates': [85.344826394,...

The GeoDataFrame has a 'plot' method that allows us to quickly see the shapes of the regions:

regions_geodataframe.plot();

2. Get query results

Hopefully our home locations queries have finished running by now. We can use the get_result function to get the results of the spatial aggregate queries as pandas DataFrames.

# Get results for the spatially-aggregated home locations queries
home_locations_results = {
    period: fc.get_result(
        connection=conn,
        query_spec=fc.aggregates.spatial_aggregate_spec(locations=query_spec),
    )
    for period, query_spec in home_locations_specs.items()
}

Let's look at the "benchmark" dataframe. It has two columns:

  • 'pcod' is the P-code that identifies the region. We'll use this to join to the geography data.
  • 'value' is the number of subscribers whose modal location is in each region.
home_locations_results["benchmark"].head()
pcod value
0 NPL.4.1.1_1 616
1 NPL.1.3.3_1 378
2 NPL.1.1.3_1 3503
3 NPL.3.1.4_1 1602
4 NPL.1.2.1_1 655

3. Visualise the results

Now that we have the home locations query results and geography data for the spatial regions, we can use these to create a choropleth map. In this example we will use the Mapbox GL library for the visualisation, which requires us to first join the datasets.

Since we have all datasets in pandas/geopandas dataframes, we can use the 'join' method to join them, using the 'pcod' column.

# First, rename the 'value' column in each of the result dataframes, so they all have different names
for period in home_locations_results.keys():
    home_locations_results[period] = home_locations_results[period].rename(
        columns={"value": f"Total ({period} period)"}
    )

# Join the result dataframes to the geography data, using the 'pcod' column
# (we provide a list of dataframes here, to join them all at once)
home_locations_geodataframe = regions_geodataframe.join(
    [result.set_index("pcod") for result in home_locations_results.values()],
    how="left",
)

home_locations_geodataframe.head()


             geometry                                            centroid                                            Total (benchmark period)  Total (comparison period)  Total (focal period)
pcod
NPL.1.1.1_1  MULTIPOLYGON (((85.43076324 27.62451744, 85.425...  {'type': 'Point', 'coordinates': [85.442905911,...  776   775   nan
NPL.1.1.2_1  MULTIPOLYGON (((84.80230713 27.67124367, 84.799...  {'type': 'Point', 'coordinates': [84.960119929,...  789   786   nan
NPL.1.1.3_1  MULTIPOLYGON (((85.24198914 27.58089828, 85.240...  {'type': 'Point', 'coordinates': [85.349621081,...  3503  3500  nan
NPL.1.1.4_1  MULTIPOLYGON (((85.71025084999999 27.39423752, ...  {'type': 'Point', 'coordinates': [85.621936812,...  149   135   nan
NPL.1.1.5_1  MULTIPOLYGON (((85.44604492000001 27.48185158, ...  {'type': 'Point', 'coordinates': [85.344826394,...  234   246   nan

There are some entries with missing values in the joined dataframe. This is because FlowKit redacts any values representing fewer than 15 subscribers, to protect those subscribers' privacy, so we are missing the results for regions where fewer than 15 people have their modal locations. We can assume that there are 0 subscribers in these regions, so let's fill the missing values with 0.

# Use pandas 'fillna' method to fill missing values with 0
home_locations_geodataframe = home_locations_geodataframe.fillna(0)

We will be able to hover over the map and see the values in each column, so let's drop columns we don't need and ensure the remaining columns have sensible names.

# Drop the 'centroid' column, since we're not using it
home_locations_geodataframe = home_locations_geodataframe.drop(columns="centroid")
# Rename the 'pcod' column to 'P-code'
home_locations_geodataframe = home_locations_geodataframe.reset_index().rename(
    columns={"pcod": "P-code"}
)

home_locations_geodataframe.head()
P-code geometry Total (benchmark period) Total (comparison period) Total (focal period)
0 NPL.1.1.1_1 MULTIPOLYGON (((85.43076324 27.62451744, 85.425... 776 775 0
1 NPL.1.1.2_1 MULTIPOLYGON (((84.80230713 27.67124367, 84.799... 789 786 0
2 NPL.1.1.3_1 MULTIPOLYGON (((85.24198914 27.58089828, 85.240... 3503 3500 0
3 NPL.1.1.4_1 MULTIPOLYGON (((85.71025084999999 27.39423752, ... 149 135 0
4 NPL.1.1.5_1 MULTIPOLYGON (((85.44604492000001 27.48185158, ... 234 246 0

Now we are ready to create a choropleth showing the distribution of modal locations, using mapboxgl.ChoroplethViz.

Note: Mapbox requires an access token, which should be set as the environment variable MAPBOX_ACCESS_TOKEN. This token is only needed for the Mapbox visualisations, which are entirely separate from FlowKit.

period_to_show = "benchmark"  # Change this to "comparison" or "focal" to show results for the other periods

mapbox_token = os.environ["MAPBOX_ACCESS_TOKEN"]

# Colour scale for legend
max_total = max(
    [
        home_locations_geodataframe[f"Total ({period} period)"].max()
        for period in home_locations_specs.keys()
    ]
)
color_stops = create_color_stops(np.linspace(0, max_total, 9), colors="YlGn")

# Create a map using mapboxgl.ChoroplethViz
home_locations_viz = mapboxgl.ChoroplethViz(
    home_locations_geodataframe.__geo_interface__,
    access_token=mapbox_token,
    color_property=f"Total ({period_to_show} period)",
    color_stops=color_stops,
    opacity=0.8,
    line_color="black",
    line_width=0.5,
    legend_gradient=True,
    legend_layout="horizontal",
    legend_text_numeric_precision=0,
    below_layer="waterway-label",
    center=(84.1, 28.4),
    zoom=5.5,
)

# Show the visualisation
home_locations_viz.show()