Interactive maps with BokehΒΆ
Our ultimate goal today is to learn few concepts how we can produce nice looking interactive maps using Geopandas and Bokeh such as:
Simple interactive point plotΒΆ
First, we learn the basic logic of plotting in Bokeh by making a simple interactive plot with few points.
Import necessary functionalities from bokeh.
from bokeh.plotting import figure, save
First we need to initialize our plot by calling the figure
object.
# Initialize the plot (p) and give it a title
In [1]: p = figure(title="My first interactive plot!")
# Let's see what it is
In [2]: p
Out[2]: Figure(id='3e4e1953-baca-4cae-9f10-de4e2f642df4', ...)
Next we need to create lists of x and y coordinates that we want to plot.
# Create a list of x-coordinates
In [3]: x_coords = [0,1,2,3,4]
# Create a list of y-coordinates
In [4]: y_coords = [5,4,1,2,0]
Note
In Bokeh drawing points, lines or polygons are always done using list(s) of x and y coordinates.
Now we can plot those as points using a .circle()
-object. Let’s give it a red color and size of 10.
# Plot the points
In [5]: p.circle(x=x_coords, y=y_coords, size=10, color="red")
Out[5]: GlyphRenderer(id='f3d6d439-47a8-44f8-a574-a3cf5a656c8a', ...)
Finally, we can save our interactive plot into the disk with save
-function that we imported in the beginning. All interactive plots are typically saved as html
files which you can open in a web-browser.
# Give output filepath
outfp = r"/home/geo/points.html"
# Save the plot by passing the plot -object and output path
save(obj=p, filename=outfp)
Now open your interactive points.html
plot by double-clicking it which should open it in a web browser.
This is how it should look like:
It is interactive. You can drag the plot by clicking with left mouse and dragging. There are also specific buttons on the right side of the plot by default which you can select on and off:
Pan button allows you to switch the dragging possibility on and off (on by default).
BoxZoom button allows you to zoom into an area that you define by left dragging with mouse an area of your interest.
Save button allows you to save your interactive plot as a static low resolution .png
file.
WheelZoom button allows you to use mouse wheel to zoom in and out.
Reset button allows you to use reset the plot as it was in the beginning.
Creating interactive maps using Bokeh and GeopandasΒΆ
Now we now khow how to make a really simple interactive point plot using Bokeh. What about creating such a map from a Shapefile of points? Of course we can do that, and we can use Geopandas for achieving that goal which is nice!
Creating an interactive Bokeh map from Shapefile(s) contains typically following steps:
- Read the Shapefile into GeoDataFrame
- Calculate the x and y coordinates of the geometries into separate columns
- Convert the GeoDataFrame into a Bokeh DataSource
- Plot the x and y coordinates as points, lines or polygons (which are in Bokeh words: circle, multi_line and patches)
Let’s practice these things and see how we can first create an interactive point map, then a map with lines, and finally a map with polygons where we also add those points and lines into our final map.
Point mapΒΆ
Let’s first make a map out of those addresses that we geocoded in the Lesson 3. That Shapefile is provided for you in the data folder that you downloaded.
Read the data using geopandas which is the first step.
import geopandas as gpd
# File path
points_fp = r"/home/geo/data/addresses.shp"
# Read the data
points = gpd.read_file(points_fp)
Let’s see what we have.
In [6]: points.head()
Out[6]:
address id \
0 Kampinkuja 1, 00100 Helsinki, Finland 1001
1 Kaivokatu 8, 00101 Helsinki, Finland 1002
2 Hermanstads strandsvΓ€g 1, 00580 Helsingfors, F... 1003
3 ItΓ€vΓ€ylΓ€, 00900 Helsinki, Finland 1004
4 Tyynenmerenkatu 9, 00220 Helsinki, Finland 1005
geometry
0 POINT (24.9301701 60.1683731)
1 POINT (24.9418933 60.1698665)
2 POINT (24.9774004 60.18735880000001)
3 POINT (25.0919641 60.21448089999999)
4 POINT (24.9214846 60.1565781)
Okey, so we have the address and id columns plus the geometry column as attributes.
Now, as a second step, we need to calculate the x and y coordinates of those points. Unfortunately there is not a ready made function in geopandas to do that.
Thus, let’s create our own function called getPointCoords()
which will return the x or y coordinate of a given geometry. It shall have two parameters: geom
and coord_type
where the first one should be a Shapely geometry object and coord_type should be either 'x'
or 'y'
.
def getPointCoords(row, geom, coord_type):
"""Calculates coordinates ('x' or 'y') of a Point geometry"""
if coord_type == 'x':
return row[geom].x
elif coord_type == 'y':
return row[geom].y
Okey great. Let’s then use our function in a similar manner as we did before when classifying data using .apply()
function.
# Calculate x coordinates
In [7]: points['x'] = points.apply(getPointCoords, geom='geometry', coord_type='x', axis=1)
# Calculate y coordinates
In [8]: points['y'] = points.apply(getPointCoords, geom='geometry', coord_type='y', axis=1)
# Let's see what we have now
In [9]: points.head()
Out[9]:
address id \
0 Kampinkuja 1, 00100 Helsinki, Finland 1001
1 Kaivokatu 8, 00101 Helsinki, Finland 1002
2 Hermanstads strandsvΓ€g 1, 00580 Helsingfors, F... 1003
3 ItΓ€vΓ€ylΓ€, 00900 Helsinki, Finland 1004
4 Tyynenmerenkatu 9, 00220 Helsinki, Finland 1005
geometry x y
0 POINT (24.9301701 60.1683731) 24.930170 60.168373
1 POINT (24.9418933 60.1698665) 24.941893 60.169866
2 POINT (24.9774004 60.18735880000001) 24.977400 60.187359
3 POINT (25.0919641 60.21448089999999) 25.091964 60.214481
4 POINT (24.9214846 60.1565781) 24.921485 60.156578
Okey great! Now we have the x and y columns in our GeoDataFrame.
The third step, is to convert our DataFrame into a format that Bokeh can understand. Thus, we will convert our DataFrame into ColumnDataSource which is a Bokeh-specific way of storing the data.
Note
Bokeh ColumnDataSource
do not understand Shapely geometry -objects. Thus, we need to remove the geometry
-column before convert our DataFrame into a ColumnDataSouce.
Let’s make a copy of our points GeoDataFrame where we drop the geometry column.
# Make a copy and drop the geometry column
In [10]: p_df = points.drop('geometry', axis=1).copy()
# See head
In [11]: p_df.head(2)
Out[11]:
address id x y
0 Kampinkuja 1, 00100 Helsinki, Finland 1001 24.930170 60.168373
1 Kaivokatu 8, 00101 Helsinki, Finland 1002 24.941893 60.169866
Now we can convert that pandas DataFrame into a ColumnDataSource.
In [12]: from bokeh.models import ColumnDataSource
# Point DataSource
In [13]: psource = ColumnDataSource(p_df)
# What is it?
In [14]: psource
Out[14]: ColumnDataSource(id='1574239d-0e48-4016-9d2a-df6b1c4cc3e2', ...)
Okey, so now we have a ColumnDataSource object that has our data stored in a way that Bokeh wants it.
Finally, we can make a Point map of those points in a fairly similar manner as in the first example. Now instead of passing the coordinate lists,
we can pass the data as a source
for the plot with column names containing those coordinates.
# Initialize our plot figure
In [15]: p = figure(title="A map of address points from a Shapefile")
# Add the points to the map from our 'psource' ColumnDataSource -object
In [16]: p.circle('x', 'y', source=psource, color='red', size=10)
Out[16]: GlyphRenderer(id='2a888352-232f-43d1-93cb-fae8990115d2', ...)
Great it worked. Now the last thing is to save our map as html file into our computer.
# Output filepath
outfp = r"/home/geo/data/point_map.html"
# Save the map
save(p, outfp)
Now you can open your point map in the browser in a similar manner as in the previous example. Your map should look like following:
Adding interactivity to the mapΒΆ
In Bokeh there are specific set of plot tools that you can add to the plot. Actually all the buttons that you see on the right side of the plot are exactly such tools. It is e.g. possible to interactively show information about the plot objects to the user when placing mouse over an object as you can see from the example on top of this page. The tool that shows information from the plot objects is an inspector called HoverTool that annotate or otherwise report information about the plot, based on the current cursor position.
Let’s see now how this can be done.
First we need to import the HoverTool from bokeh.models
that includes .
In [17]: from bokeh.models import HoverTool
Next, we need to initialize our tool.
In [18]: my_hover = HoverTool()
Then, we need to tell to the HoverTool that what information it should show to us. These are defined with tooltips like this:
In [19]: my_hover.tooltips = [('Address of the point', '@address')]
From the above we can see that tooltip should be defined with a list of tuple(s) where the first item is the name or label for the information that will be shown, and the second item
is the column-name where that information should be read in your data. The @
character in front of the column-name is important because it tells that the information should be taken
from a column named as the text that comes after the character.
Lastly we need to add this new tool into our current plot.
In [20]: p.add_tools(my_hover)
Great! Let’s save this enhanced version of our map as point_map_hover.html
and see the result.
# File path
outfp = r"/home/geo/data/point_map_hover.html"
save(p, outfp)
As you can see now the plot shows information about the points and the content is the information derived from column address.
Hint
Of course, you can show information from multiple columns at the same time. This is achieved simply by adding more tooltip variables when defining the tooltips, such as:
my_hover2.tooltips = [('Label1', '@col1'), ('Label2', '@col2'), ('Label3', '@col3')]
Line mapΒΆ
Okey, now we have made a nice point map out of a Shapefile. Let’s see how we can make an interactive map out of a Shapefile that represents metro lines in Helsinki. We follow the same steps than before, i.e. 1) read the data, 2) calculate x and y coordinates, 3) convert the DataFrame into a ColumnDataSource and 4) make the map and save it as html.
Read the data using geopandas which is the first step.
import geopandas as gpd
# File path
metro_fp = r"/home/geo/data/metro.shp"
# Read the data
metro = gpd.read_file(metro_fp)
Let’s see what we have.
In [21]: metro.head()
Out[21]:
NUMERO SUUNTA geometry
0 1300M 1 LINESTRING (2561676.997249531 6681346.00195433...
1 1300M 2 LINESTRING (2550919.001803585 6672692.00211347...
2 1300M1 1 LINESTRING (2561676.997249531 6681346.00195433...
3 1300M1 2 LINESTRING (2559946.003624604 6678095.99842650...
4 1300M2 1 LINESTRING (2559946.003624604 6678095.99842650...
Okey, so we have the address and id columns plus the geometry column as attributes.
Second step is where calculate the x and y coordinates of the nodes of our lines.
Let’s create our own function called getLineCoords()
in a similar manner as previously but now we need to modify it a bit so that we can get coordinates out of the Shapely LineString object.
def getLineCoords(row, geom, coord_type):
"""Returns a list of coordinates ('x' or 'y') of a LineString geometry"""
if coord_type == 'x':
return list( row[geom].coords.xy[0] )
elif coord_type == 'y':
return list( row[geom].coords.xy[1] )
Note
Wondering about what happens here? Take a tour to our earlier materials about LineString attributes.
By default Shapely returns the coordinates as a numpy array of the coordinates. Bokeh does not understand arrays, hence we need to convert the array into a list which is why we
apply list()
-function.
Let’s now apply our function in a similar manner as previously.
# Calculate x coordinates of the line
In [22]: metro['x'] = metro.apply(getLineCoords, geom='geometry', coord_type='x', axis=1)
# Calculate y coordinates of the line
In [23]: metro['y'] = metro.apply(getLineCoords, geom='geometry', coord_type='y', axis=1)
# Let's see what we have now
In [24]: metro.head()
Out[24]:
NUMERO SUUNTA geometry \
0 1300M 1 LINESTRING (2561676.997249531 6681346.00195433...
1 1300M 2 LINESTRING (2550919.001803585 6672692.00211347...
2 1300M1 1 LINESTRING (2561676.997249531 6681346.00195433...
3 1300M1 2 LINESTRING (2559946.003624604 6678095.99842650...
4 1300M2 1 LINESTRING (2559946.003624604 6678095.99842650...
x \
0 [2561676.997249531, 2560202.997150008, 2560127...
1 [2550919.001803585, 2551145.9991329825, 255126...
2 [2561676.997249531, 2560202.997150008, 2560127...
3 [2559946.003624604, 2560094.9990514405, 256018...
4 [2559946.003624604, 2559644.0031698933, 255947...
y
0 [6681346.001954339, 6681016.996685321, 6680969...
1 [6672692.002113477, 6672713.997145447, 6672737...
2 [6681346.001954339, 6681016.996685321, 6680969...
3 [6678095.998426503, 6678179.9976436375, 667824...
4 [6678095.998426503, 6678008.998522878, 6677957...
Yep, now we have the x and y columns in our GeoDataFrame.
The third step. Convert the DataFrame (without geometry column) into a ColumnDataSource which, as you remember, is a Bokeh-specific way of storing the data.
# Make a copy and drop the geometry column
In [25]: m_df = metro.drop('geometry', axis=1).copy()
# Point DataSource
In [26]: msource = ColumnDataSource(m_df)
Finally, we can make a map of the metro line and save it in a similar manner as earlier but now instead of plotting circle
we need to use a .multiline()
-object. Let’s define the line_width
to be 3.
# Initialize our plot figure
p = figure(title="A map of the Helsinki metro")
# Add the lines to the map from our 'msource' ColumnDataSource -object
p.multi_line('x', 'y', source=msource, color='red', line_width=3)
# Output filepath
outfp = "/home/geo/data/metro_map.html"
# Save the map
save(p, outfp)
Now you can open your point map in the browser and it should look like following:
Todo
Task:
As you can see we didn’t apply HoverTool for the plot. Try to apply it yourself and use a column called NUMERO
from our data as the information.
Polygon map with Points and LinesΒΆ
It is of course possible to add different layers on top of each other. Let’s visualize a map showing accessibility in Helsinki Region and place a metro line and the address points on top of that.
1st step: Import necessary modules and read the Shapefiles.
from bokeh.plotting import figure, save
from bokeh.models import ColumnDataSource, HoverTool, LogColorMapper
import geopandas as gpd
import pysal as ps
# File paths
grid_fp = r"/home/geo/data/TravelTimes_to_5975375_RailwayStation.shp"
point_fp = r"/home/geo/data/addresses.shp"
metro_fp = r"/home/geo/data/metro.shp"
# Read files
grid = gpd.read_file(grid_fp)
points = gpd.read_file(point_fp)
metro = gpd.read_file(metro_fp)
As usual, we need to make sure that the coordinate reference system is the same in every one of the layers. Let’s use the CRS of the grid layer and apply it to our points and metro line.
# Get the CRS of our grid
In [27]: CRS = grid.crs
In [28]: print(CRS)
{'init': 'epsg:3067'}
# Convert the geometries of metro line and points into that one
points['geometry'] = points['geometry'].to_crs(crs=CRS)
metro['geometry'] = metro['geometry'].to_crs(crs=CRS)
Okey now, the geometries should have similar values:
In [29]: points['geometry'].head(1)
Out[29]:
0 POINT (385149.4478367225 6671962.669661295)
Name: geometry, dtype: object
In [30]: metro['geometry'].head(1)