Understanding Today's Online GIS User Through the Lens of a WorldMap Survey

WorldMap is an open source online mapping application which aims to lower barriers for scholars who wish to visualize, analyze, organize, present, and publish mapped information. In late 2013, 290 respondents among the 8,000 registered users participated in an online survey in which they described their activities, purposes, experiences, and preferences regarding the system. Participants also described their professional background, GIS skill level, age, gender, and country of work. This study analyzes the results of the survey, by summarizing the responses to each question independently and by examining the relationships and dependencies of these answers across the different questions to try to better understand why users responded the way they did. The study is based on the user-centered design (UCD) approach. We aim to use the survey results to improve our understanding of user demographics and needs. Findings from this study will be used to guide WorldMap improvements, and we hope the findings will also shed light on the broader requirements of online GIS users.

Links:
Download

no file attachments

Share

Visualizing Urban Hospitality

2016 Fisher Prize Award Co-winner - Undergaduate Category

Judges Comments: “Notable for its clarity and legibility of presentation, and simple, elegant cartography.  A well told story of the geographic distribution of social variables”

View the award winning poster

no links

Files:
yunhan.png

Share

Cartography Workshop Spring 2017

Location:  Room B-129 of Northwest Building.

Download Workshop Materials

How to Apply (Limited to 24 participants):

  • For Harvard Affiliates, please submit your application by clicking “REGISTER” button (HUID login required).
  • For Non-Harvard applicants, please submit your application by clicking “NON-HARVARD REGISTER” button.

no links

no file attachments

Share

DiTF Mapping Workshop

A hands on workshop taught to the Digital Teaching Fellows and others in the Harvard History Department.

 Download DITF mapping workshop materials.

Harvard WorldMap is an Open Source mapping platform developed to lower barriers for scholars who wish to explore, visualize, edit, and publish geospatial information. Google Fusion Tables is an experimental web application that allows for gathering, editing, sharing, and mapping data in the forms of various visualizations. Between these two tools, you have a powerful toolkit for creating, maintaining, and merging geographic data sets. In this hands-on workshop, you’ll be introduced to both tools, and see how they can help you get the most out of geospatial information.

 

no links

no file attachments

Share

Dynamic Mapping of Secondary Cities Symposium

Symposium Program (updated on 6/10/2016)

Keynote Addresses:

Open Algorithms, Open Data: Toward Continuous Mapping of Sustainable Development
Alex (Sandy) Pentland, MIT
June 14, 2016 1:15-1:45pm

The Challenge of Mapping Human Geography Data in the Urban Milieu
Lee Schwartz, Director of Office of the Geographer, Department of State
June 15, 2016 9:10-9:40am

Secondary Cities in the developing world are rapidly growing urban areas that are regional hubs for commerce, services, and governance in developing countries that often do not have adequate planning mechanisms for future development and growth. This symposium will focus on the foremost data collection tools and technologies used to map urbanization in such cities. It is part of the Secondary Cities initiative, a global collaboration of universities and organizations supported by the Humanitarian Information Unit of the Office of the Geographer of the State Department.

The aim of the symposium is to bring together organizations and individuals involved in innovative mapping activities and solutions for emergency preparedness, resiliency planning, and urban sustainability.  The symposium will open with a half-day session to provide a hands-on venue to explore and assess online, open sources tools for mapping and creating geospatial data for cities.  The 2nd day is a full day with sessions including:  1) Secondary Cities overview; 2) case studies of dynamic city data collection; 3) interactive assessment session of tools as linked to emergency preparedness, resiliency planning, and urban sustainability; 4) solutions/lessons learned. The product from this symposium will be a white paper on dynamic tools as well as a position paper on how dynamic mapping is achieved in Secondary Cities.

This event is free and open to the public.

 

Symposium Videos

Day 1 - June 14, 2016

Welcome Address

  • Jason Ur (Harvard University)  Video

Dynamic Mapping of Secondary Cities: An Introduction to Tools & Technologies

  • Melinda Laituri (Colorado State University)  Video | Slides

Keynote Address: Open Algorithms, Open Data: Toward Continuous Mapping of Sustainable Development

  • Sandy Pentland (MIT)   Video

ikeGPS Spike Intro & Demo   Video | Slides

  • James Pardue (IkeGPS)  

CartoDB Intro & Demo  Video

  • Jeff Ferzoco (CartoDB)  

GeoNode Intro & Demo    Video

  • Cristiano Giovando (World Bank) & Paolo Corti (Harvard University)   

KoBo ToolBox Intro & Demo  Video

  • Patrick Vinck (Harvard University)

ArcGIS Online, Story Map & Survey 123 Intro & Demo  Video | Slides

  • Courtney Claessens (Esri)

Geographic Futures Intro & Demo  Video | Slides

  • Mitchell Sipus (White House Presidential Innovation Fellow)

WorldMap Intro & Demo  Video

  • Ben Lewis (Harvard University)

Field Papers Intro & Demo  Video

  • Eric Rodenbeck (Stamen)

 

Day 2 - June 15, 2016

Overview & Orientation

  • Melinda Laituri (Colorado State University)   Video

Keynote Address: The Challenge of Mapping Human Geography Data in the Urban Milieu

  • Lee Schwartz, Geographer of the United States Director, Office of the Geographer  Video | Slides

Panel I: Why Dynamic Mapping in Secondary Cities?

  • Mark Montgomery (Stony Brook University) (Slides), Phil Yang (George Mason University) (Slides), Keith Clarke (University of California-SB) (Slides)   Video

Panel II: Case Studies on Dynamic Data Collection and Mapping Projects for Urban Developing

Communities

  • Reinhard Goethart (MIT) (Slides), Gabriel Moreno (Harvard University) (Slides), Kellie Stokes (Yale University) (Slides), Kirstin Miller (Ecocity Builders) (Slides)   Video

Tool & Technologies Applied to Humanitarian Aid

  • Nathaniel Raymond (HHI, Harvard University)  Video

Citizen Participation and GIS Use in Urban India

  • Rina Ghose (University of Wisconsin-Milwaukee) Video

The Use of Smartphones for Dynamic Mapping and Planning of Transit Systems in Africa

  • Zachary Patterson (Concordia University, Canada)  Video | Slides

Panel III: Solutions, Lessons Learned, and Next Steps for Secondary Cities

  • Rina Ghose (University of Wisconsin-Milwaukee), Zachary Patterson (Concordia University, Canada), Nathaniel Raymond (HHI, Harvard University)   Video

no links

no file attachments

Share

Big Data Geocoding and Overlay Analysis - A combined approach using ArcGIS and PostgreSQL

By Devika Kakkar, Jeff Blossom, and Giovanni Zambotti

In a recent project we were tasked to geocode approximately 53 million U.S. addresses (in CSV format), and determine the 2000 and 2010 U.S. Census block group FIPS codes for each address.  This blog will discuss the workflow used to accomplish this big data processing task, detailing each step taken to optimize the process, including scripts written.  The final process ended up being a combined approach using ArcGIS, PostgreSQL, and files formatted in CSV.  The full job ended up taking 96 hours to geocode in ArcGIS, and 14 hours for block group determination in PostgreSQL.  The Dell PC used for all processing is a 64-bit Windows 7 Professional SP1 operating system machine with i7-6700 CPU @ 3.40GHz × 2 and 64 GB of RAM.

First, the CSV file containing the 53 million addresses was geocoded using ArcGIS for Desktop 10.3 and Esri Business Analyst 2015. The processing took about 96 hours to run. Results were saved into a file geodatabase that ended up being 16.86 GB in size. The geocoding rate started out at 200,000 per hour, but then steadily increased to as much as 900,000 per hour, averaging about 550,000 per hour for the duration. 99% of the addresses were matched. To improve batch geocoding performance, please refer to the link below:

https://blogs.esri.com/esri/arcgis/2011/02/09/tuning-a-locator-for-improved-performance/

Next, determining the census block group 2000 and 2010 FIPS codes for each address was necessary. ArcGIS has several ways to accomplish this, including the tools spatial join, identity, and intersect. The 2000 and 2010 block group polygon boundaries were copied from the ESRI Data and Maps datasets into the file geodatabase holding the geocoded addresses. Each block group file has just over 200,000 polygons. The spatial join, identity, and intersect tools were run on the addresses and 200,000 block group polygons, and each time ArcGIS threw an “Out of Memory” error. Having run into this error before (ArcGIS running out of memory when attempting analyses with datasets numbering in the millions), a python multithreading script (included in “Python Multithreading Script” section below) running simultaneous processes was used to process a subset (~11.7 million addresses) of the full dataset against the block groups. The multiprocessing script uses the python PostGIS library psycopg2, referenced at the link below:

http://pythonhosted.org/psycopg2/

Python Multithreading Script:

import multiprocessing

import time

import psycopg2

from datetime import datetime

startTime = datetime.now()

print startTime

census_block_list = range(1,11722278)

conn = psycopg2.connect(“dbname=postgres user=postgres password=postgres”)

def update_points_in_census_block(census_id):

       cur = conn.cursor()

sql = “update ba2000 as points set fipsout = polygons.id from bg2000 as polygons where polygons.ogc_fid = “ + str(census_id) + “ and st_intersects(polygons.wkb_geometry, points.wkb_geometry);”

       cur.execute(sql)

       conn.commit()

if name == ‘main‘:

        pool = multiprocessing.Pool(processes=3)

        results = pool.map(update_points_in_census_block, census_block_list)

         pool.close()

         pool.join()

         print datetime.now() - startTime

Though this approach was extremely easy to implement, it was not so efficient due to the long processing time of roughly 2.5 hours per block group for the subset used. Therefore, we turned to a solution using the PostgreSQL. PostgreSQL version 9.4 can be downloaded and installed using the link below:

http://www.enterprisedb.com/products-services-training/pgdownload

The PostGIS extension was enabled and connected to the database using QGIS. Importing the addresses and census block group polygons into PostgreSQL took 30 hours. Therefore, an alternative method using a CSV file instead of geodatabase was used for import. First, the geocoded geodatabase was exported to a CSV file.  Then the CSV file was copied to a PostgreSQL table. This took only 6 minutes to complete which was much less compared to the time of importing a geodatabase. After the import was complete, an “Intersect” command was run to find the block group for each address. Processing the block group 2000 and 2010 determination for all 53 million address locations took about 14 hours.  Once complete, the results were exported back into CSV. See the “PostgreSQL workflow” section below, where each command is listed.

In conclusion, by experimenting with various processing techniques available in ArcGIS and PostgreSQL, an efficient process was developed to handle this big data geocoding and spatial overlay analysis task.  

PostgreSQL workflow:

# Connect to the database, QGIS connection is used

# Create an empty table with the same fields as the csv file

CREATE TABLE public.geocoding_result

(      “OBJECTID” oid,
loc_name character varying,
status character varying,
match_addr character varying,
x double precision,
y double precision,
altpatid double precision,
address character varying,
city character varying,
state character varying,
zip_code character varying   )

WITH (OIDS=FALSE);
ALTER TABLE public.geocoding_result
OWNER TO postgres;

# Import addresses, copy csv file to the table in PostgreSQL

COPY geocoding_result from ‘C:\Temp \geocoded_final.csv’ DELIMITERS ‘,’ CSV;

# Add Geometry field to the table

ALTER TABLE public.geocoding_result ADD COLUMN geom geometry(Point,4326);

# Update geometry with x,y values

UPDATE public.geocoding_result SET geom = ST_SetSRID(ST_MakePoint(x, y),4326)

#import census block groups from file geodatabase into PostGre

ogr2ogr -overwrite -f “PostgreSQL” PG:”host=localhost user=postgres dbname=postgres password=postgres” “~/Census.gdb” “blkgrp2010”
ogr2ogr -overwrite -f “PostgreSQL” PG:”host=localhost user=postgres dbname=postgres password=postgres” “~/Census.gdb” “blkgrp2000”

# Create (spatial) index (optional as importing will automatically build index) to increase performance

CREATE INDEX geocoding_result_gist
ON geocoding_result
USING GIST (wkb_geometry);

# Intersection for 2010 boundary(Make sure that addresses and block groups are in same coordinate system)

CREATE TABLE geofips AS
SELECT geocoding_result.,blkgrp2010.fips
FROM geocoding_result,blkgrp2010
WHERE ST_Intersects(geocoding_result.wkb_geometry,blkgrp2010.wkb_geometry);

# Rename new field “fips” to fips 2010

ALTER TABLE geofips RENAME COLUMN fips TO fips2010;

# Intersection for 2000 boundary

CREATE TABLE geofipsall AS
SELECT geofips.,blkgrp2000.fips
FROM geofips,blkgrp2000
WHERE ST_Intersects(geofips.wkb_geometry,blkgrp2000.wkb_geometry);

# Rename new field “fips” to fips 2000

ALTER TABLE geofipsall RENAME COLUMN fips TO fips2000;

# Add Column fips2010, fips2000 to geocoding_result

ALTER TABLE geocoding_result ADD COLUMN fips2010 varchar(12);
ALTER TABLE geocoding_result ADD COLUMN fips2000 varchar(12);

# Select non-geocoded results

CREATE TABLE Ustatus AS SELECT FROM geocoding_result WHERE status=’U’;

# Union two tables together

CREATE TABLE geocoding_final_result AS (SELECT FROM geofipsall UNION SELECT * FROM ustatus);

# Export result to csv

COPY geocoding_final_result(loc_name,status,x,y,match_addr,altpatid,address,
city,state,zip_code,fips2010,fips2000) TO ‘C:\Temp\geocoding_final_result_new.csv’ DELIMITER ‘,’ CSV HEADER;

# Export a sample result for reviewing

COPY (SELECT loc_name,status,x,y,match_addr,altpatid,address,city,
state,zip_code,fips2010,fips2000 FROM geocoding_final_result LIMIT 1000) TO ‘C:\Temp\geocoding_final_result_1000.csv’ DELIMITER ‘,’ CSV HEADER;

# End

no links

no file attachments

Share

Global Spatio-Temporal Search - NEH Funded Enhancements to WorldMap

Though millions of geo-service layers exist just a click away, there is no good way to discover most of them. In the early days of the web it was hard to find a web page; this situation holds today for most geo-data.

With a grant from the National Endowment for the Humanities, the CGA is building a platform to create, maintain, and deploy a global public registry of web map services, and has developed a new visual interface to support searches by time and space.  The technology can scale to millions, even billions of objects.

The registry will expose an API to enable any mapping system to find and bind to any layer in the registry.  The platform is open source.

New Geospatial Search Capability Funded by NEH (ALPHA) V.2 from HarvardCGA on Vimeo.

Links:
YouTube Video

Files:
search_ui_bd.png

Share

FAO AQUASTAT

Multiple global GIS and other data sets from the Food and Agriculture Organization of the United States including dams, irrigation, water use, and water resources.

Links:
AQUASTAT

no file attachments

Share

April 2016

CGA Newsletter April 2016 PDF Download

HIGHLIGHTS

  • 2016 CGA Conference: Space, Place, and Geographic Thinking in the Humanities
  • Call for Applications:
      2016 Fisher Prize and Davis Center Award Competitions
      Cartography Workshop Spring 2016
  • Upcoming GIS Technical Training Workshops
  • CGA’s Monthly GIS Presentations
  • 2016 EDC Student Award Winner Announced
  • Review of No Dig, No Fly, No Go: How Maps Restrict and Control, by Mark Monmonier
  • The Long Walk
  • 2016 ESRI Dev Summit Conference Notes
  • I/UCRC STC Activities at AAG
  • Harvard Map Collection
  • 2016 Esri Education GIS Conference
  • Geospatial Data Technical Assistant Position at Harvard Library
  • GIS Internship opportunity with HIP Consult
  • Flowing City Maps
  • US Primary Elections Map
  • A Fresh Look at Cuban Trade
  • More…

CGA NEWS

Last Call for Registration - 2016 CGA Conference: Space, Place, and Geographic Thinking in the Humanities

This year’s CGA conference is aimed at bringing humanists together with geospatial technologists and theorists, reviewing current status, achievements, lessons learned, unmet needs, challenges, potentials and perspectives of applying geographic analysis in the humanities. Invited speakers will present their on-going explorations, inspiring cases, and expert views across a range of domains and disciplines, and engage with each other and the audience in discussion and debate. Open to public. CGIS South Concourse level, 1730 Cambridge St., Cambridge, MA. The conference preliminary program is available online. To register, please click here.

Call for Applications:

  • 2016 Fisher Prize and Davis Center Award Competitions
    The 2016 Fisher Prize for excellence in GIS will be given to one graduate and one undergraduate student who must be enrolled in the academic year 2015-2016 and in good standing to be considered for the award. Group projects are allowed. The Prize is a cash award of $500 given to each student. The Davis Center Prize for GIS Projects in Russian, East European, and Central Asian Studies is also a $500 cash reward, and is being offered in conjunction with the Fisher Prize. For more on the Davis Center Prize, click here. The deadline for both prizes is 11:59 p.m. Monday, April 25th, 2016. To submit your work, follow the guidelines listed on the Fisher Prize page.
  • Cartography Workshop Spring 2016
    This full-day hands-on workshop focuses on the visual representation of geographic information. Cartographic fundamentals will also be introduced. May 4th, 2016. Room B129, Northwest Building. More information about this workshop can be found here.

Upcoming GIS Technical Training Workshops

CGA offers several non-credit technical training workshops related to GIS (Free to Harvard affiliates but registration is required). The workshops are:

  • Advanced Remote Sensing for Land Use Change Analysis, by Stacy Bogan, 1:00-3:00pm, April 8th, 2016
  • GPS Workshop, by Jeff Blossom, 1:00-4:30pm, April 13th, 2016
  • Wrangling Data into Maps - Longwood, by Jeff Blossom, 1:00-3:00pm, April 15th, 2016
  • Geoprocessing with ModelBuilder, by Giovanni Zambotti, 1:00-3:00pm, April 15th, 2016
  • Making Sense out of Spatial Data - Longwood, by Jeff Blossom, 1:00-3:00pm, April 22nd, 2016
  • Database Design for GPS/GIS Applications, by Nicole Alexander, 1:00-3:00pm, April 22nd, 2016

More information about technical training workshops and registrations can be found here.

CGA’s Monthly GIS Presentations-come join the discussion

  • GIS Colloquium
    “From Metropolitan Area to Megalopolis: How to Delineate China’s Functional Urban Areas with Geographical Open Data”, presented by Kang Wu, visiting scholar at CGA and associate professor at Capital University of Economics and Business in China. April 7th, 2016, 12:00-1:00pm. Room S153, CGIS South Building, 1730 Cambridge St, Cambridge, MA.
  • ABCD-GIS Presentation Series
    “Coding Efficient Disaster Recovery”, presented by Heidi Hurst, a senior at Harvard College studying applied math with a focus in navigation and geospatial analysis. April 21st, 2016, 12:00-1:30pm. Room S153, CGIS South Building, 1730 Cambridge St, Cambridge, MA.
 

2016 EDC Student Award Winner Announced

Congratulations to Heidi Hurst, a bachelor’s degree candidate for 2016 at Harvard College who has won the Esri Development Center Student of the Year Award for her submission: “A Geospatial Framework For Allocating Disaster Recovery Centers”. View the project abstract, technical report, project presentation, and judges’ comments here.

Review of No Dig, No Fly, No Go: How Maps Restrict and Control, by Mark Monmonier

Peter K. Bol wrote a review of this book to express his opinions from a historian’s perspective. This article can be accessed online through Harvard University’s DASH repository.

The Long Walk

An article in Esri’s Spring 2016 issue ArcUser publication on the CGA’s involvement in mapping the Out of Eden Walk.

2016 ESRI Dev Summit Conference Notes

Giovanni Zambotti of the CGA wrote these conference notes after attending the 2016 Esri Dev Summit.

I/UCRC STC Activities at AAG

The NSF funded Spatiotemporal Innovation Center successfully organized a Spatiotemporal Symposium in the American Association of Geographers (AAG) 2016 Annual Meeting at San Francisco. Read more

The CGA Newsletter is published monthly. Editors of this issue are Fei Carnes and Jeff Blossom.

CGA Home Page . Contact us . Follow us on Twitter


no links

no file attachments

Share

Coding Efficient Disaster Recovery

By Heidi Hurst

Noon - 1:00, Room K401, CGIS Knafel Building.

View a recording of the presentation.

View the presentation slides.

Presentation Abstract: For those who survive tornados, hurricanes, floods, or other disasters, Disaster Recovery Centers (DRCs) provide information about Federal Emergency Management Agency (FEMA) programs, access to resources from volunteer organizations and other agencies, and a source of hope.  This talk describes an innovative, quantitative approach to describing quality of DRC location choice and argues for the importance of scripted geospatial analysis in disaster recovery efforts.

 Speaker Bio: Heidi is a senior at Harvard College studying applied math with a focus in navigation and geospatial analysis.  Her project “A Geospatial Framework For Allocating Disaster Recovery Centers” was selected as Harvard’s Esri Development Center Student of the Year Award 2016.  Prior to her research for FEMA, Heidi researched predictive algorithms for lost person behavior for Search and Rescue Applications.

Lunch will be served.

no links

Files:
hurst.png

Share