Massachusetts Institute of Technology
Department of Urban Studies and Planning


11.520: A Workshop on Geographic Information Systems
11.188: Urban Planning and Social Science Laboratory

Homework 2

Extracting & Querying Census Data plus Site Suitability Analysis

Distributed: Wednesday, October 6, 2010
Due: (at start of class/lab)

Wednesday, October 20, 2010 - Question #1
Wednesday, November 3, 2010 - Question #2

Recitation 2

Introduction to MS Access Basics

Recitation 3

Site Suitability Analyses; Dataset for Recitation 3

INTRODUCTION

The main focus of this homework is Question #2 where we undertake a classic Ian McHarg-style 'site suitability analysis'. See this note on spatial analysis "classics" by John Corbett, Ian McHarg: Overlay Maps and the Evaluation of Social and Environmental Costs of Land Use Change, at the website of the Center for Spatially Integrated Social Science http://www.csiss.org/classics. For our hypothetical 'site suitability analysis' example (in question #2), we will endeavor to find suitable locations for a Cambridge senior center by overlaying the locations of those sites that meet each of several criteria: close to major roads, near the target population, far from hazards, etc.

However, this homework also serves to improve your facility with data manipulation tools including use of MS-Access and the 'summarize' tool in ArcMap. Question #1 focuses on database manipulation and analysis using some of the census data involved in Question #2. There are only two questions in this homework set but each is more complicated than you might think because of all that is involved in decomposing them into doable steps that work as you would expect.

BEFORE FIRING UP ARCMAP - AND LONG BEFORE THE DUE DATES FOR EACH PART OF THE HOMEWORK - PLEASE BACK AWAY FROM THE COMPUTER AND READ THROUGH THE WHOLE ASSIGNMENT. Getting the 'big picture' first will help you develop a better GIS strategy and will reduce time and energy wasted.

PART I: QUESTION 1 [40 points]

The following table of Senior Citizen poverty statistics was developed from the 2000 US Census using data from Summary File 3 (SF3) - the tabulation discussed in class and Lab #5.

DATA

Question 1a: Fill in the empty cells in the following partially completed table and answer the questions below. Instead of using ArcMap, you are free to use other database management software (such as MS-Access). Note that you will have to make careful use of the 'summarize' command in ArcMap (or equivalent 'total' or 'group by' commands in MS-Access or other database managers) in order to compute the correct values for many of the columns. Note, also, that all five towns are in the same County, namely Middlesex County in Massachusetts.

Summary of Senior Citizen Poverty Statistics for Cambridge and Abutting Towns, 1999
Town Number of BG's Number of Seniors with Known Poverty Status Average of Percentages Calculated for each Block Group
Total Populated Block Groups Block Group's with seniors with known poverty status

Total
# with known Status

# Below the Poverty Level

Percent
Below

Arlington 44       387    
Belmont 27 27         5.41
Cambridge 80     9,051   12.88  
Somerville 67       1,063    
Watertown 29 29 29 5,159 388 7.52 6.77
Overall 247     32,967   9.74  

Now, take a look at the last two columns and note that the overall percentage of impoverished seniors within a town is generally different from the town average of the percentages that one can calculate separately for each block group within that town. Answer the following three questions (in a paragraph or two each).

Question 1b: Explain briefly why the overall average and the average of the block group percentages are likely to be different.

Question 1c: In this instance which method (that is which of the last two columns) tends to be larger? Why?

Question 1d: Which set of numbers are most appropriate for use in our analysis for Question #2? Why?

Question 1e: When calculating the percentages, think about which denominator to use (i.e., number of block groups, populated block groups, block groups with seniors with know poverty status, number of seniors, etc.) List the block group IDs for those block group(s) with no population, and for those block group(s) for which the poverty status of seniors is not known. [To clarify this last part of the question, we mean those block groups with some population but with no seniors for whom the poverty status is known.]

Turn in the table with all your computed values added and write at most a paragraph or two for each of questions 1b, 1c, and 1d.

Tips: Filling in the table looks simple enough -- just run some of those 'Field' commands for the requisite block groups. But it will take some thought to determine what each column is measuring, which census rows to include and how to do the summarizing. Another tricky part of this question is figuring out which block group is within which town. The block groups can span a town boundary. For this exercise, we define a block group to be "in" a town if the centroid of the block group is inside the town boundary. Using ArcMap's "Spatial Join" functionality sounds like a good idea, but you will have to choose the approriate spatial operator to associate each block group with its town. (See Appendix I below for information about various ways to examine the spatial relationship between block groups and towns.)

PART II: QUESTION 2 [60 points]: The site suitability analysis

A local non-profit group is interested in locating a site for building a senior center in Cambridge. Given your expertise in GIS, you are hired as a GIS analyst by this company to help them locate the best site. After a long meeting with the organization and the community you agree to run some numbers in order to get a handle on the locations and characteristics of potentially suitable Cambridge sites. You settle on the following criteria to get rolling with your site selection process:

  1. The minimum area of land needed for the project is 1 contiguous hectares (1 hectare = 10,000 square meters = 2.471 acres, and 1 acre = 43,560 square feet).

  2. Ideally, the site should be located near, but not in, a residential neighborhood.

  3. Accessibility to the project is a major concern for the organization, especially given the often limited mobility of seniors. You determine that the project should be located within 250 meters of a major road.

  4. The organization is also worried about health risks. They decide that they want the site to be far (more than 300 meters away) from Toxic Release Inventory (TRI) sites as identified by the Environmental Protection Agency (EPA) data from their toxic release inventory databases.

  5. Accessibility by seniors with limited financial means for joining private clubs is deemed especially important. Therefore, you decide to narrow the criteria to focus on census block groups where:

  6. In order to assure access to the senior center to a large number of senior citizens, we are interested in looking at the data for the four towns (north of the Charles River) that border Cambridge -- Arlington, Belmont, Somerville and Watertown, as well as Cambridge itself. However, because the political support for the facility is so strong in Cambridge (the Mayor and Council being behind it), the facility should be located within the Cambridge city limits.

Use ArcMap and the various layers that we have used in class exercises to undertake a basic site suitability analysis. (Feel free to augment your maps with some other map layers stored in the class data directory in order to improve the visual quality of the presentation - but stick with the site selection criteria listed above).

Question 2a: First, prepare 3 maps showing the locations that are acceptable based on the criteria identified above. Map these criteria separately:

  1. proximity to major roads and distance from the TRI Facilities.
  2. appropriate land use characteristics
  3. block groups having a high percentage of seniors below the poverty level.

Question 2b: Next, prepare a fourth map that shows those Cambridge locations that meet all these criteria as well as the 1-continguous-hectare constraint.

Question 2c: Along with the maps, provide a page or two (not a treatise) of discussion concerning:

  1. any choices or interpretations that you made in generating the suitable locations and which you feel bear some explanation, and
  2. your conclusions from this initial analysis regarding suitable sites. Regarding your conclusions, don't just pick one site--there isn't a definitive 'best site' given the criteria we've suggested. Be sure to include some interpretation regarding the extent to which the analysis helps you focus on one part of town, on one or another criterion, on a set of proximity issues that might forecast special interest concerns/complaints, etc. Would you suggest tightening or relaxing some of the criteria? Can you suggest important considerations that are not well captured in this suitability analysis

Hand in a discussion of the answers to question 2c along with the four maps.

 

FAQ and other Suggestions

Need Help?

If you need help, and you think that your question might be of interest to the whole class, send e-mail to 11.520@mit.edu. If you would prefer to ask just the class staff, send e-mail to 11.520staff@mit.edu. Please don't be shy about asking the staff for help if you are struggling with understanding the assignment or having trouble with a particular aspect of ArcMap. We've heard many stories of students struggling for many hours over minor issues that had easy solutions. We'd like to avoid these misadventures, so please contact us sooner rather than later if you get stuck. (When you finally get unstuck, spend a moment reflecting on what you could have done to get unstuck earlier if you had the vocabulary, GIS understanding, or roadmap of ArcGIS to use the various help files and references.) Also, we recognize the value of group work and encourage study group discussion of the homework as well as lab exercises - but we require that you turn in individual work that reflects your own learning and hands-on discovery.

 


Appendix I: Determining which block groups fall within which town

As indicated in the homework text, the Massachusetts town boundaries do not line up precisely with the US Census block group boundaries even though, in almost all instances, the block groups fall with a single town. These differences are known as a 'sliver problem' and, in this case, the reason for the problem is that the census block group boundaries are much less detailed and precise as the Massachusetts Town boundary layer. The following graphics illustrate the problem. [They use a shapefile fivecities.shp containing the borders of the five municipalities in and around Cambridge and north of the Charles River. This shapefile was exported from the MassGIS matown00.shp shapefile and is available in the class data locker.]

 

For your convenience, we have provided in the ms-access database, hw2_sf3_lite.mdb, a cross-reference table, blkgrp2t, that was created by doing a "Select by Location" operation for each of the five towns, with "have their center in" as the topological relation. The table has two columns: bkg_key has the state+county+tract+block-group identifier and town has the corresponding town name. For example, the graph below shows the block groups that "have their center in" Cambridge. A new field "Town" is created and its value is set to be "Cambridge" for those block groups.

Slivers at town boundaries

This method works when only a handful towns are considered. If you want to analyze hundreds of towns, you don't want to undertake this town-by-town selection process by hand. There are more advanced tools to solve this problem such as creating a new shapefile that contains points representing the centroid for each block group in Middlesex County and then doing a spatial join between this new Centroid layer and the town layer. However, creating the centroid layer from the blockgroup shapefile is too much of a distraction for this homework set. (We will use this example later in the semester to demonstrate the use of simple VBA scripts to add the X,Y coordinates of the centroids to the attribute table, so we can then save these columns as a new data table, and then use the Tools/Add-XY-data option in ArcMap to make a point shapefile of the centroids. Doing all this is too complicated and distracting to be part of this homework, but each step is well explained in the ArcGIS help files.)


Written in 1996-2001 by Kamal Azar, Joseph Ferreira, and Tom Grayson
Modified by Myounggu Kang on 10 October 2002 incorporating Census 2000
Modified 2003-2010 by Jeeseong Chung, Jinhua Zhao, Shan Jiang and Joe Ferreira.
Last modified by Joe Ferreira on 6 October 2010, and Shan Jiang on 17 October 2010

Back to the 11.520 Home Page. - Back to the CRON Home Page.