Friday, April 22, 2016

Exercise 7

Goals


This blog post will go through the processes and results of the second part of exercise seven. The first part of exercise seven was covered in the recent python blog post. The goals of the second part were to perform network analysis. The given scenario was as follows: Transportation of frac sand from the mines to a railroad terminal will result in significant impact on local roads. In this exercise, you will route the sand from the mines to the nearest railroad terminal. We will then estimate the number of trips the trucks will take and the cost of the traffic on the local roads. 


Methods

First off, the data that was used in this exercise came from ESRI. They provided a street map in order to do the network analysis. 

Another note before we get into the methods, All of the data and cost results of trips to and from the railroads is hypothetical. The network analysis is simply being done to give us experience using network analysis.

All of the methods done in this exercise were done using model builder. This allows for an organized data flow model so that it is easy to visualize all of the processes being done. The first thing that needed to be done was create a Closest Facility Layer. Once the layer was added, the mines and rails were then put into the layer. The mines were classified as the incidences and the rails were the facilities. Once those were added, then the closest facility  was solved to show where the closest railroads were to each mine. 

Once the routes were copied into a new feature class, a tabulate intersection was preformed to basically join the tables of the routes and the Wisconsin counties. Once the new table was created, a field was added to convert the total route distances per county from meters to miles. There was then another field added which calculated the total cost per county of all of the trucking distance traveled. For the hypothetical situation, we assumed that each truck route is run 50 times to the railroad per year. This is multiplied by two to account for the travel back to the mine. We also then assumed that the cost per mile of a truck driving is 2.2 cents. And with this calculation, we were able to add another field that represented the total cost per year for each county. Figure one below is the model created to preform all of the tasks in exercise 7.

Figure 1


Results

The results of this exercise are represented by a final map. Figure 2 below shows the final map. I have also provided the final table showing the data that was output by the model above. Figure 3 shows the table. The final figure, figure 4, shows a graph giving another visual representation of the counties costs per year. We can look at all of the charts and data and see that Chippewa County has the highest yearly cost for trucking sand. This means that they have the most traffic from heavy trucks on their roads. This heavy traffic can result in the need to repave or redo streets in order to keep them usable. 
Figure 2

Figure 3

Figure 4



Conclusion

I would like to again point out that all of the data and calculations used in this exercise are purely hypothetical. That being said, it is very interesting to be able to apply network analysis to find something out like how much traffic areas are receiving from sand mine trucks. It shows that network analysis has a wide variety of uses. As the results show, Chippewa county had the highest hypothetical cost per year. If we take that knowledge and look at the final map, you can see why that may be. There are plenty of mines in the county but the rail lines that need to be used to transport the sand aren't located in a very convenient location. 






Friday, April 8, 2016

Data Normalization, Geocoding, and Error Assessment Sand Mining Suitability Project

Goals

The goal of this assignment is to geocode the locations of all the sand mines in Wisconsin and compare our results to the actual locations and the geocoding results of our classmates. Each student was given around 16 mines to geocode to ensure that we would be comparing to multiple peoples geocoding.

Methods

The data that was provided was in the form of an Excel table. Each student was assigned around 16 mines to geocode. The first step before geocoding was to normalize the address table. We needed to do this because some of the mines were given in a PLSS address, some were given in a street address, and some were given in both. In normalizing the table, the addresses were separated. Some of the street addresses were missing data like a zip code or street name or city. Because of this, the addresses were normalized further.  

Using ArcMap, we signed into the enterprise account for UWEC. This allowed the geocoding to be done. The first step in the geocoding was to add the table to ArcMap. The software is then able to use its geocoding tools to find addresses within the table. The original geocoding of the table was very messy because of the in-completion of the data. Before manually finding the mines, the mines not assigned were separated out. Once the assigned mines were all that was left, it was time to go through each and every mine to find its actual location. 

First, looking for mines with complete street addresses were checked for their accuracy. Ideally, the geocoding was successful. In a couple cases, the address locator couldn't find the address. The next step is to manually find these addresses. In most cases, Google Maps can in handy finding the mines. It is easier to navigate and cover ground faster. If there was an address that was not complete, and no PLSS was given, all of the mines locations were given in latitude and longitude. Once all of the street address mines were located, then the PLSS mines needed to be located. 

The PLSS shapefile was added to ArcMap. All of the PLSS addresses were given and relatively easy to find, but finding them was a long and tedious process. The PLSS address were given in a series of directional units and numbered squares. 

The second part of the assignment was to gather the data from the students who geocoded the same mines as us. First, it was necessary to add all of their mines into ArcMap and observe the data. There were visible differences in the geocoding. The next step was to merge all of the different classmates data into one feature class. 

Once the data was merged, the actual mine locations shapefile was added into ArcMap. After comparing my own and my classmates geocoding to the correct locations, I generated a near table. This gave the distances from my geocoding to the correct locations and also the distance from the classmates geocoding to the correct. The results are shown in the next section through two maps and tables. 




Results


The following map shows the differences between the mines that I mapped and the mines that my classmates mapped. As you can see for the most part, the mines are in the same general area. But in reality, there is quite a difference once we look at them at a larger scale. You can also notice that there is a point located on the East side of the state. Figure 2 is a table that maps the difference between all of our data points in meters.
Figure 1. Map of Wisconsin showing geocoded mines

Figure 2. shows the distance between mapped points in Figure 1.



Below, Figure 3 shows the difference between my mapped mines and the actual location of the mines. For the most part, my mines are in the general location with some of them located in the correct spot. However, like the map above, at a larger scale the errors will be shown more. Figure 4 shows a table with the distances from my mapped points to the actual mapped points in meters.

Figure 3. Shows my mapped points compared to the actual mine locations.


Figure 4. Shows the distance in meters between my points and the correct points. 



Discussion

After doing this assignment, it is easier to look at data and think about potential errors in the data. From the start we were told the the data was going to have some errors in it regarding the addresses. Some of the addresses would be incomplete. I even found some addresses where the street numbers just had two numbers that were switched. Errors like these can come up at anytime when working with GIS data. The majority of the errors in this assignment would be classified as operational errors. Operational errors are mostly contained in the collecting and managing of data. The main issue of this lab was to geocode data with operational errors inside the data. The inherent errors appeared in the results of our geocoding. It goes along with looking at and interpreting the results. Because the real world is far more complex than our mapping capability, inherent errors occur. For example there were a couple of mines that, when looked at through an imagery base map, didn't look like they were actually located where they were supposed to be. This is all part of GIS, knowing that not everything is going to work perfectly is very important. 





Conclusion

This exercise really showed the difficulty with geocoding and working with error filled data. It showed that no matter what, things are not going to always work smoothly. We need to learn to work around errors and know how to fix them. It also showed the importance of keeping data in an organized fashion.