Urban Agglomeration
this an old idea from urban planning & urban economics, sometimes also referred to as economies of scale
Urban Agglomeration
this an old idea from urban planning & urban economics, sometimes also referred to as economies of scale
Geopolis
interesting
Denise Pumain,
good source - you may find other works by this author that investigate urbanization as a complex system and cities being described in terms of their fractalization and power law distributions
At a 1-km resolution, these new data are not just numbers estimating a country’s population
good
population census abstracts (PCAs), which describe the population of different settlements, the number of households, and additional characteristics
good
gridded estimates of population at a high resolution
good
slums now account for about 26% of all urban population in cities
significance
Before rushing to create a sustainable urban planning, it is necessary to understand urbanization first.
good
These are superb annotations. Your theme and focus has been clearly established. Topically boundaries also defined. A few thoughts.
(1) Prioritize the methods from those that you think present the most promise to the least. Your review has captured some examples that border on both machine learning as well as spatial statistics. This is really impressive. You have also accomplished this very definitely within the context of a clearly articulated human development problem. Just keep looking for new, state of the art methods that present the most promise. Are you familiar with AI Nigeria? GRID3 Nigeria? There is really interesting stuff coming out of those groups in conjunction with Southhampton and Columbia.
(2) Try to narrow your focus a bit. Are you focusing on the intersection of poverty reduction and food security? education? gender equality? Formally articulating your central research question promote this continued narrowing of your focus.
Just keep going!
studying poverty in a multidimensional way rather than a narrow view point
super!
spatial econometrics techniques
excellent
Additionally, studies treat geographic units as independent units rather than acknowledging them as entities surrounded by other units to which they interact with.
excellent - identifying interactions is really important. Are you taking stats?
The authors suggest that this is because of the current way poverty studies are carried out which fails to see the complexity of poverty.
development is not the only outcome that may emerge from Nigeria's highly complex and adaptive, social and economic system
structured additive regression (STAR) model
good
Gayawan, Ezra, Adebayo, & B., S. (1970, January 1). Spatial analysis of women employment status in Nigeria.
Great source. Focusing upon gender inequalities is arguably the most significant first step to take in order to reduce poverty.
geo-spatial covariates were uses to measure the vegetation index, aridity, land-surface temperature, brightness of nighttime lights, and estimated travel time to the nearest functioning water source
excellent
lack of detailed and reliable geo-referenced data inhibits researchers ability to compare levels of cleanliness and sanitation and to find patterns
good
Pokhriyal, N., & Jacques, D. C. (2017).
good source
Gaussian Process Regression process
excellent
the current way of estimating poverty is through household surveys which is costly, time consuming and is more efficient when working with a small amount of households
excellent
spatial, temporal, and individual aspects of the select group
good
food security, economic activity and call data records
good
Nigerian Local Government Area Geodemographic Classification system
interesting
but the only way to come up with effective solutions is to have as accurate of data as possible
good
The rate of death is 3 times higher for those below the poverty line between the ages of 25 and 64.
significance - good
Over 90 million civilians, roughly half of the population, lives in extreme poverty and even more experience chronic poverty
harms & significance - good
This is excellent work. Some thoughts.
(1) You have identified a wide array of potentially viable and effective paths forward. You might want to prioritize the many methods you have identified.
(2) Can you further describe the methods in detail? How is the CDR data being used? How do gravity, radiation, and impedance models functions? What are voronoi polygons? What is state of the art for using CDR data to describe, analyze, model and then predict human movement during an infectious disease outbreak? Just keep pushing forward with your analysis in this area. Also, have a look at flowkit. There's a GitHub repository that will be useful.
(3) I think you could continue to focus on describing the process itself. Why is it important to model human behavior using cell phone data? How does understanding human movement help us to better address an outbreak? How could this information be used in a time of crisis to expedite solving the problem? You've have captured multiple very significant ideas that go straight to inherency, but now organize and structure them in order to describe the process to better inform response.
Just keep going!
One less studied aspect of human mobility is seasonality
super
universal movement models’
good
The impedance model is a parameter-free model adapted from Ohm’s law of electricity. It’s actually quite a simple and ingenious method of estimating human mobility potential.
excellent
susceptible-infected-recovered (SIR) framework
excellent
This study introduces a new impedance model and compares the performances of it with the gravity model and radiation model.
excellent
Sallah, Kankoé, et al. “Mathematical Models for Predicting Human Mobility in the Context of Infectious Disease Spread: Introducing the Impedance Model.” International Journal of Health Geographics, vol. 16, no. 1, 2017, doi:10.1186/s12942-017-0115-7.
great sources
Mathematical modelling, while not as necessary in areas which already have an abundance of current data, has a lot of application in low-income areas with low surveillance.
good
CDRs cannot measure very fine spatial movements
good
(CDRs) and mobile location history
good
Lai, S., Farnham, A., Ruktanonchai, N. W., & Tatem, A. J. (2019)
excellent source
In conclusion, the study demonstrates that there is potential in using national household data to provide a measure of treatment using a Bayesian method.
good, are you able to further describe the method?
Gelman-Rubin and Raftery-Lewis test was used, with the Raftery-Lewis test taking a minimum of 55,318 iterations to achieve an accuracy of 0.0005 at a coverage of 0.999.
good
Bayesian IRT approach
good
Alegana, V. A., Wright, J., Pezzulo, C., Tatem, A. J., & Atkinson, P. M. (2017). Treatment-seeking behaviour in low- and middle-income countries estimated using a Bayesian model. BMC Medical Research Methodology, 17(1). doi: 10.1186/s12874-017-0346-0
good source
OK good. Some thoughts.
(1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? How would such "what if" applications add value to your investigation?
(2) Can you further articulate your problem statement? Which aspect of development will you focus upon with your continued investigation? Have you identified a gap in the literature? How about a tentative research question?
Just keep going!
low-density settlements are evolving and collected satellite data from 1986 to 2000
interesting
Digital Elevation Models (DEM), Landsat Enhanced Thematic Mappers (ETM), and even OpenStreetMap to collect data
good
random sampling to reduce the sampling so that no spatial autocorrelation
good
Very High Resolution (VHR) data could be used in coordination with imaging software to marked areas as either Main Roads, Roads, or Private Land
good
Having lived in Ulaanbaatar from 2010 to 2013
how interesting
Super. Some thoughts.
(1) You could probably go two different directions with this investigation. I think a lot of the current research regarding malaria and vector borne diseases revolve around using CDR data with gravity models, radiation models and impedance models. You may also find an agent-based model of transportation systems and human movement that incorporates predictions of malaria prevalence. It could also be possible to focus on the natural environmental raster data (precipitation, water, climate factors) as predictors in a machine learning method, which is more of the direction you have taken with these annotations. While spread of the disease can be predicted by modeling transport behavior, at its origin, most countries that have eradicated malaria and similar types of vector borne diseases have done so through public works efforts that seek out to spray insecticides that kill mosquito larvae in standing water as breeding areas (is this the case in Cambodia?). If it is possible to more accurately (spatially) and then effectively identify these source locations, potentially there could be great progress towards eradicating the disease.
(2) I have also seen a few studies that seek to model the mosquito population itself. I think the limitation of that approach has been largely computational due to the sheer numbers, the scale of the insect and the environmental and natural factors associated. Maybe that has recently changed.
(3) I still wonder if climate change is increasing malaria disease prevalence. Are the mosquitoes and parasites manifesting in locations across Africa where they previously hadn't been observed? Are they more prevalent in China? In other locations throughout the globe?
(4) Are there any other machine learning or data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation?
Just keep going!
analyze
did they use a model? perhaps CDR data?
Lai, S., Sun, J., Ruktanonchai, N. W.
good source
OpenGeoDA 1.0.1
good platform from Luke Anselin
The researchers were also able to determine low-low, high-low, low-high, and high-high areas. Low-low areas are defined as areas with areas with low resource density surrounded by areas with low resources, and high-high clusters are areas with high resource per capita surrounded by areas with high per capita, and so on
interesting
decision support systems
good
Cox
I think this is the same guy who is famous for using STATA, which for a long time was the biggest competitor to R as statistical software (I used STATA for a long time before switching to R)
Malaria Information System (MIS)
good data source
“knowledge-driven modeling methods”, including spatial multicriteria decision analysis (MCDA) instead
good
This leads to farmers attempting to micromanage the outbreak by attempting to disinfect their local area or sell off the supposedly infected poultry to other villages to not lose profit.
interesting, goes to inherency
These researches call for a better mapping of poultry / avian densities in the form of “risk maps”.
good
Dengue spatio-temporal diffusion patterns and hotspot detection are crucial in tracking and seeing patterns in dengue cases.
excellent source and annotation, a bit dated but still useful and effective, especially as a direction marker
Hotspots for dengue cases were showcased in a Moran scatter plot matrix and a LISA cluster map of dengue fever mortality rate
excellent
empirical Bayes smoothing method
interesting
GIS technologies and spatial analysis
good
malaria, dengue is a vector-borne illness that is spread by mosquitoes
good to consider both
examine diffusion patterns and hotspot identification
excellent
Vector-borne illnesses like malaria are also influenced by a wealth of other factors, including rainfall, temperature, or changes in agricultural practices.
good
Excellent work. Some thoughts:
(1) Narrow your focus in terms of the problem you are seeking to address. Investing a bit more time with regard to your problem statement will help you to narrow your focus and further articulate your research question.
(2) Kriging as a data science method is a good path forward. It will be interesting to see what you come up with in terms of how this spatial statistics methodology is reinforced with machine learning. Use of point process models (Baddeley) is another area of active research that might be find usefulness in the context of your work. Spatial random forest is a kind of next generation spatial statistic methodology that synthesizes machine learning.
(3) Are there any other machine learning or data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation?
Just keep going!
Kriging interpolation method
very good, do you think a machine learning approach might present additional value? perhaps random forest, or spatial random forest?
A mere 1 standard deviation increase in NO2 and coal-related pollution has created a 10% and 11% increase in negative perception of the issue.
good interpretation
linear regression model
ok
spatial distribution of forestry, the largest land loss took place in An Giang where 11,713 ha of was lost over the course of 2 and a half decades
good spatial description
Boundary, Elevation, Transportation, Hydrology, Urban areas, and Vegetation
good
Mekong Delta Geospatial Information Systems (MGIS)
good
Excellent work. Some thoughts:
(1) Narrow your focus in terms of the problem you are seeking to address. Degradation of water quality will likely involve questions that address either non-point or point sources of pollution. Which ones are the most significant offenders? You could also narrow your focus in terms of water use. Are you interested in potable water? Urban water use? Agricultural water use? Investing a bit more time with regard to your problem statement will help you to narrow your focus and further articulate your research question.
(2) Kriging as a data science method is a good path forward. It will be interesting to see what you come up with in terms of how this spatial statistics methodology is reinforced with machine learning. Use of point process models Baddeley) is another area of active research that might be find usefulness in the context of your work.
(3) A superb use of a webpage to produce your annotations. Please confirm the links on your index page are functional.
Excellent thematic focus. Clearly defined topic. A few thoughts.
(1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? Perhaps looking at Utazi, Alegana and some other authors on WorldPop and flow minder might present some options. You do have the Alegana paper from 2015 here, which is good. I think hierarchical bayesian model present the most promise for your investigation. How would such "what if" applications add value to your investigation?
I'm going to send you a previously students literature review on this topic to support your work.
(2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?
Just keep going?
They used spatio-temporal models, in order to collect specific population counts from a given area at a given time. They were able to calculate the error of these models to be very small. They also used model-based geostatistics to estimate population.
good, are you able to further describe their model?
Alegana, V. A., Atkinson, P. M., Pezzulo, C., Sorichetta, A., Weiss, D., Bird, T., … Tatem, A. J. (2015)
good source
This is a basic right that every woman should have access to, which takes Sen’s “friendly” approach to human development.
excellent
Data Science could increase the quality of life for many people, and give women and children especially the freedoms they deserve.
Excellent problem statement - one comment, GIS typically refers to software that ultimately enables users to advance spatial description, analysis and in some cases also models and then prediction (forecasts). It is also typically very effective at communicating spatial data. Spatial data is by nature very complex, thus for a long time, GIS has simplified the complex nature inherent to this type of analysis. At its core, you can think of GIS techniques translating as spatial description, analysis, modeling and prediction -- or perhaps more simply put, spatial statistics. Adding the human or environmental dimension, simply further defines the domain of this spatial inquiry by adding a geo- prefix to the spatial, thus geospatial statistics.
The neonatal mortality rate accounts for 41% of under-five-years old deaths
wow, it really is staggeringly high
Specifically in Ethiopia, 80% of newborn deaths are caused by treatable cases
good
OK good. Two interesting possible directions. Some thoughts.
(1) Are you able to further describe one or more of the machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households, persons and accessibility to primary, secondary or higher education services? Perhaps a hierarchical bayesian model (you mentioned two different ones in your annotation) could be used to describe demographic and economic attributes of persons and households? How would such "what if" applications add value to your investigation?
(2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?
Just keep going!
Once the authors decided on that, they took data from Vietnam’s 2006 national census and combined it with raster data from remote sensing and GIS data on administrative boundaries and completed hierarchical cluster analysis to figure out what areas of Vietnam are considered to be one of the four levels listed above. Once that was completed, the data was transferred onto a map and checked for accuracy through the use of round truthing, remote sensing analysis, and road network analysis.
good
it seems as if both primary and secondary schools are not being distributed appropriately to best suit the population in each sector
interesting
Bayesian model-based geostatistics along with high resolution gridded spatial covariates, both of which were applied to GPS-located household survey data that recorded information on poverty and was retrieved from DHS and/or LSMS programs
excellent
only about thirteen percent of them were actually accepted and received the chance to go
interesting, I hadn't seen this study before -- a bit dated, but good for your thematic / conceptual focus
OK good. Nice thematic focus and problem statement. Likely would benefit from additional emphasis on data science methods. Some thoughts.
(1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation?
You might find a few here.
https://scholar.google.com/citations?user=3qWNbcEAAAAJ&hl=en
Even if the study is not exactly Rio, you can still think about how to apply a method in order to surmise how it would advance understanding.
(2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?
Just keep going!
Brazilian Household Sample Survey (Continuous PNAD)
good
Urban Health Index (UHI) for different neighborhoods in Rio
good
Great stuff. Some thoughts.
(1) You could probably go two different directions with this investigation. I think a lot of the current research regarding malaria and vector borne diseases revolve around using CDR data with gravity models, radiation models and impedance models. You may also find an agent-based model of transportation systems and human movement that incorporates predictions of malaria prevalence. It could also be possible to focus on the natural environmental raster data (precipitation, water, climate factors) as predictors in a machine learning method. While spread of the disease can be predicted by modeling transport behavior, at its origin, most countries that have eradicated malaria and similar types of vector borne diseases have done so through public works efforts that seek out to spray insecticides that kill mosquito larvae in standing water as breeding areas. If it is possible to more accurately (spatially) and then effectively identify these source locations, potentially there could be great progress towards eradicating the disease.
(2) I have also seen a few studies that seek to model the mosquito population itself. I think the limitation of that approach has been largely computational due to the sheer numbers, the scale of the insect and the environmental and natural factors associated. Maybe that has recently changed.
(3) I still wonder if climate change is increasing malaria disease prevalence. Are the mosquitoes and parasites manifesting in locations across Africa where they previously hadn't been observed? Are they more prevalent in China? In other locations throughout the globe?
Just keep going!
Random Forest
good
Geospatial datasets used include health districts of Namibia, CDR data, and various spatial covariate datasets representing rainfall, temperature, elevation, distance to water, etc.
good
CDRs
good
This was accomplished thorough raster files with 5 x 5 grid cells of malaria prevalence. On top, higher resolution of 1 x 1 grid cells were placed that represented population density.
good
Ultimately, malaria is a deadly disease. Since curing malaria is a difficult task, prevention is of the utmost importance.
very good
night-biting Anopheles mosquitos
good, the "night-biting" aspect is often lost on investigators -- has that been definitively proven?
Excellent work. Some thoughts.
(1) You have identified a wide array of potentially viable and effective paths forward. Some of the methods are based more in traditional statistics (poisson, SIR, multinomial logistic regression) but still present promise, especially if they are leveraged with a learning algorithm (MNLR) or possibly a process model (poisson). I'm also wondering about SIR models for simulating an infectious outbreak (using ODEs or an agent-based model).
(2) You might also want to narrow your focus. Taking into consideration where most research is currently "placing its bets" is not a bad way to asses how to best invest your time moving forward. I suggest continuing to focus on CDR data, gravity models (and their variants), radiation models, impedance models, use of voronoi polygons and possibly measures such as radius of gyration (although this may be more effective in the context of a response to a natural disaster). What is state of the art for using CDR data to describe, analyze, model and then predict human movement during an infectious disease outbreak.
(3) I think you could continue to focus on describing the process itself. Why is it important to model human behavior using cell phone data? How does understanding human movement help us to better address an outbreak? How could this information be used in a time of crisis to expedite solving the problem?
Just keep going!
All of the data regarding the land and terrain was obtained using satellite imagery but was soon converted into various indexes (depending on which variable used).
ok, what kind of assessment do you give this source?
SIR model
good
The data shows that the accuracy of using a random forest model was slightly lower than other models such as a logistic regression model.
excellent, I wonder why?
spatial autoregressive modelling approach which was used to compare the results (number of observed Ebola cases) seen from the spatial and non-spatial disparity for each covariate
excellent
Poisson model
good
They wanted to understand if there was a relationship between the Ebola virus and different environmental and socio-demographic variables.
good
investigating how migration patterns change (using cell phone data) after a travel ban has been put on the country
good
global positioning systems and call detail records (along with cell towers)
good
radiation model
good
call detail records’ (CDR’s)
good
There have been studies done regarding an extremely effective vaccine used for the virus, but the vaccine was not introduced to many African countries until after the large outbreak
interesting
Ebola is transmitted through direct contact with a number of body fluids including the blood and vomit of an infected person.
good, describes process
surveillance measures
good
it is possible to not only know where the disease is spreading, but why it is spreading
very good -- perhaps also how it is spreading?
detail records, geospatial data, and modeling the data using diverse models such as the ‘Poisson Model,’
excellent
Good work. Some thoughts:
(1) Further define your focus in terms of the problem you are seeking to address. Will you investigate water use / scarcity in terms of residential demand? agricultural demand? Are you also thinking about water degradation? Are you mostly interested in potable water? Urban water use? Investing a bit more time with regard to your problem statement will help you to narrow your focus and further articulate your research question.
(2) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households?
(3) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?
Just keep going!
groundwork for the rest of the calculations
interesting, I'm wondering if there are other methods that may present more promise in describing, analyzing and predicting water use / water demand?
What physical changes can Palestine make in the next 7 years in order to avoid a total water crisis?
good, beginning to sound like a general research question
data envelopment analysis
interesting
More specifically, it can be classified as violating 3 of Sen’s 5 “instrumental freedoms’’ including political and economic freedoms, as well as “transparency guarantees.”
good for thematic and conceptual purposes
Good work. Some thoughts.
(1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a spatial probability density function could be used to identify boundaries of de facto urban areas? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation? The last two sources seem to introduce these types of methods. Perhaps you could further articulate details from each of these sources?
(2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?
Just keep going!
CDR and RS
very good
build predictive maps of poverty using combinations of CDR and RS data
excellent, could you further describe the model and methodology?
Steel
good source
The materials and methods used to collect and create data were CDRs
did they present a model?
CDRs
good
DMSP
good data source, VIIRS is also used
Nighttime lights
good data source
good health and well-being, gender equality, reduced inequalities, no poverty
ok good
There are already efforts currently targeting poverty, including anti poverty agencies. So, how can the Big Data be used to eradicate poverty?
good
OK good. Excellent thematic focus and problem statement. Some thoughts.
(1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a spatial probability density function could be used to identify boundaries of de facto urban areas? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation?
(2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?
Just keep going!
poverty reduction plans
ok good
hotspots
how did they calculate and identify these hotspots?
von Braun
my old boss!
Gatzweiler
my colleague!
I think this article will help with my general topic of the poverty assessment in Ethiopia.
could you further elaborate on the method?
The author states that by “integrating spatial measures” with data, access to services, water, transportation routes, etc., we could lead ourselves to a more complete understanding of the human development process in Ethiopia
Excellent - is this a spatial probability density function (sometimes aka kernel density estimation)?
public expenditures by mapping the exact location of where the poorest populations live
good - targeting public expenditures
spatial representation of poverty
good
Small Area Estimation
ok good
mapping poverty
good
Problem Statement
very good problem statement -- could you further quantify the significance of these harms within the context of Ethiopia?
OK good. Some thoughts.
(1) Are you able to further identify the data and the variables used with some of these investigations related to nature disaster response in the Caribbean? Are you able to identify names of methods used? names of models used? Some of the methods and models that will likely surface are as follows.
(2) Further specify your topic. Is there a particular type of natural disaster / event that you are investigating? Hurricane? Earthquake? Disease?
(3) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?
Just keep going!
data shows how residents of Puerto Rico moved around before and after the disaster.
good - but I think assuming the user opted into sharing their location data through twitter?
user’s age, sex, and what area of Puerto Rico they lived in were also taken into account
good
using cell phone data to track movement patterns of individuals to see where they went prior to the earthquake and the areas they went post earthquake. The data shows that dispersion post earthquake was similar to travel before the earthquake
good. Did they use a model in their research?
Bengtsson
good source
Many natural disasters plague the Caribbean and its countries have a difficult time recovering from these disasters
edit
OK good. Some thoughts.
(1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a spatial probability density function could be used to identify boundaries of de facto urban areas? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? How would such "what if" applications add value to your investigation?
(2) Can you further articulate your problem statement? Which aspect of air pollution will you focus upon with your continue investigation? Have you identified a gap in the literature? How about a tentative research question?
Just keep going!
variables such as buildings and soil coverage
ok
“centralized economy” and “market-based” economy
does indeed sound interesting
creating a strong spatial concentration gradients consistent with a major contribution from home heating in the ger areas
good
land cover, road length, and ger density
good, what is ger density?
Methods that are being used to measure air pollution and the effects vary by study, but the most common type is using raster layers of satellite images.
good
burning raw coal is one of the few ways they can afford to keep themselves warm during the cold winters
also contributes to outdoor pollution, but would be good to distinguish indoor pollution as a specific topic
it is the most polluted capital city in the world
wow! a statement such as this would benefit from a citation
It's not clear to me where you problem statement resides. Please integrate your identified harms, their significance and the inherent nature of your selected topic into the introduction / development topic sections of your literature review.
Excellent annotations. You've covered a lot of ground in content here. Some thoughts.
Explicitly identify all of the different model involved with these sources. I expect several different ones were used. Can you rank them in terms of which one or two exhibits the most promise in your estimation?
State your problem. Identify the most relevant themes and topics. Describe this as a process in terms of China. Try to identify some areas in the literature that exhibit research gaps, and then begin to articulate a research question. Attenuate that question such that it aligns with your identified gap. Classify the type of "puzzle" or research question you have formulated.
Keep focusing on the data and methods, and also just keep going! Good work!
Tencent’s
interesting link to your previous source
investigate the migration from rural areas to urban areas
great
If q is a higher value, then higher ranked cities have an advantage. If q is lower, then small or medium level cities have an advantage (Deng et al. 2019). The researchers mention the advantages of the rank-size model, but also the disadvantages, which is that it only shows general trends in city scale and nothing specific. In order to combat this, the researchers split China into 4 regions and added more specific parameters (Deng et al. 2019).
excellent, sounds similar to Zipf's law
The method used was a classic rank-size model
power law / Zipf's law ?
predict and figure out the future pattern of urban sprawl through studying the current urban sprawl in China
excellent, urban sprawl is a very important theme
shrinking farmland, low levels of efficiency in land use, landscape fragmentations
good, important development themes
migration and population flow in cities of China and hopes to find patterns
was a model identified and used as part of this study?
LBS
what is LBS?
spatio-temporal location big data
good
big data is much more accurate then conventional data
interesting, good
Lai, J.
another good source
population mobility
excellent
https://www.nature.com/articles/sdata20165
good, but are you able to make this link active?
Gaughan et al. 2016
good
Tatem, A. J.
Good source. For annotation style, Gaughan et al. is typically used when the number of authors is more than 3.
Very clear research focus. Some thoughts.
(1) There might be an argument for visiting some of the traditionally considered foundations of development economics. The first one that comes to mind is Hans Singer and his Post-war Price Relations between Under-developed and Industrialized Countries (1950) and how it contributed to the Singer-Prebisch thesis. Central to this thesis is the idea that due to the terms of trade, developed countries have a moral imperative to return investment (perhaps repair would be a better choice of words) under developed countries from which they have benefitted. You could also look at Sir Arthur Lewis' Nobel Prize winning essay from 1954, Economic Development with Unlimited Supplies of Labor. Both of these sources were instrumental to the foundations of development economics and find corollaries with the current state of under developed regions of that were sources of the transatlantic slave trade. Amartya Sen also provides some insight to this historical perspective on development analysis.
(2) I also can't help but think about the American influence with regard to your topic. While these countries have lost human capital as a result of the transatlantic slave trade, which countries have in fact gained? Can you evaluate this gain somehow and is there in fact an argument in favor of repairing source contributing regions? It's a worthy justification for your investigation and goes straight to the inherent and complex nature of the problem.
(3) How can you take some of the surveys on education and apply a data science / machine learning methodology in order to produce a high resolution, spatial description of education, level of service across West Africa. I suspect many of the big steps forward with progress are going to be at the level of individual schools, throughout Ghana, throughout Nigeria, throughout Liberia and elsewhere in the ECOWAS community. What if you applied a hierarchical bayesian model to some of the survey data you found. Would it be possible to use such a machine learning approach to produce a high resolution description of educational level of service of primary and secondary schools throughout West Africa? Would such a high resolution description of all persons living throughout West Africa as well as the level of educational service they are receiving be useful to inform implementation of policy at the national, district and local level?
All around great stuff. Just keep going.
In sub-saharan Africa alone, 34 percent of primary schools have electricity, while 44 percent have basic drinking water
significant
only 64 percent of primary and 50 percent of secondary school teachers in Sub-Saharan Africa are trained
interesting
Liberia
How has Liberia fared since their civil war? Have things begun to improve?
national household surveys and learning assessments
good
The stronger the ethnic identity, the higher the education rate.
I'm wondering if he is in favor of advancing indigenous institutions as a means for promoting development. Do you think there is evidence that favors advancing indigenous institutions when compared to modern ones, especially when considered in the context of West Africa (Ghana, Nigeria etc...)
So, Obikili believes that long-term effects of the slave trade may have been impacted through ethnic groups, not necessarily by country.
good
extrapolation
ok
why West Africa has the lowest education rates in the world
good
Super. Clearly defined research focus, although a massive topic in terms of scope and scale. Some thoughts.
(1) Might not be a bad idea to revisit some of the traditional development economic works on rural to urban migration, although the push / pull factors you are identifying are much more regional or global in nature. Two good ones to look at are Todaro:
https://www-jstor-org.proxy.wm.edu/stable/1807860?seq=1#metadata_info_tab_contents
and Sir Arthur Lewis' Economic Development with Unlimited Supplies of Labour. Please let me know if you can't find a copy of his 1954 essay that earned him the Nobel Prize.
(2) I'm wondering a bit about which data science methodology is best to investigate further with regard to your research topic. Argument could be made for thinking about the application of a hierarchical bayesian model with the survey data you have identified. There are also arguments in favor or gravity type models with the CDR data you are considering.
(3) Migration patterns are such a big topic that it can be difficult to know where to start in terms of geospatial data, data science / machine learning methods / models etc…A common starting point for thinking about migration is in terms of push factors and/or pull factors, more traditionally within the context of rural to urban migration. The scope and scale of what you are investigating is of a much greater magnitude and complexity than these traditionally considered foundations from development economics. Additionally, thinking of migration of this magnitude in terms of only human movement and behavior, likely over simplifies the problem. For example, we could use CDR data to infer the origins and destinations of where people are coming from and where they wish to go as part of a transportation system model (even if it is by foot or mixed mode). I think you need something that seeks to accomplish far more than simply transport behavior and movement, because you also wish to understand the motivations behind why people choose to leave (push factors) as well as the reasons they are attracted to some location (pull factors). I think looking into agent based models as a data science method for describing, analyzing could also be a promising path forward. A simulation system, such as an ABM, may incorporate different model that address factors at the origin as well as at the destination.
Will be very interested to find out more about the direction you have selected moving forwards! Just keep going!
Bangladesh, Grameephone (GP)
excellent
The most important idea presented in this article is the promising use of mobile network operator call detail records (CDR) as a supplementary data source used to monitor and respond to migration as a result of climate change.
good
Public Use Microdata Series
PUMS is a good data source
To compare against the AMRS, the Climatic Research Unit’s (CRU) time series containing high resolution monthly precipitation along with temperature data from the NASA Modern Era-Retrospective Analysis for Research and Applications (MERRA) was used.
How did they go about doing this?
Data was collected from 9812 households in Kenya, Uganda, Nigeria, Burkina Faso and Senegal, and linked with high resolution gridded climate data from station and satellite sources.
good
Instead of focusing on “climate refugees,” resources should be used to identify populations that are at risk of becoming trapped in place, or being forced to move locally.
very good
A Digital Elevation Model was used to extract mean land slope. Secondly, the global WorldClim data set containing historical climate information was used. Lastly, GIS was used to link communities to the closest rainfall station of which data was available. The findings were different from expected, showing that both fast and slow climate change are likely to influence human migration, but it is possible that potential migrants will be trapped in place rather than displaced.
interesting, was a model used that incorporated both the household surveys as well as the geospatial climate and topology data?
household surveys
good, DHS? living standard surveys?
These populations lack the freedom of protective security, as they are unable to migrate out of climate impacted regions.
Very good, particularly your classification in terms of voluntary migration. Have you had a chance to consider push / pull factors?
Environmental Influences on Human Migration in Rural Ecuador
Nice link!
Very good work. Some thoughts.
(1) I'm wondering a bit about which dimension of health care / disease you may be focusing upon in relation to Tajikistan. Your problem statement indicates you are interested to investigate respiratory health but your annotations do not seem to provide that much information in support of this direction. Perhaps further clarify which dimension of health care you are interested to investigate as part of your literature review. A common approach is start with either infectious diseases, vector borne diseases, non-communicable diseases or psychological disorders. I think you are interested in non-communicable diseases (respiratory / pulmonary, circulatory / heart) and the factors contributing to their prevalence in Tajikistan.
(2) As you have noted, it's likely going to be difficult to find a perfectly matched source to the problem you have presented. Still, you can consider what if scenarios, in conjunction with geospatial data that is available in other locations and contexts as well as the applicable method. For example, what if the authors used a hierarchical bayesian model to spatially describe, analyze and predict outcomes in the context of a particularly selected variable (as well as the predictors). It's one option forward, I'm sure there are many others as well.
Otherwise, just keep going!
Tajikistan seems to have data that is not used effectively, or able to be synthesized and used, which is a direction that I would like to explore in the next part of this assignment.
excellent
Multiple Indicator Cluster Survey
good
compile emergency care facilities in Sub-Saharan Africa
was location of health care facility included in the analysis?
Gridded population data
good
The methodology used for analyzing the data is the institutional synergy and system approaches, pattern methods, and multidimensional comparative analysis. The authors are investigating developmental processes themselves. It seems to be a sort of meta-analysis of methods rather than a standard analysis of processes.
Is there a spatial dimension to this analysis?
distances
what are these distances measuring?
“Pattern” methods
ok interesting
survey data from routine health information systems
is this DHS data?
The main cause of premature death in Tajikistan in 2017 is lower respiratory infection. It has increased by 10% since 2007. Uzbekistan, a nearby country, has lower respiratory infection as its second most cause of premature death, yet the rate in Uzbekistan has decreased by about 30% since 2007.
interesting, I didn't know this at all
If a person is preoccupied with maintaining their health, or the health of others, it prevents them from engaging in something else. Suffering in health is a vicious cycle that prevents natural human development.
good
Great stuff. Research focus is clearly defined. Some thoughts.
(1) You have many economic and demographic elements to consider through your investigation. I think you likely want to start with the survey data and the hierarchical bayesian model. This will probably move you in the right direction of better understanding your problem in terms of scope and scale. Your issue as you progress is identifying how your topic intersects with the larger development theme you are considering (wasn't there another dimension you were investigating as well?)...Perhaps start with the survey, machine learning / hierarchical bayesian model approach and consider what data is out there that could be used in the same manner. Even if you don't find a study that is specific to your location, you can still hypothesize how data or a model from a particular study could be improved upon with an improved methodology.
(2) I also want to encourage you to explore what else is out there, beyond hierarchical bayesian models / machine learning methods. Are there any other machine learning approaches that could be useful towards spatial description, analysis and prediction of your development problem. What about deep learning approaches? Is there anything new on the horizon that presents great hope and promise towards a fuller understanding of a large, complex problem from a national to regional scale?
Other than that, just keep going!