901 Matching Annotations
  1. Apr 2020
  2. miaaao.github.io miaaao.github.io
    1. Denise Pumain,

      good source - you may find other works by this author that investigate urbanization as a complex system and cities being described in terms of their fractalization and power law distributions

    2. population census abstracts (PCAs), which describe the population of different settlements, the number of households, and additional characteristics

      good

    1. These are superb annotations. Your theme and focus has been clearly established. Topically boundaries also defined. A few thoughts.

      (1) Prioritize the methods from those that you think present the most promise to the least. Your review has captured some examples that border on both machine learning as well as spatial statistics. This is really impressive. You have also accomplished this very definitely within the context of a clearly articulated human development problem. Just keep looking for new, state of the art methods that present the most promise. Are you familiar with AI Nigeria? GRID3 Nigeria? There is really interesting stuff coming out of those groups in conjunction with Southhampton and Columbia.

      (2) Try to narrow your focus a bit. Are you focusing on the intersection of poverty reduction and food security? education? gender equality? Formally articulating your central research question promote this continued narrowing of your focus.

      Just keep going!

    2. Additionally, studies treat geographic units as independent units rather than acknowledging them as entities surrounded by other units to which they interact with.

      excellent - identifying interactions is really important. Are you taking stats?

    3. The authors suggest that this is because of the current way poverty studies are carried out which fails to see the complexity of poverty.

      development is not the only outcome that may emerge from Nigeria's highly complex and adaptive, social and economic system

    4. Gayawan, Ezra, Adebayo, & B., S. (1970, January 1). Spatial analysis of women employment status in Nigeria.

      Great source. Focusing upon gender inequalities is arguably the most significant first step to take in order to reduce poverty.

    5. geo-spatial covariates were uses to measure the vegetation index, aridity, land-surface temperature, brightness of nighttime lights, and estimated travel time to the nearest functioning water source

      excellent

    6. lack of detailed and reliable geo-referenced data inhibits researchers ability to compare levels of cleanliness and sanitation and to find patterns

      good

    7. the current way of estimating poverty is through household surveys which is costly, time consuming and is more efficient when working with a small amount of households

      excellent

    8. Over 90 million civilians, roughly half of the population, lives in extreme poverty and even more experience chronic poverty

      harms & significance - good

  3. Mar 2020
  4. 6packofribs.github.io 6packofribs.github.io
    1. This is excellent work. Some thoughts.

      (1) You have identified a wide array of potentially viable and effective paths forward. You might want to prioritize the many methods you have identified.

      (2) Can you further describe the methods in detail? How is the CDR data being used? How do gravity, radiation, and impedance models functions? What are voronoi polygons? What is state of the art for using CDR data to describe, analyze, model and then predict human movement during an infectious disease outbreak? Just keep pushing forward with your analysis in this area. Also, have a look at flowkit. There's a GitHub repository that will be useful.

      (3) I think you could continue to focus on describing the process itself. Why is it important to model human behavior using cell phone data? How does understanding human movement help us to better address an outbreak? How could this information be used in a time of crisis to expedite solving the problem? You've have captured multiple very significant ideas that go straight to inherency, but now organize and structure them in order to describe the process to better inform response.

      Just keep going!

    2. The impedance model is a parameter-free model adapted from Ohm’s law of electricity. It’s actually quite a simple and ingenious method of estimating human mobility potential.

      excellent

    3. Sallah, Kankoé, et al. “Mathematical Models for Predicting Human Mobility in the Context of Infectious Disease Spread: Introducing the Impedance Model.” International Journal of Health Geographics, vol. 16, no. 1, 2017, doi:10.1186/s12942-017-0115-7.

      great sources

    4. Mathematical modelling, while not as necessary in areas which already have an abundance of current data, has a lot of application in low-income areas with low surveillance.

      good

    5. In conclusion, the study demonstrates that there is potential in using national household data to provide a measure of treatment using a Bayesian method.

      good, are you able to further describe the method?

    6. Gelman-Rubin and Raftery-Lewis test was used, with the Raftery-Lewis test taking a minimum of 55,318 iterations to achieve an accuracy of 0.0005 at a coverage of 0.999.

      good

    7. Alegana, V. A., Wright, J., Pezzulo, C., Tatem, A. J., & Atkinson, P. M. (2017). Treatment-seeking behaviour in low- and middle-income countries estimated using a Bayesian model. BMC Medical Research Methodology, 17(1). doi: 10.1186/s12874-017-0346-0

      good source

    1. OK good. Some thoughts.

      (1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? How would such "what if" applications add value to your investigation?

      (2) Can you further articulate your problem statement? Which aspect of development will you focus upon with your continued investigation? Have you identified a gap in the literature? How about a tentative research question?

      Just keep going!

    1. Super. Some thoughts.

      (1) You could probably go two different directions with this investigation. I think a lot of the current research regarding malaria and vector borne diseases revolve around using CDR data with gravity models, radiation models and impedance models. You may also find an agent-based model of transportation systems and human movement that incorporates predictions of malaria prevalence. It could also be possible to focus on the natural environmental raster data (precipitation, water, climate factors) as predictors in a machine learning method, which is more of the direction you have taken with these annotations. While spread of the disease can be predicted by modeling transport behavior, at its origin, most countries that have eradicated malaria and similar types of vector borne diseases have done so through public works efforts that seek out to spray insecticides that kill mosquito larvae in standing water as breeding areas (is this the case in Cambodia?). If it is possible to more accurately (spatially) and then effectively identify these source locations, potentially there could be great progress towards eradicating the disease.

      (2) I have also seen a few studies that seek to model the mosquito population itself. I think the limitation of that approach has been largely computational due to the sheer numbers, the scale of the insect and the environmental and natural factors associated. Maybe that has recently changed.

      (3) I still wonder if climate change is increasing malaria disease prevalence. Are the mosquitoes and parasites manifesting in locations across Africa where they previously hadn't been observed? Are they more prevalent in China? In other locations throughout the globe?

      (4) Are there any other machine learning or data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation?

      Just keep going!

    2. The researchers were also able to determine low-low, high-low, low-high, and high-high areas. Low-low areas are defined as areas with areas with low resource density surrounded by areas with low resources, and high-high clusters are areas with high resource per capita surrounded by areas with high per capita, and so on

      interesting

    3. Cox

      I think this is the same guy who is famous for using STATA, which for a long time was the biggest competitor to R as statistical software (I used STATA for a long time before switching to R)

    4. This leads to farmers attempting to micromanage the outbreak by attempting to disinfect their local area or sell off the supposedly infected poultry to other villages to not lose profit.

      interesting, goes to inherency

    5. Dengue spatio-temporal diffusion patterns and hotspot detection are crucial in tracking and seeing patterns in dengue cases.

      excellent source and annotation, a bit dated but still useful and effective, especially as a direction marker

    6. Vector-borne illnesses like malaria are also influenced by a wealth of other factors, including rainfall, temperature, or changes in agricultural practices.

      good

    1. Excellent work. Some thoughts:

      (1) Narrow your focus in terms of the problem you are seeking to address. Investing a bit more time with regard to your problem statement will help you to narrow your focus and further articulate your research question.

      (2) Kriging as a data science method is a good path forward. It will be interesting to see what you come up with in terms of how this spatial statistics methodology is reinforced with machine learning. Use of point process models (Baddeley) is another area of active research that might be find usefulness in the context of your work. Spatial random forest is a kind of next generation spatial statistic methodology that synthesizes machine learning.

      (3) Are there any other machine learning or data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation?

      Just keep going!

    2. Kriging interpolation method

      very good, do you think a machine learning approach might present additional value? perhaps random forest, or spatial random forest?

    3. A mere 1 standard deviation increase in NO2 and coal-related pollution has created a 10% and 11% increase in negative perception of the issue.

      good interpretation

    4. spatial distribution of forestry, the largest land loss took place in An Giang where 11,713 ha of was lost over the course of 2 and a half decades

      good spatial description

    1. Excellent work. Some thoughts:

      (1) Narrow your focus in terms of the problem you are seeking to address. Degradation of water quality will likely involve questions that address either non-point or point sources of pollution. Which ones are the most significant offenders? You could also narrow your focus in terms of water use. Are you interested in potable water? Urban water use? Agricultural water use? Investing a bit more time with regard to your problem statement will help you to narrow your focus and further articulate your research question.

      (2) Kriging as a data science method is a good path forward. It will be interesting to see what you come up with in terms of how this spatial statistics methodology is reinforced with machine learning. Use of point process models Baddeley) is another area of active research that might be find usefulness in the context of your work.

      (3) A superb use of a webpage to produce your annotations. Please confirm the links on your index page are functional.

    1. Excellent thematic focus. Clearly defined topic. A few thoughts.

      (1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? Perhaps looking at Utazi, Alegana and some other authors on WorldPop and flow minder might present some options. You do have the Alegana paper from 2015 here, which is good. I think hierarchical bayesian model present the most promise for your investigation. How would such "what if" applications add value to your investigation?

      I'm going to send you a previously students literature review on this topic to support your work.

      (2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?

      Just keep going?

    2. They used spatio-temporal models, in order to collect specific population counts from a given area at a given time. They were able to calculate the error of these models to be very small. They also used model-based geostatistics to estimate population.

      good, are you able to further describe their model?

    3. Data Science could increase the quality of life for many people, and give women and children especially the freedoms they deserve.

      Excellent problem statement - one comment, GIS typically refers to software that ultimately enables users to advance spatial description, analysis and in some cases also models and then prediction (forecasts). It is also typically very effective at communicating spatial data. Spatial data is by nature very complex, thus for a long time, GIS has simplified the complex nature inherent to this type of analysis. At its core, you can think of GIS techniques translating as spatial description, analysis, modeling and prediction -- or perhaps more simply put, spatial statistics. Adding the human or environmental dimension, simply further defines the domain of this spatial inquiry by adding a geo- prefix to the spatial, thus geospatial statistics.

    1. OK good. Two interesting possible directions. Some thoughts.

      (1) Are you able to further describe one or more of the machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households, persons and accessibility to primary, secondary or higher education services? Perhaps a hierarchical bayesian model (you mentioned two different ones in your annotation) could be used to describe demographic and economic attributes of persons and households? How would such "what if" applications add value to your investigation?

      (2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?

      Just keep going!

    2. Once the authors decided on that, they took data from Vietnam’s 2006 national census and combined it with raster data from remote sensing and GIS data on administrative boundaries and completed hierarchical cluster analysis to figure out what areas of Vietnam are considered to be one of the four levels listed above. Once that was completed, the data was transferred onto a map and checked for accuracy through the use of round truthing, remote sensing analysis, and road network analysis.

      good

    3. Bayesian model-based geostatistics along with high resolution gridded spatial covariates, both of which were applied to GPS-located household survey data that recorded information on poverty and was retrieved from DHS and/or LSMS programs

      excellent

    4. only about thirteen percent of them were actually accepted and received the chance to go

      interesting, I hadn't seen this study before -- a bit dated, but good for your thematic / conceptual focus

    1. OK good. Nice thematic focus and problem statement. Likely would benefit from additional emphasis on data science methods. Some thoughts.

      (1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation?

      You might find a few here.

      https://scholar.google.com/citations?user=3qWNbcEAAAAJ&hl=en

      Even if the study is not exactly Rio, you can still think about how to apply a method in order to surmise how it would advance understanding.

      (2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?

      Just keep going!

    1. Great stuff. Some thoughts.

      (1) You could probably go two different directions with this investigation. I think a lot of the current research regarding malaria and vector borne diseases revolve around using CDR data with gravity models, radiation models and impedance models. You may also find an agent-based model of transportation systems and human movement that incorporates predictions of malaria prevalence. It could also be possible to focus on the natural environmental raster data (precipitation, water, climate factors) as predictors in a machine learning method. While spread of the disease can be predicted by modeling transport behavior, at its origin, most countries that have eradicated malaria and similar types of vector borne diseases have done so through public works efforts that seek out to spray insecticides that kill mosquito larvae in standing water as breeding areas. If it is possible to more accurately (spatially) and then effectively identify these source locations, potentially there could be great progress towards eradicating the disease.

      (2) I have also seen a few studies that seek to model the mosquito population itself. I think the limitation of that approach has been largely computational due to the sheer numbers, the scale of the insect and the environmental and natural factors associated. Maybe that has recently changed.

      (3) I still wonder if climate change is increasing malaria disease prevalence. Are the mosquitoes and parasites manifesting in locations across Africa where they previously hadn't been observed? Are they more prevalent in China? In other locations throughout the globe?

      Just keep going!

    2. Geospatial datasets used include health districts of Namibia, CDR data, and various spatial covariate datasets representing rainfall, temperature, elevation, distance to water, etc.

      good

    3. This was accomplished thorough raster files with 5 x 5 grid cells of malaria prevalence. On top, higher resolution of 1 x 1 grid cells were placed that represented population density.

      good

    1. Excellent work. Some thoughts.

      (1) You have identified a wide array of potentially viable and effective paths forward. Some of the methods are based more in traditional statistics (poisson, SIR, multinomial logistic regression) but still present promise, especially if they are leveraged with a learning algorithm (MNLR) or possibly a process model (poisson). I'm also wondering about SIR models for simulating an infectious outbreak (using ODEs or an agent-based model).

      (2) You might also want to narrow your focus. Taking into consideration where most research is currently "placing its bets" is not a bad way to asses how to best invest your time moving forward. I suggest continuing to focus on CDR data, gravity models (and their variants), radiation models, impedance models, use of voronoi polygons and possibly measures such as radius of gyration (although this may be more effective in the context of a response to a natural disaster). What is state of the art for using CDR data to describe, analyze, model and then predict human movement during an infectious disease outbreak.

      (3) I think you could continue to focus on describing the process itself. Why is it important to model human behavior using cell phone data? How does understanding human movement help us to better address an outbreak? How could this information be used in a time of crisis to expedite solving the problem?

      Just keep going!

    2. All of the data regarding the land and terrain was obtained using satellite imagery but was soon converted into various indexes (depending on which variable used).

      ok, what kind of assessment do you give this source?

    3. The data shows that the accuracy of using a random forest model was slightly lower than other models such as a logistic regression model.

      excellent, I wonder why?

    4. spatial autoregressive modelling approach which was used to compare the results (number of observed Ebola cases) seen from the spatial and non-spatial disparity for each covariate

      excellent

    5. There have been studies done regarding an extremely effective vaccine used for the virus, but the vaccine was not introduced to many African countries until after the large outbreak

      interesting

    1. Good work. Some thoughts:

      (1) Further define your focus in terms of the problem you are seeking to address. Will you investigate water use / scarcity in terms of residential demand? agricultural demand? Are you also thinking about water degradation? Are you mostly interested in potable water? Urban water use? Investing a bit more time with regard to your problem statement will help you to narrow your focus and further articulate your research question.

      (2) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households?

      (3) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?

      Just keep going!

    2. groundwork for the rest of the calculations

      interesting, I'm wondering if there are other methods that may present more promise in describing, analyzing and predicting water use / water demand?

    3. What physical changes can Palestine make in the next 7 years in order to avoid a total water crisis?

      good, beginning to sound like a general research question

    4. More specifically, it can be classified as violating 3 of Sen’s 5 “instrumental freedoms’’ including political and economic freedoms, as well as “transparency guarantees.”

      good for thematic and conceptual purposes

  5. yunkichristian.github.io yunkichristian.github.io
    1. Good work. Some thoughts.

      (1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a spatial probability density function could be used to identify boundaries of de facto urban areas? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation? The last two sources seem to introduce these types of methods. Perhaps you could further articulate details from each of these sources?

      (2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?

      Just keep going!

  6. larryfeng01.github.io larryfeng01.github.io
    1. OK good. Excellent thematic focus and problem statement. Some thoughts.

      (1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a spatial probability density function could be used to identify boundaries of de facto urban areas? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? What about an agent-based model? How would such "what if" applications add value to your investigation?

      (2) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?

      Just keep going!

    2. The author states that by “integrating spatial measures” with data, access to services, water, transportation routes, etc., we could lead ourselves to a more complete understanding of the human development process in Ethiopia

      Excellent - is this a spatial probability density function (sometimes aka kernel density estimation)?

    1. OK good. Some thoughts.

      (1) Are you able to further identify the data and the variables used with some of these investigations related to nature disaster response in the Caribbean? Are you able to identify names of methods used? names of models used? Some of the methods and models that will likely surface are as follows.

      • Voronoi polygons - what are these? why are they relevant to CDR data?
      • radius of gyration - what is this? what is it measuring? gravity models and other variants? what does a gravity model describe and predict? How is CDR data used to estimate the origins and destinations of all persons (OD matrix) and how is this OD dataset input into a gravity model to describe movement? Is it also useful to estimate future decisions and movements across an area?

      (2) Further specify your topic. Is there a particular type of natural disaster / event that you are investigating? Hurricane? Earthquake? Disease?

      (3) Are you able to identify a gap in the literature? Are you able to formulate a research question that aligns with that gap? Have you classified that research question as a type of "puzzle" that you are attempting to solve?

      Just keep going!

    2. data shows how residents of Puerto Rico moved around before and after the disaster.

      good - but I think assuming the user opted into sharing their location data through twitter?

    3. using cell phone data to track movement patterns of individuals to see where they went prior to the earthquake and the areas they went post earthquake. The data shows that dispersion post earthquake was similar to travel before the earthquake

      good. Did they use a model in their research?

    1. OK good. Some thoughts.

      (1) Are you able to identify some machine learning or other data science methods that IF used in the context of your research and location would present promise towards better understanding your problem? Perhaps a random forest approach from world pop could be used to spatially locate households and persons? Perhaps a spatial probability density function could be used to identify boundaries of de facto urban areas? Perhaps a hierarchical bayesian model (also from world pop) could be used to describe demographic and economic attributes of persons and households? How would such "what if" applications add value to your investigation?

      (2) Can you further articulate your problem statement? Which aspect of air pollution will you focus upon with your continue investigation? Have you identified a gap in the literature? How about a tentative research question?

      Just keep going!

    2. Methods that are being used to measure air pollution and the effects vary by study, but the most common type is using raster layers of satellite images.

      good

    3. burning raw coal is one of the few ways they can afford to keep themselves warm during the cold winters

      also contributes to outdoor pollution, but would be good to distinguish indoor pollution as a specific topic

    1. It's not clear to me where you problem statement resides. Please integrate your identified harms, their significance and the inherent nature of your selected topic into the introduction / development topic sections of your literature review.

      Excellent annotations. You've covered a lot of ground in content here. Some thoughts.

      1. Explicitly identify all of the different model involved with these sources. I expect several different ones were used. Can you rank them in terms of which one or two exhibits the most promise in your estimation?

      2. State your problem. Identify the most relevant themes and topics. Describe this as a process in terms of China. Try to identify some areas in the literature that exhibit research gaps, and then begin to articulate a research question. Attenuate that question such that it aligns with your identified gap. Classify the type of "puzzle" or research question you have formulated.

      Keep focusing on the data and methods, and also just keep going! Good work!

    2. If q is a higher value, then higher ranked cities have an advantage. If q is lower, then small or medium level cities have an advantage (Deng et al. 2019). The researchers mention the advantages of the rank-size model, but also the disadvantages, which is that it only shows general trends in city scale and nothing specific. In order to combat this, the researchers split China into 4 regions and added more specific parameters (Deng et al. 2019).

      excellent, sounds similar to Zipf's law

    3. predict and figure out the future pattern of urban sprawl through studying the current urban sprawl in China

      excellent, urban sprawl is a very important theme

  7. sydneytaylr.github.io sydneytaylr.github.io
    1. Very clear research focus. Some thoughts.

      (1) There might be an argument for visiting some of the traditionally considered foundations of development economics. The first one that comes to mind is Hans Singer and his Post-war Price Relations between Under-developed and Industrialized Countries (1950) and how it contributed to the Singer-Prebisch thesis. Central to this thesis is the idea that due to the terms of trade, developed countries have a moral imperative to return investment (perhaps repair would be a better choice of words) under developed countries from which they have benefitted. You could also look at Sir Arthur Lewis' Nobel Prize winning essay from 1954, Economic Development with Unlimited Supplies of Labor. Both of these sources were instrumental to the foundations of development economics and find corollaries with the current state of under developed regions of that were sources of the transatlantic slave trade. Amartya Sen also provides some insight to this historical perspective on development analysis.

      (2) I also can't help but think about the American influence with regard to your topic. While these countries have lost human capital as a result of the transatlantic slave trade, which countries have in fact gained? Can you evaluate this gain somehow and is there in fact an argument in favor of repairing source contributing regions? It's a worthy justification for your investigation and goes straight to the inherent and complex nature of the problem.

      (3) How can you take some of the surveys on education and apply a data science / machine learning methodology in order to produce a high resolution, spatial description of education, level of service across West Africa. I suspect many of the big steps forward with progress are going to be at the level of individual schools, throughout Ghana, throughout Nigeria, throughout Liberia and elsewhere in the ECOWAS community. What if you applied a hierarchical bayesian model to some of the survey data you found. Would it be possible to use such a machine learning approach to produce a high resolution description of educational level of service of primary and secondary schools throughout West Africa? Would such a high resolution description of all persons living throughout West Africa as well as the level of educational service they are receiving be useful to inform implementation of policy at the national, district and local level?

      All around great stuff. Just keep going.

    2. The stronger the ethnic identity, the higher the education rate.

      I'm wondering if he is in favor of advancing indigenous institutions as a means for promoting development. Do you think there is evidence that favors advancing indigenous institutions when compared to modern ones, especially when considered in the context of West Africa (Ghana, Nigeria etc...)

    1. Super. Clearly defined research focus, although a massive topic in terms of scope and scale. Some thoughts.

      (1) Might not be a bad idea to revisit some of the traditional development economic works on rural to urban migration, although the push / pull factors you are identifying are much more regional or global in nature. Two good ones to look at are Todaro:

      https://www-jstor-org.proxy.wm.edu/stable/1807860?seq=1#metadata_info_tab_contents

      and Sir Arthur Lewis' Economic Development with Unlimited Supplies of Labour. Please let me know if you can't find a copy of his 1954 essay that earned him the Nobel Prize.

      (2) I'm wondering a bit about which data science methodology is best to investigate further with regard to your research topic. Argument could be made for thinking about the application of a hierarchical bayesian model with the survey data you have identified. There are also arguments in favor or gravity type models with the CDR data you are considering.

      (3) Migration patterns are such a big topic that it can be difficult to know where to start in terms of geospatial data, data science / machine learning methods / models etc…A common starting point for thinking about migration is in terms of push factors and/or pull factors, more traditionally within the context of rural to urban migration. The scope and scale of what you are investigating is of a much greater magnitude and complexity than these traditionally considered foundations from development economics. Additionally, thinking of migration of this magnitude in terms of only human movement and behavior, likely over simplifies the problem. For example, we could use CDR data to infer the origins and destinations of where people are coming from and where they wish to go as part of a transportation system model (even if it is by foot or mixed mode). I think you need something that seeks to accomplish far more than simply transport behavior and movement, because you also wish to understand the motivations behind why people choose to leave (push factors) as well as the reasons they are attracted to some location (pull factors). I think looking into agent based models as a data science method for describing, analyzing could also be a promising path forward. A simulation system, such as an ABM, may incorporate different model that address factors at the origin as well as at the destination.

      Will be very interested to find out more about the direction you have selected moving forwards! Just keep going!

    2. The most important idea presented in this article is the promising use of mobile network operator call detail records (CDR) as a supplementary data source used to monitor and respond to migration as a result of climate change.

      good

    3. To compare against the AMRS, the Climatic Research Unit’s (CRU) time series containing high resolution monthly precipitation along with temperature data from the NASA Modern Era-Retrospective Analysis for Research and Applications (MERRA) was used.

      How did they go about doing this?

    4. Data was collected from 9812 households in Kenya, Uganda, Nigeria, Burkina Faso and Senegal, and linked with high resolution gridded climate data from station and satellite sources.

      good

    5. Instead of focusing on “climate refugees,” resources should be used to identify populations that are at risk of becoming trapped in place, or being forced to move locally.

      very good

    6. A Digital Elevation Model was used to extract mean land slope. Secondly, the global WorldClim data set containing historical climate information was used. Lastly, GIS was used to link communities to the closest rainfall station of which data was available. The findings were different from expected, showing that both fast and slow climate change are likely to influence human migration, but it is possible that potential migrants will be trapped in place rather than displaced.

      interesting, was a model used that incorporated both the household surveys as well as the geospatial climate and topology data?

    7. These populations lack the freedom of protective security, as they are unable to migrate out of climate impacted regions.

      Very good, particularly your classification in terms of voluntary migration. Have you had a chance to consider push / pull factors?

    1. Very good work. Some thoughts.

      (1) I'm wondering a bit about which dimension of health care / disease you may be focusing upon in relation to Tajikistan. Your problem statement indicates you are interested to investigate respiratory health but your annotations do not seem to provide that much information in support of this direction. Perhaps further clarify which dimension of health care you are interested to investigate as part of your literature review. A common approach is start with either infectious diseases, vector borne diseases, non-communicable diseases or psychological disorders. I think you are interested in non-communicable diseases (respiratory / pulmonary, circulatory / heart) and the factors contributing to their prevalence in Tajikistan.

      (2) As you have noted, it's likely going to be difficult to find a perfectly matched source to the problem you have presented. Still, you can consider what if scenarios, in conjunction with geospatial data that is available in other locations and contexts as well as the applicable method. For example, what if the authors used a hierarchical bayesian model to spatially describe, analyze and predict outcomes in the context of a particularly selected variable (as well as the predictors). It's one option forward, I'm sure there are many others as well.

      Otherwise, just keep going!

    2. Tajikistan seems to have data that is not used effectively, or able to be synthesized and used, which is a direction that I would like to explore in the next part of this assignment.

      excellent

    3. The methodology used for analyzing the data is the institutional synergy and system approaches, pattern methods, and multidimensional comparative analysis. The authors are investigating developmental processes themselves. It seems to be a sort of meta-analysis of methods rather than a standard analysis of processes.

      Is there a spatial dimension to this analysis?

    4. The main cause of premature death in Tajikistan in 2017 is lower respiratory infection. It has increased by 10% since 2007. Uzbekistan, a nearby country, has lower respiratory infection as its second most cause of premature death, yet the rate in Uzbekistan has decreased by about 30% since 2007.

      interesting, I didn't know this at all

    5. If a person is preoccupied with maintaining their health, or the health of others, it prevents them from engaging in something else. Suffering in health is a vicious cycle that prevents natural human development.

      good

    1. Great stuff. Research focus is clearly defined. Some thoughts.

      (1) You have many economic and demographic elements to consider through your investigation. I think you likely want to start with the survey data and the hierarchical bayesian model. This will probably move you in the right direction of better understanding your problem in terms of scope and scale. Your issue as you progress is identifying how your topic intersects with the larger development theme you are considering (wasn't there another dimension you were investigating as well?)...Perhaps start with the survey, machine learning / hierarchical bayesian model approach and consider what data is out there that could be used in the same manner. Even if you don't find a study that is specific to your location, you can still hypothesize how data or a model from a particular study could be improved upon with an improved methodology.

      (2) I also want to encourage you to explore what else is out there, beyond hierarchical bayesian models / machine learning methods. Are there any other machine learning approaches that could be useful towards spatial description, analysis and prediction of your development problem. What about deep learning approaches? Is there anything new on the horizon that presents great hope and promise towards a fuller understanding of a large, complex problem from a national to regional scale?

      Other than that, just keep going!