One of the most popular statistics to use to determine sparsity in realized networks is the network density, but there are many others that have their own advantages [7], [8].
delete?
infinite
doesn't need to be infinite, I don't think
$\vec{x}^{(n)}$, this quantity could be written: $\mathbb{x}$
why does 'x' look different
ip:
no . after Pr
element $x_i^{(n)}$ is a random
why superscript
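Possible fix, as a LaTeX sketch (the exact convention here is an assumption, not the book's): keep the arrow for the vector and reserve the parenthesized superscript for the observation index:

    \vec{x}^{(n)} =
      \begin{pmatrix} x_1^{(n)} \\ \vdots \\ x_d^{(n)} \end{pmatrix},
    \qquad
    x_i^{(n)} \ \text{is element $i$ of observation $n$}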
9.3.1.3. The algorithmic implications
give example with a big Matvec Op
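A sketch of what that example could look like (scipy.sparse is an assumption; the size is shrunk from the text's 200,000 so it runs anywhere):

    import numpy as np
    from scipy import sparse

    n = 20_000  # shrunk from the 200,000 in the text
    # ~40k stored entries instead of 4e8 dense cells
    X = sparse.random(n, n, density=1e-4, format="csr", random_state=0)
    v = np.ones(n)

    y = X @ v  # CSR matvec: work scales with the stored non-zeros,
               # not with all n*n cells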
Unfortunately, Fisher’s exact test has a slight caveat: it can be extremely computationally intensive, especially when the number of data observations we have is really big (in this case, 200,000, and it could be even bigger).
no
SNP 1, alternative base T
SNP 300M
Let’s assume that we have a small task, where for each row in the matrix $X$, we want to compute the row-wise sum. Stated another way, for a given row $i$, the quantity that you want to compute is $\sum_{j=1}^{m} x_{ij}$. If you ignore sparsity altogether, you can do this operation pretty easily: there are $n$ rows, and $m$ terms that you need to add together for each row, which means that you will have $n \cdot m$ total operations to perform (for each of $n$ rows, perform an addition involving $m$ terms).
no m
If the rows can be sparse, the columns could be too; let’s assume that we have a matrix where $m'$ of the columns are not sparse. Following a similar approach to the above, if we had a list $Y$ with $m'$ elements telling us which columns were not sparse, we could just store the $m'$ non-sparse columns (each of which has $n$ rows), and then the list of the $m'$ non-sparse column indices. Like above, we can store this information with $64 \cdot (n \cdot m' + m' + 1)$ bits.
necessary
Let’s say that of these $n$ rows, we know ahead of time that a lot of the rows are sparse. By “row sparse”, what we mean is that $x_{ij} = 0$ for every column $j$ of each sparse row $i$. Let’s assume that of the $n$ total rows, only $n'$ are not sparse. We could, for instance, store the non-sparse rows in a little set $\mathcal{X}$ which has $n'$ elements telling us which rows are not sparse. For these non-sparse rows, we store all $m$ pieces of column-wise information, but for the sparse rows, we just ignore them entirely. To store this entire matrix, we will need $64 \cdot (n' \cdot m)$ bits (64 bits for each entry of a non-sparse row), plus $64 \cdot n'$ bits (to store each element of $\mathcal{X}$), plus $64$ bits (to store the total number of rows that the matrix has), for a total of $64 \cdot (n' \cdot m + n' + 1)$ bits.
matrix sparsity
the rows are sparse. By “row sparse”, what we mean is that $x_{ij} = 0$ for all of these sparse rows $i$.
these are different
but a common cutoff is if the number of non-zero elements is at most the number of rows or columns.
don't think so
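To make the storage argument concrete, a small sketch (scipy's CSR also stores index arrays, which the bit counts above ignore):

    import numpy as np
    from scipy import sparse

    n, m, n_prime = 1000, 1000, 10
    X = np.zeros((n, m))
    X[:n_prime] = 1.0                     # only n' non-sparse rows

    dense_bits = X.size * 64              # 64 * n * m
    row_scheme_bits = 64 * (n_prime * m + n_prime + 1)
    csr = sparse.csr_matrix(X)            # stores values plus indices
    print(dense_bits, row_scheme_bits, csr.data.size * 64)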
L’Hopital’s
I hope not
Examples
problems in NML:
1 para per ch. 4 thing, but also re-read HOML and check if any are easily portable (e.g., too small a network, or too dense a network)
intuitively
tie back to assumptions
allows
assumes a particular form of
attributes
edge node network multi-network
mentally
alien
Networks with cross-network attributes
multiple networks with node attributes and/or labels
For
give AlphaFold example too
with more than one element
new sentence
for it to be a meaningful network, there must be multiple nodes and edges
usually
is defined by a
whether or not the approach can be used in isolation from a statistical model (non-model based or model-based network learning systems).
add a paragraph about edge vs node vs community vs network
and
vs
As the internet became widespread and coding tools became easier to use – Python became prevalent in machine learning, for instance, and cloud computing came into its own with Amazon’s AWS and Microsoft’s Azure –
delete
cloud computing came into its own with Amazon’s AWS and Microsoft’s Azure –
remove
One crucially influential application for networks came in 1996, when a graduate student at Stanford named Larry Page created the PageRank algorithm. The idea was that websites on the internet (which, in 1996, had barely formed) could be ordered into a hierarchy by “link popularity”: a web page would rank higher the more links there were to it. Larry Page and his friend Sergey Brin realized that PageRank could be used to create a search engine, and so they used the PageRank algorithm to found a small web-search company they called Google.
this paragraph is redundant
machine
refer back to venn diagram
Fig
'network population' --> 'network population assumption'
'network sample = data'
'network machine learning' <-- 'learn about the network sample'
to the right, is 'guess about some property of network population'
who could potentially have the mental illness
psychological property, or skill
network
special cases
1.2.2.3. We might errorfully observe the networks¶
goes first
, and although this book doesn’t focus on GNNs specifically, it does give you the fundamental ideas that you can build off of to understand them.
. This book provides the basic foundational concepts and intuition required to understand how, when, and why GNNs, or any other network machine learning tool, works.
organized
can be thought of as
Broadly
replace ML with 'statistical learning'
add pointer to ML which is the overlap of SL + DS
add pointer graph theory = overlap of NS + DS
Dr
isn't he a section contributor?
independence
hypothesis
Microsoft
DARPA program manager
ericwb95 - at - gmail - dot - com
use your neurodata email address. eric@neurodata.io
ask Jong
Doksum
add diversity to recommendations
texts
others
would be
is
we think a reasonable
our favorite
learning
add bullets to appendix
which
that
unfortunately
remove
Machine Learning
decapitalize
easy to use
hyphenate?
everything unique
overclaim
maybe mention a Chinese/Indian one
nearly
over
the development of machine learning strategies for data that is a network.
machine learning for network-valued data
For
a lot of 'for instances' here
have
choose
We don’t really like that word
we like it, there is a downside
machine learning
and data science
and each column represented the length and biological sex (male or female) of the lobster
just 'sex'
a piece of
some
Wikipedia
fix to say what wiki says
Learning
make these pages auto-generate a ToC
Hands-on Network Machine Learning with Scikit-Learn and Graspologic
list authors
SignalSubgraph
check that it deals with ties properly
10.2.3.2. Classification with Bayes Plugin Classifier
Graph Classification
10.2.3.1. Bayes Plugin Classifier (Statistical Intuition)
appendix
humans
humans
astronauts
martians
Can we come up with a signal subnetwork classifier?
this means find the subnets that differ
astronauts
the descendants of the astronauts are astronauts, are they?
lobes
did we change that? I thought we were going with sensory modalities
Estimation
consolidate bootstrap stuff, maybe in appendix
above
ensure the result is still in (0,1)
There will be the same number of adjacency matrices as there are time points, since our network will be changing over time.
confusing
implement
run
VNSGM
introduce acronym earlier
8.4
Jovo didn't do this yet
-
show match ratio here too
Unshuffling
Matching
$\text{match ratio}(P, P_u)$
update equation
match_ratio
put in graspologic
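Until it lands in graspologic, a sketch of what the helper might look like (the name match_ratio and the permutation-vector representation are assumptions):

    import numpy as np

    def match_ratio(perm_true, perm_est):
        # fraction of vertices the matching got exactly right
        perm_true = np.asarray(perm_true)
        perm_est = np.asarray(perm_est)
        return np.mean(perm_true == perm_est)

    print(match_ratio([0, 1, 2, 3], [0, 1, 3, 2]))  # 0.5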
$PB$
transpose
reorder
not really
{0, 1, 2, 3}
I don't think this example works, because there are multiple permutations that yield 0
The
/linebreak
If we consider the worst possible case (every edge in $A$ does not exist in $B$):
$$A = \begin{pmatrix} 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix}, \quad B = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad A - B = \begin{pmatrix} 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix}, \quad \|A - B\|_F^2 = 6$$
seems unnecessary
$$A = \begin{pmatrix} 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix}, \quad B = \begin{pmatrix} 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix}, \quad A - B = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad \|A - B\|_F^2 = 0$$
this seems unnecessary
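If the worked cases stay, both could collapse into one code sketch (numpy only; the permutation is given as an index vector, and applying $PBP^\top$ is just a simultaneous re-ordering of B's rows and columns):

    import numpy as np

    def match_objective(A, B, perm):
        # squared Frobenius norm ||A - P B P^T||_F^2
        B_perm = B[np.ix_(perm, perm)]
        return np.sum((A - B_perm) ** 2)

    A = np.array([[0, 1, 1],
                  [1, 0, 1],
                  [1, 1, 0]])
    B = np.zeros((3, 3), dtype=int)
    print(match_objective(A, B, [0, 1, 2]))  # 6, the worst case above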
package
only sklearn, scipy, graspologic
stochastic_block_test
put this in graspologic
familywise error rate
previously FWER
𝐵(
need period
special
no
This
and we don't need matched!!!!
multipletests
does that work?
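It should; statsmodels' multipletests supports Holm. A minimal check (p-values made up):

    import numpy as np
    from statsmodels.stats.multitest import multipletests

    pvals = np.array([0.001, 0.009, 0.04, 0.12])  # illustrative
    reject, pvals_adj, _, _ = multipletests(pvals, alpha=0.05,
                                            method="holm")
    print(reject, pvals_adj)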
$a \neq b$
not nested
assumptions
guess
Let’s formalize this situation a little bit more. We have the following three hypotheses: $H_0: p_1 = p_2 = p_3 = a$, against $H_1: p_1 = p_2 = a$ but $p_3 = c$. Finally, we have $H_2: p_1 = a$, $p_2 = b$, and $p_3 = c$. The hypothesis $H$ is nested in the hypothesis $H'$ if whenever $H$ is true, $H'$ is also true. In this sense, the hypothesis $H'$ is said to contain the hypothesis $H$. Let’s consider $H_0$ and $H_1$, for instance. Notice that if $H_0$ is true, then $p_1 = p_2 = p_3 = a$. However, $H_1$ is also true, since $p_1 = p_2 = a$, and $p_3 = c$ can also be set equal to $p_1$ and $p_2$ if $c = a$. A sequence of hypotheses $H_0, H_1, \ldots, H_n$ is called sequentially nested if $H_0$ is nested in $H_1$, which is nested in $H_2$, and so on, up to $H_{n-1}$ being nested in $H_n$. Note that the sequence of hypotheses that we presented for our three-coin example is sequentially nested. We already saw that $H_0$ was nested in $H_1$. Now, let’s compare $H_2$ to $H_1$. Note that if $a = b$, then $p_1 = p_2$ and $p_3 = c$, exactly as in $H_1$, so $H_1$ is nested in $H_2$. Therefore, since $H_0$ is nested in $H_1$ and $H_1$ is nested in $H_2$, the sequence $H_0$, $H_1$, $H_2$ is sequentially nested.
dense.
draw a diagram
samples with which we are presented
data
and
by
presenting
selecting among
describe
may describe
=
\neq
faithful
accurate, veridical,
Pretty exciting, huh?
this pvalue is not valid
see appendix for a robust approach that has higher power for weighted networks.
overcoming
appendix
. Unfortunately, if the data is not well-summarized by a normal distribution, the $t$-test tends to be a fairly poor choice for hypothesis testing.
not quite right
8.2.2.2.2. Weighted Networks
appendix
,
no space after comma
below plot
weird formatting
8.2. Testing for Differences between Groups of Edges
between known groups of edges
the
same here
the number of adjacencies in cluster one with an adjacency of zero
the # of zero-valued adjacencies
8.2.2.1. Hypothesis Testing with coin flips
these sections all go in appendix
alternative
null, as opposed to the alternative
indicates
I don't think they indicate anything
they assert
RDPG
not true. GRDPG does
8.2.1. The Structured Independent Edge Model is parametrized by a Cluster-Assignment Matrix and a probability vector
this is a model, so goes in ch. 5
higher chance two students are friends if they go to the same school than if they go to two different schools.
RDPG must find this.
so, use GRDPG or a different model/hypothesis
resort
re-sort
the
remove word
8.1.1.2. Evaluating
the interesting thing for k-means, silhouette, ARI, etc. is showing them in a graph, and showing when they get it wrong.
and then showing AutoGMM gets it right.
8.1.1.1
non-graph things go in appendix, including:
- k-means
- silhouette score
- ARI
- confusion matrices
heatmap
adjacency matrix
what if your true labels are disproportionate
it doesn't normalize for chance.
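A toy check of the chance-normalization point with sklearn (labels made up):

    from sklearn.metrics import rand_score, adjusted_rand_score

    true = [0] * 90 + [1] * 10   # disproportionate true labels
    pred = [0] * 100             # degenerate clustering: one big cluster

    print(rand_score(true, pred))           # ~0.82, deceptively high
    print(adjusted_rand_score(true, pred))  # 0.0 once corrected for chance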
You
add a section on graspologic's thingy.
that may require updating graspologic documentation
Temporary cluster assignments
Find closest center for each point
Centers from previous iteration
Compute all distances to center
3 step
2
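Agreed it's two alternating steps; a minimal numpy sketch of one iteration (assumes no cluster goes empty):

    import numpy as np

    def kmeans_step(X, centers):
        # step 1: temporary cluster assignments -- closest center per point
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # step 2: recompute each center as the mean of its assigned points
        centers = np.array([X[labels == k].mean(axis=0)
                            for k in range(centers.shape[0])])
        return labels, centers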
smack dab
approximately
ry to find reasonable guesses at the “centers”
not our goal here
dataset
and the label of each point
our goal is to learn about the block matrix, $B$,
learn the latent community assignment vector
these nodes tend to be more connected (more edges exist between and amongst them)
communities are groups of nodes that are stochastically equivalent.
Non-Identifiability
move to ase section?
had to delete
deleted
and so, f
Finally
Embedding
the point is that your embeddings are not in the same space.
humans + aliens
maybe clarify that
and so forth
remove
forth
,
first
introduce mase before omni if you are explaining mase before omni
used
that used
However, as you can see, the colors are flipped: the communities are in different places relative to each other.
this doesn't make any sense.
also, label communities L and R not 0 and 1.
plot_latents
plot these on the same scale
one
before this, show the true latent positions, label them Lhuman, Rhuman, Lalien, Ralien. Maybe all on one coordinate axis.
consider showing that they are not rotations of one another.
P = np.array([[pa, pb], [pc, pd]])
return sbm([n, n], P, return_labels=return_labels)

# make nine human networks
# and nine alien networks
p1, p2, p3 = .12, .06, .03
too many parameters and don't write 9 unless you sample 9
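One way to trim it, as a sketch (graspologic's sbm; fewer parameters, the loop count matches what's actually sampled, and the probability values are placeholders):

    import numpy as np
    from graspologic.simulations import sbm

    n = 50                      # nodes per community
    p_in, p_out = 0.12, 0.03    # placeholder probabilities
    P = np.array([[p_in, p_out],
                  [p_out, p_in]])

    # sample exactly as many networks as the text claims
    human_networks = [sbm([n, n], P) for _ in range(9)]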
because
b
bilateralized
bilateral
you’ll
We'll
you’ll just simulate
we'll simulate human and ...
simply
remove
having less stuff to deal with
?
Advisors
add:
Sambit, Ali, Jason
dependence
logical and statistical
wheel
crank
statsmodels
use sklearn or scipy
Curse
make suck less
element
adjacency rows, but then d=n
Spectral Embedding
and GNN
5.6.1
haven't done this yet
RDPG
a different RDPG
Adm
L, A, M
IER
venn diagram on 1 graph models and n-graph models
easily
impossible. parachute
coins
unique
Ranking
comment that binarization is decimation of ranking
networks
this comes after sparsification and truncation because you modify every edge
normalization
global rescaling
Sparsification
this is a special case of 'edge trimming'
add truncation
Lowering
this isn't lowering edge bias
done
clarify that if it is weighted, the remaining edges keep their weights, as compared to binarization,
Note
One cannot get arbitrary densities if one has repeated values for weights unless one has a procedure for discarding replicates.
exclude the diagonal
check graspologic, and make issue/PR
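A numpy sketch of the tie problem when targeting a density (assumes a symmetric weighted adjacency matrix; the tie-breaking policy is the assumption that matters):

    import numpy as np

    def sparsify_to_density(A, density):
        # keep the largest-weight edges; surviving edges keep their
        # weights (unlike binarization). excludes the diagonal.
        w = A[np.triu_indices_from(A, k=1)]
        thresh = np.quantile(w, 1 - density)
        B = np.where(A > thresh, A, 0)
        np.fill_diagonal(B, 0)
        # with repeated weights at the threshold, the achieved density
        # can miss the target unless replicates are broken somehow
        return B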
bias
thresholding reduces variance, adds bias
the task easier to estimate
not necessarily
The bias/variance tradeoff is
reference ESL chapter
Ignoring
no. only do this when the matrix is stored as upper/lower. but then don't quite do this
degree
remove 'pendants' and 'pizza huts'
4.4.1. Regularization of the Nodes
Node pruning
Degree
show this, and re-order to do this node trimming first.
show the degree distribution before and after
You
be more clear, and show result
Nodes
describe node latent space here, and network latent space in bag of networks
space
network latent space (as opposed to the node latent space we use to visualize nodes in a network)
Embedding this new matrix will give us a point in space for each network.
maybe move down?
dissimilarity
label the axes and update title to be dissimilarity of networks
All you need to get out of this code is that you have six networks from the first group, and another twelve networks from the second.
why not 5 and 5? or 10 and 10?
the whole
each
Nodes plotted as 2D points
Each node is a point.
Add a caption to this figure:
Each point is a node displayed in 2D latent space. Because there are 20 nodes in this graph, there are 20 points in this figure. Because there are 10 nodes in each community, 10 points share each color, indicating which community each node is in.
on a coordinate axis
in latent space
Euclidean
reals?
moving
mapping
Euclidean
not necessarily Euclidean
issue
for sound theoretical reasons
statsmodels
graspologic
end
probably
outlier
signal
outlier
signal
that
Clarify: the issue is not computing these features, but rather interpreting them, and in particular, interpreting them in a causal light.
you’d
one could
If you’re familiar with correlation, you’ll notice that these correlation numbers generally have a pretty high magnitude: each feature generally tells you a lot about each other feature.
Not quite: some are high, some are low; some say a lot about the others.
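A quick numpy illustration of that mix (synthetic features):

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(size=500)
    F = np.column_stack([x,
                         x + rng.normal(size=500),   # related to column 0
                         rng.normal(size=500)])      # unrelated noise

    print(np.corrcoef(F, rowvar=False).round(2))
    # off-diagonal entries: one high (~0.7), the others near zero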