272 Matching Annotations
  1. Mar 2021
    1. Tuning the weight noticeably improved our clustering. Below, you can see the difference between our embedding prior to tuning and our embedding after tuning.

      BIC or silhouette score both fine

      jovo prefers BIC >> take the k-means clusters and compute a BIC score under a model where each cluster has a spherical covariance matrix
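      something like this is what I mean (rough sketch; assumes sklearn's GaussianMixture with spherical covariances as the scoring model, and X standing in for our embedding):

      # pick the number of clusters by BIC under a spherical-covariance model
      import numpy as np
      from sklearn.mixture import GaussianMixture

      X = np.random.default_rng(0).normal(size=(300, 2))  # stand-in for the embedding
      bics = {}
      for k in range(1, 7):
          gm = GaussianMixture(n_components=k, covariance_type="spherical",
                               random_state=0).fit(X)
          bics[k] = gm.bic(X)  # lower BIC = better fit/complexity tradeoff
      best_k = min(bics, key=bics.get)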

      • a network is "vertices, edges, vertex attributes, edge attributes, graph attributes"
      • title: joint representation learning

      • make network be the tuple of all the things

      • clarify what a network is in the "what is a network" section

        • but a network can include graph, vertex, edge attributes. the network is the tuple of all of those things
      • figure showing the other options for joint embedding (jointly embed covariates/network with omni or MASE, embed them both separately, do case) would be helpful

    1. That is, the statistical network model indicates what, exactly, is random, and how we can characterize its behavior statistically.

      potentially could remove? but maybe fine

    1. For instance, in our social network example, we might only know whether two people are from the same school, and might not know whether they are in the same grade or share classes together, even though we would expect these facts to impact whether they might have a higher chance of being friends.

      run-on sentence

    2. unless you are looking at a graph in which the vertices are already arranged in an order which respects the community struucture.

      unless we're looking at a network that already has reasonably-ordered nodes

    3. ipython-input-11-8581568507e8>:8: FutureWarning: Using a non-tuple sequence for multidimensional indexing is deprecated; use `arr[tuple(seq)]` instead of `arr[seq]`. In the future this will be interpreted as an array index, `arr[np.array(seq)]`, which will result either in an error or a different result. heatmap(A[[vtx_perm]] [:,vtx_perm])

      wot. this FutureWarning is leaking into the book's output; the indexing needs fixing so the warning doesn't show up
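      a sketch of the likely fix (assumes A and vtx_perm as in the book's snippet; the stand-in A below is just to make this runnable):

      import numpy as np
      from graspologic.plot import heatmap

      A = np.random.binomial(1, 0.3, size=(50, 50))  # stand-in adjacency matrix
      vtx_perm = np.random.permutation(A.shape[0])

      # A[[vtx_perm]][:, vtx_perm] triggers the FutureWarning; either of these doesn't:
      heatmap(A[vtx_perm][:, vtx_perm])        # permute rows, then columns
      heatmap(A[np.ix_(vtx_perm, vtx_perm)])   # same thing via an open mesh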

    4. The below graph shows the exact same set of adjacencies as-above, but wherein $\pmb A$ has had its vertices resorted randomly.

      The heatmap below is the same network, it just has its nodes reordered randomly

    5. Consider, for instance, a similar adjacency matrix to the graph plotted above, with the exact same realization, up to a permutation (reordering) of the vertices

      Think about a network that's exactly the same as the one above, except we've reordered the nodes

    6. Indeed, the block structure may only be apparent given a particular ordering of the vertices,

      If you think about an adjacency matrix, we don't actually need to order the nodes any particular way. If two nodes that aren't in the same group are right on top of each other in an adjacency matrix, we won't even see any block structure at all! This doesn't mean that our network isn't an SBM, it just means that the way we ordered our nodes doesn't let us see any communities.

    7. a block structure is visually discernable.

      we can actually see the blocks

      or, changing the whole thing, "a network might be an SBM even if we can't see any blocks when we visualize it"

    8. The block structure is clearly delineated by the first 50 vertices being from a single community, and the second 50 vertices being from a different community.

      dont know if we need this if we say that blocks in SBM = communities in network earlier

    9. These blocks are the apparent “subgraphs”, or square patterns, observed in the above graph.

      "blocks in the heatmap of stochastic block models represent the communities in our network. Each community could be also be considered its own network, separate from the others! This is called a subgraph"

    10. In the above simulation, we can clearly see an apparent 4-“block structure”, which describes the fact that the probability of an edge existing depends upon which of the 4 “blocks” the edge falls into

      This Stochastic Block Model has two groups: the first is .....

    11. # for simplicity, the simulation code generates samples wherein
      # vertices from the same community are ordered in the vertex set by
      # their community order.

      had to think about this for a sec to understand it

    12. the adjacency matrices describing a graph which has the $SBM_n(\vec \tau, \pmb B)$ distribution.

      generate and visualize a stochastic block model

    13. graph which has the $SBM_n(\vec \tau, \pmb B)$ distribution.

      a stochastic block model
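      for reference, a minimal graspologic sketch of generating and visualizing an SBM (the n and B values here are just placeholders):

      import numpy as np
      from graspologic.simulations import sbm
      from graspologic.plot import heatmap

      n = [50, 50]                   # 50 nodes in each of 2 communities
      B = np.array([[0.5, 0.1],
                    [0.1, 0.5]])     # within-community edges more likely
      A = sbm(n=n, p=B)              # sample one adjacency matrix
      heatmap(A, title="SBM simulation")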

    14. due to the symmetry of $\pmb B$, $A_{ji} | v_i = k, v_j = l \sim Bernoulli(b_{kl})$, for all $i,j \in 1,...,n$.

      "due to the symmetry of B, ..." is jargonerific

      I think we should just assume that anybody reading this will forget what most of the single letters/symbols mean essentially as soon as they read them

    15. if vertex $v_i$ is in community $k$ and vetex $v_j$ is in community $l$, then an edge $e_{ij}$ or $e_{ji}$ exists between $v_i$ and $v_j$ with probability $b_{kl} = b_{lk}$. Fomally, we wite that $\pmb A \sim SBM_n(\vec \tau, \pmb B)$ if $A_{ij} | v_i = k, v_j = l \sim Bernoulli(b_{kl})$,

      hella jargonerific

    16. Intuitionally, this would correspond to the graph in which each of the The m

      I like this, needs to be a real sentence (and probably be way bigger, the main thing we want to emphasize)

    17. Further, the matrix $\pmb B$ is supposed to be symmetric; that is, for any $b_{kl}$, it is always the case that $b_{k,l} = b_{lk}$ for all $k = 1,..., K$.

      also very jargonerific, "symmetric" is unclear

    18. π‘˜=1,...,𝐾k=1,...,Kk = 1,..., K tend to exceed off-diagonal entries π‘π‘˜π‘™bklb_{kl} where π‘˜β‰ π‘™kβ‰ lk \neq l and π‘˜,𝑙=1,...,𝐾k,l=1,...,Kk,l = 1,...,K

      too formal

    19. $\pmb B$ such that the diagonal entries $b_{kk}$

      a non-mathematician will read this and say, "what the hell is B again? what is a b_kk??"

    20. $\pmb B$ with entries $b_{kl}$ for $k, l = 1,..., K$ defines the probability of an edge existing between vertices which are in community $k$

      too formal

    21. For instance, in a social network in which the vertices are students and the edges define whether two students are friends, a vertex assignment vector might denote the school in which each student learns.

      probably move this analogy to the top and elaborate/expand it more, this kinda stuff is what we want I think

    22. vertex assignment vector has entries $\vec \tau_i$, where $i = 1, ..., n$, for each of the vertices in the graph. For a given vertex $v_i \in \mathcal V$, the corresponding vertex assignment $\vec \tau_i$ defines which of the $K$ communities in which $v_i$ is a member.

      too formal

    23. In this case, rather than having a single edge existence probability, each pair of communities has its own unique edge existence probability

      relate this to ER?

      "We can make our erdyos-renyi graph a little more complicated by... in order to..."

    24. 𝐺=(,)G=(V,E)G = (\mathcal V, \mathcal E) is an SBM with 𝑛nn vertices, each vertex 𝑣𝑖viv_i can take be a member of one (and only one) of 𝐾KK possible communities.

      introducing statistical notation way too fast

    25. The Stochastic Block Model, or SBM, is a random graph model which produces graphs in which edge existence probabilities depend upon which vertices a given edge is adjacent to

      probably start with an analogy: "imagine we had two groups of people..."

    26. Which is in close agreement to the true probability, $p = 0.3$

      "which is pretty close to the actual probability"

      I think in general we want to talk as if we're having a casual conversation with a friend

    27. Given a graph with an adjacency matrix $\pmb A^{(s)}$, we can also use graspologic to estimate the probability parameter of the $ER_n(p)$ model:

      If we have a network, we can use graspologic to estimate the chance that a given pair of nodes is connected
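      e.g., something along these lines (sketch; assumes graspologic's EREstimator exposes the fitted probability as p_ after .fit()):

      import numpy as np
      from graspologic.simulations import er_np
      from graspologic.models import EREstimator

      A = er_np(n=50, p=0.3)            # pretend this is our observed network
      er = EREstimator(directed=False, loops=False)
      er.fit(A)
      print(er.p_)                      # should land near the true p = 0.3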

    28. square is dark red if an edge is present, and white if no edge is present

      I'm thinking that for heatmaps in this book, especially unweighted, we maybe want them black-and-white with no colorbar?

      or just a binary colorbar, like I have set up in the covariates matrix code in the CASC subsection (see that subsection for matplotlib code that does that)

      was thinking of just PRing the ability to do a binary colorbar into graspologic too
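      a rough matplotlib version of the binary-colorbar idea (not the exact CASC-subsection code, just a sketch):

      import numpy as np
      import matplotlib.pyplot as plt
      from matplotlib.colors import ListedColormap

      A = np.random.binomial(1, 0.3, size=(50, 50))  # stand-in unweighted network
      im = plt.imshow(A, cmap=ListedColormap(["white", "black"]), vmin=0, vmax=1)
      cbar = plt.colorbar(im, ticks=[0.25, 0.75])    # midpoints of the two color bands
      cbar.set_ticklabels(["no edge", "edge"])
      plt.show()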

    29. In the simple simulation above, we sample a single, undirected, network with loops, with adjacency matrix $\pmb A^{(s)}$.

      "Here's an example of what an erdos-renyi network looks like once we make it from our model"

    30. n = 50  # number of vertices
      ps = 0.3  # probability of an edge existing is .3
      # sample a single adj. mtx from ER(50, .3)
      As = er_np(n=n, p=ps, directed=True, loops=True)
      # and plot it
      heatmap(As, title="ER(50, 0.3) Simulation")

      maybe this figure should be right in the beginning without code (using code cell hiding), and then the reader has an image in their head as they go?

      although it'd still be good to have this code visible somewhere

    31. The following python code can be used to generate and visualize a graph which is generated by the $ER_n(p)$ model. Here, we let $n = 50$ vertices, and the probability of an edge $p = .3$:

      graph > network

      "We'll use a network with 50 nodes, and each pair of nodes has a 30% probability of being connected"

    32. $$
      \begin{align*}
      \mathbb E[deg(v_i)] &= \mathbb E\left[\sum_{j \neq i} A_{ij}\right] \\
      &= \sum_{j \neq i} \mathbb E[A_{ij}] \;\;\;\;\textrm{Expectation of a finite sum is the sum of the expectations} \\
      &= (n-1) p
      \end{align*}
      $$
      Which follows by using the fact that all of the $n-1$ possible edges which are incident vertex $v_i$ have the same expected probability of occurence, $p$, governed by the parameter for the $ER_n(p)$ model. This tractability of theoretical results makes the $ER_n(p)$ an ideal candidate graph to study in describing properties of networks to be expected if the network is $ER_n(p)$. Similarly, we can easily invert the properties of $ER_n(p)$ networks, to study when a graph is not an $ER_n(p)$ random graph, and may merit more careful inferential tasks. On another front, when one wishes to devise new computational techniques and deduce the efficiency or effectiveness of a technique on a network with a given number of nodes and a given number of edges, and is not concerned with how efficient the technique is if the network displays other (potentially exploitable) properties, the $ER_n(p)$ model also makes a good candidate for analysis. This is particularly common when dealing with graphs which are known to be sparse; that is, $p$ is very small (usually, on the order or less than $1/n$).

      very jargonerific paragraph, had to reread it a few times
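      a quick numeric sanity check might help ground it, e.g. comparing the mean degree of one ER sample against the $(n-1)p$ claim:

      import numpy as np
      from graspologic.simulations import er_np

      n, p = 50, 0.3
      A = er_np(n=n, p=p)                 # undirected, no loops by default
      mean_degree = A.sum(axis=1).mean()  # average observed node degree
      print(mean_degree, (n - 1) * p)     # these should be close (~14.7)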

    33. This is particularly common when dealing with graphs which are known to be sparse; that is, $p$ is very small (usually, on the order or less than $1/n$).

      "network"

      sparse > jargonerific

    34. given number of nodes and a given number of edges, and is not concerned with how efficient the technique is if the network displays other (potentially exploitable) properties, the $ER_n(p)$ model also makes a good candidate for analysis

      run-on sentence

    35. Similarly, we can easily invert the properties of $ER_n(p)$ networks, to study when a graph is not an $ER_n(p)$ random graph, and may merit more careful inferential tasks. On

      not sure what this sentence means

    36. This tractability of theoretical results makes the $ER_n(p)$ an ideal candidate graph to study in describing properties of networks to be expected if the network is $ER_n(p)$.

      a little circular, if we know something is ER, then of course ER is a good candidate model to use

    37. $\mathbb E[deg(v_i)] = \mathbb E\left[\sum_{j \neq i} A_{ij}\right] = \sum_{j \neq i} \mathbb E[A_{ij}]$

      I think any time we use an equation, we should explain what it's doing in words above?

      also not sure if we need this equation? or at least needs motivation first

    38. $deg(v_i) = \sum_{j \neq i} A_{ij}$

      we're only summing across j for a fixed i, but written like this I think it reads as summing everything that isn't on the diagonal

      also maybe just delete? dunno if we need this sum

    39. “identical” clarifies that the edge probability $p$ (or the probability of no edge, $1-p$) is the same for all edges within the network.

      every pair of nodes in our network has the same probability of being connected

    40. the occurence or not occurence of other edges in the network does not impact the probability of occurence of a given edge

      edges don't affect each other

    41. generative model we select for our network will not be the true generative model that underlies our network

      using the word "generative model" twice sounds a little awkward

    42. The vertex assignment vector has entries $\vec \tau_i$, where $i = 1, …, n$, for each of the vertices in the graph. For a given vertex $v_i \in \mathcal V$, the corresponding vertex assignment $\vec \tau_i$ defines which of the $K$ communities in which $v_i$ is a member. For instance, in a social network in which the vertices are students and the edges define whether two students are friends, a vertex assignment vector might denote the school in which each student learns. The matrix $\pmb B$ with entries $b_{kl}$ for $k, l = 1,…, K$ defines the probability of an edge existing between vertices which are in community $k$ with vertices which are in community $l$. For instance, in the social network example, one might select $\pmb B$ such that the diagonal entries $b_{kk}$ for $k = 1,…, K$ tend to exceed off-diagonal entries $b_{kl}$ where $k \neq l$ and $k,l = 1,…,K$. Further, the matrix $\pmb B$ is supposed to be symmetric; that is, for any $b_{kl}$, it is always the case that $b_{k,l} = b_{lk}$ for all $k = 1,…, K$. Intuitionally, this would correspond to the graph in which each of the The matrix $\pmb B$ defines that if vertex $v_i$ is in community $k$ and vetex $v_j$ is in community $l$, then an edge $e_{ij}$ or $e_{ji}$ exists between $v_i$ and $v_j$ with probability $b_{kl}=b_{lk}$. Fomally, we wite that $\pmb A \sim SBM_n(\vec \tau, \pmb B)$ if $A_{ij} | v_i = k, v_j = l \sim Bernoulli(b_{kl})$, or equivalently due to the symmetry of $\pmb B$, $A_{ji} | v_i = k, v_j = l \sim Bernoulli(b_{kl})$, for all $i,j \in 1,…,n$.

      too mathy

    1. We need a graph and some covariates. To start off, we’ll make a pretty straightforward Stochastic Block Model with 1500 nodes and 3 communities.

      need to add: equations for sampling the data, then the algorithm bit, then a demonstration that it works when it's supposed to
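      a placeholder sketch for that setup (the B values and covariate construction here are made up, just to have something concrete to hang the equations on):

      import numpy as np
      from graspologic.simulations import sbm

      n = [500, 500, 500]                           # 1500 nodes, 3 communities
      B = np.full((3, 3), 0.02) + 0.1 * np.eye(3)   # denser within communities
      A, labels = sbm(n=n, p=B, return_labels=True)

      # toy covariates: a noisy one-hot indicator of each node's community
      X = 0.8 * np.eye(3)[labels] + np.random.uniform(0, 0.2, size=(1500, 3))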