Hypothesis

88 Matching Annotations

Mar 2020
tech.datopian.com tech.datopian.com

Harvesting | Datopian Technical Docs

7
1. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  CSW
  
  What is this?
2. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  does anyone use AMQP
  
  Are you asking me?
3. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  This the specification of the Source object
  
  I don't get this sentence.
  
  "This is the specification of the Source object:" you mean?
4. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  Apache Airflow
  
  Link to it.
5. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  DataFlows
  
  Link to it.
6. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  WUI
  
  What does this mean?
7. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  creaate
  
  typo
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

tech.datopian.com/harvesting/
hackmd.io hackmd.io

HackMD - Collaborative Markdown Knowledge Base

3
1. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  processes
  
  Operational system processes or another type of process?
2. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  on bare metal
  
  What's "bare metal" in your definition?
3. 7n50T2wVB5w487nErXli7eKM5 26 Mar 2020
  
  in Public
  
  communication between pods
  
  And what's a "pod?"
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

hackmd.io/bB_974JFRGaVGmJc-gxiEQ
developers.google.com developers.google.com

Short sentences | Technical Writing One | Google Developers

16
1. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Should you place descriptions of code inside code comments or in text (paragraphs or lists) outside of the sample code? Note that readers who copy-and-paste a snippet gather not only the code but also any embedded comments. So, put any descriptions that belong in the pasted code into the code comments. By contrast, when you must explain a lengthy or tricky concept, you should typically place the text before the sample program.
  
  .
2. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  When your readers are very experienced with a technology, don't explain what the code is doing, explain why the code is doing it.
  
  .
3. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  According to research by Sung and Mayer (2012), providing any graphics—good or bad—makes readers like the document more; however, only instructive graphics help readers learn.
  
  .
4. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Most readers appreciate at least a brief introduction under each heading to provide some context. Avoid placing a level three heading immediately after a level two heading, as in the following example
  
  .
5. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Conversely, don't make paragraphs too short. If your document contains plenty of one-sentence paragraphs, your organization is faulty.
  
  .
6. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Long paragraphs are visually intimidating. Very long paragraphs form a dreaded "wall of text" that readers ignore. Readers generally welcome paragraphs containing three to five sentences, but will avoid paragraphs containing more than about seven sentences.
  
  .
7. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Avoid putting too much text into a table cell. If a table cell holds more than two sentences, ask yourself whether that information belongs in some other format.
  
  .
8. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  If the list item is a sentence, use sentence capitalization and punctuation. Otherwise, do not use sentence capitalization and punctuation.
  
  .
9. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Consider starting all items in a numbered list with an imperative verb.
  
  .
10. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Sentences that start with There is or There are marry a generic noun to a generic verb. Generic weddings bore readers.
  
  .
11. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Many technical writers believe that the verb is the most important part of a sentence. Pick the right verb and the rest of the sentence will take care of itself.
  
  .
12. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Most readers mentally convert passive voice to active voice.
  
  .
13. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  passive verb = form of be + past participle verb
  
  .
14. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  In an active voice sentence, an actor acts on a target. That is, an active voice sentence follows this formula: Active Voice Sentence = actor + verb + target A passive voice sentence reverses the formula. That is, a passive voice sentence typically follows the following formula: Passive Voice Sentence = target + verb + actor
  
  .
15. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  Use either of the following tactics to disambiguate this and that: Replace this or that with the appropriate noun. Place a noun immediately after this or that.
  
  .
16. 7n50T2wVB5w487nErXli7eKM5 22 Mar 2020
  
  in Public
  
  As a rule of thumb, if more than five words separate your noun from your pronoun, consider repeating the noun instead of using the pronoun.
  
  .
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

developers.google.com/tech-writing/one/just-enough-grammar
numpy.org numpy.org

NumPy for Matlab users — NumPy v1.19.dev0 Manual

8
1. 7n50T2wVB5w487nErXli7eKM5 21 Mar 2020
  
  in Public
  
  See http://mathesaurus.sf.net/ for another MATLAB®/NumPy cross-reference.
  
  Why linking to another of the same? If the reason is to provide a reference, I would change the title of this section and make this explicit.
2. 7n50T2wVB5w487nErXli7eKM5 21 Mar 2020
  
  in Public
  
  from numpy import *
  
  Generally, this is not recommended. Why recommending it here? Just because they would look more similar to how Matlab users do? If so, I believe the best approach is to teach the proper and recommended way, and not allow them to get their results without complying with Python standards.
3. 7n50T2wVB5w487nErXli7eKM5 21 Mar 2020
  
  in Public
  
  ‘array’ or ‘matrix’? Which should I use?
  
  If the answer is so simple and obvious, I wouldn't make a question out of it. Just introduce arrays, then. And then explain how they work when compared to Matlab's version.
  
  As a footnote, you may say "What about numpy matrices?" and give a short context.
4. 7n50T2wVB5w487nErXli7eKM5 21 Mar 2020
  
  in Public
  
  In MATLAB®, arrays have pass-by-value semantics, with a lazy copy-on-write scheme to prevent actually creating copies until they are actually needed. Slice operations copy parts of the array. In NumPy arrays have pass-by-reference semantics. Slice operations are views into an array.
  
  The title of the table are "differences," but the first sentence here is about a similarity.
5. 7n50T2wVB5w487nErXli7eKM5 21 Mar 2020
  
  in Public
  
  MATLAB® uses 1 (one) based indexing. The initial element of a sequence is found using a(1). See note INDEXING Python uses 0 (zero) based indexing. The initial element of a sequence is found using a[0].
  
  The first sentence of both columns are unnecessary.
6. 7n50T2wVB5w487nErXli7eKM5 21 Mar 2020
  
  in Public
  
  In MATLAB®, the basic data type is a multidimensional array of double precision floating point numbers. Most expressions take such arrays and return such arrays. Operations on the 2-D instances of these arrays are designed to act more or less like matrix operations in linear algebra. In NumPy the basic type is a multidimensional array. Operations on these arrays in all dimensionalities including 2D are element-wise operations. One needs to use specific functions for linear algebra (though for matrix multiplication, one can use the @ operator in python 3.5 and above).
  
  Sentences are too long. The same information can be said in a different, but lighter, way.
7. 7n50T2wVB5w487nErXli7eKM5 21 Mar 2020
  
  in Public
  
  Some Key Differences
  
  This table could have column names.
8. 7n50T2wVB5w487nErXli7eKM5 21 Mar 2020
  
  in Public
  
  MATLAB® and NumPy/SciPy have a lot in common. But there are many differences. NumPy and SciPy were created to do numerical and scientific computing in the most natural way with Python, not to be MATLAB® clones.
  
  Weird way of putting it. Are they similar or not? I'm specially confused about the third sentence - I had to read it 3x to understand it.
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

numpy.org/devdocs/user/numpy-for-matlab-users.html
numpy.org numpy.org

Data types — NumPy v1.18.dev0 Manual

1
1. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  NumPy supports a much greater variety of numerical types than Python does.
  
  I expected to read more on how comparable types are different from Python. Put in different words: what's the difference between type() and dtype()?
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

numpy.org/devdocs/user/basics.types.html
numpy.org numpy.org

What is NumPy? — NumPy v1.19.dev0 Manual

1
1. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  Who Else Uses NumPy?
  
  I expected to see logos here. Maybe links to all the repositories in GitHub importing the lib.
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

numpy.org/devdocs/user/whatisnumpy.html
numpy.org numpy.org

Quickstart tutorial — NumPy v1.18.dev0 Manual

8
1. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  Beware: matplotlib also has a function to build histograms (called hist, as in Matlab) that differs from the one in NumPy.
  
  How exactly? When to use each?
2. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  See linalg.py in numpy folder for more.
  
  I can't get this. Where is this? If that's relevant, why not linking to it?
3. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  Linear Algebra¶ Work in progress. Basic linear algebra to be included here.
  
  This should probably be in an issue tracker.
  
  Anyway, could easily make its own documentation page.
4. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  rg
  
  Up to this point, rg has not being imported.
5. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  The matrix product can be performed using the @ operator (in python >=3.5) or the dot function or method
  
  What's the recommended way? Based on my experience, people will use np.dot more often. Also because it's easy to spot bugs where you convert a matrix multiplication from a math formula and just uses *.
6. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  elementwise
  
  What's "elementwise"? What's different from other approaches? It's a great place to show a difference from Python lists - try to do the same with lists.
7. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  [20,30,40,50]
  
  Improve spacing
8. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  Why numpy. when the others don't have it? Why not linking to docs?
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

numpy.org/devdocs/user/quickstart.html
numpy.org numpy.org

NumPy: the absolute basics for beginners — NumPy v1.19.dev0 Manual

36
1. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  it’s very simple
  
  Many would say that is not simple. Generally, I'd be in favor of not suggesting that something is simple or easy at all.
2. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  The best and easiest way to do this is to use Pandas.
  
  It would be interesting to add that Pandas is not part of NumPy. Why should I use another library?
3. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  You can save a NumPy array as a plain text file like a .csv or .txt file with np.savetxt.
  
  This is a place for organizing the hierarchy of the section. How many recommended ways I have for saving an array in disk? What are the practical differences? When to use each?
4. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  ::
  
  Duplicated.
5. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  You can save it as “filename.npy” with: >>>>>> np.save('filename', a) You can use np.load() to reconstruct your array. >>>>>> b = np.load('filename.npy')
  
  Weird that I use .npy just when loading. If the function supports saving with the extension in the name, I would add it in the example.
6. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  handle NumPy binary files with a .npy file extension, and a savez function that handles NumPy files with a .npz file extension.
  
  What's the practical difference between the two extensions? What's the preferred/recommended way? Those are my first questions in this section.
7. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  You will, at some point, want to save your arrays to disk and load them back without having to re-run the code.
  
  "re-run the code." What code? What does this do? Maybe suggest that this was an array that was generated after calling multiple functions, etc.
8. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  For example, this is the mean square error formula (a central formula used in supervised machine learning models that deal with regression): Implementing this formula is simple and straightforward in NumPy: What makes this work so well is that predictions and labels can contain one or a thousand values. They only need to be the same size.
  
  Amazing use of images and colors!
9. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  When it comes to the data science ecosystem, Python and NumPy are built with the user in mind.
  
  I'd remove this. What ecosystem isn't (at least wouldn't say so)?
10. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  This section covers help(), ?, ??
  
  ? and ?? feels that's something missing.
11. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  The primary difference between the two is that the new array created using ravel() is actually a reference to the parent array (i.e., a “view”). This means that any changes to the new array will affect the parent array as well.
  
  I've never knew this! Every time I needed it, I would google "the correct way" because often would not behave as I needed.
12. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  How to reverse an array
  
  I've never personally heard about this np.flip function. I'd consider leaving this section out, or tell why it's relevant not to use Python's reversed.
13. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  transpose your matrices
  
  Considering the assumptions (of the reader) in the rest of this page, it would be nice to explain what transposing is.
14. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  generate random numbers (actually, repeatable pseudo-random numbers)
  
  I believe that this information is obvious for someone who knows the difference between the two. For those who don't know, it might add confusion. I'd remove it or leave it for a different paragraph, with some optional context.
15. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  You
  
  I'm slightly confused about the different colors and shades.
16. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  NumPy
  
  Why two grades of purple?
17. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  Views are an important NumPy concept!
  
  Let's bold this, then!
18. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  shallow copy
  
  Why italic?
19. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  ou can also stack two existing arrays, both vertically and horizontally.
  
  Another excellent place for visualizations. Not only a static image, but possibly a GIF.
20. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  slicing and indexing, np.vstack(), np.hstack(), np.hsplit(), .view(), copy()
  
  Why just some use np.?
21. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  a%2==0
  
  Add spacing
22. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  You can visualize it this way:
  
  Amazing to see visualizations. I think they should be much more present.
23. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  This section covers ndarray.ndim, ndarray.size, ndarray.shape
  
  Might be intentional, but now start thinking if these function names should link to their own documentation.
24. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  , axis=0
  
  That's something that I've seen confusing many people and I still have to think for a moment what's the axis 0 and 1. It would be nice to add a sentence on that before using.
25. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  This section covers np.sort(), np.concatenate()
  
  Amazing for preparing the reader for what's coming.
26. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  np.arange(2, 9, 2)
  
  It would be nice to add the attribute names here, too.
27. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  Arrays and array operations are much more complicated than are captured here!
  
  I think it's possible to say the same without scaring the person away.
28. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  2D
  
  I believe it's ok to drop this.
29. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  You might occasionally hear an array referred to as a “ndarray,” which is shorthand for “N-dimensional array.” An N-dimensional array is simply an array with any number of dimensions. You might also hear 1-D, or one-dimensional array, 2-D, or two-dimensional array, and so on. The NumPy ndarray class is used to represent both matrices and vectors. A vector is an array with a single dimension (there’s no difference between row and column vectors), while a matrix refers to an array with two dimensions. For 3-D or higher dimensional arrays, the term tensor is also commonly used.
  
  Again, another place for images.
30. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  “0”
  
  *
  
  `0`
31. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  One way we can initialize NumPy arrays is from Python lists, using nested lists for two- or higher-dimensional data.
  
  I'd love to see a drawing here.
32. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  Why use NumPy?
  
  It would be nice to add that, often, NumPy array can be used interchangeably with lists, but without the performance gains.
33. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  it’s very easy to understand
  
  It's better to explain, not say how people are supposed to feel when learning this new thing.
34. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  We shorten
  
  Who's "we"?
35. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  Installing NumPy
  
  How to choose between Anaconda or pip? It seems that assumes that I know the difference.
36. 7n50T2wVB5w487nErXli7eKM5 20 Mar 2020
  
  in Public
  
  find more information
  
  *learn more
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

numpy.org/devdocs/user/absolute_beginners.html
Feb 2020
handbook.datopian.com handbook.datopian.com

Analysis: Needs, Design and Planning | Datopian Playbook

2
1. 7n50T2wVB5w487nErXli7eKM5 25 Feb 2020
  
  in Public
  
  Need Discovery
  
  I see that this nomenclature (e.g. needs summary, job epics, blueprint) is widely used in documents across the organization. At this at this moment, from my first day ~4 months ago to this first day of Tech Leaders Program, it's not clear the difference between these documents. I wonder if there are ways to make this more explicit to people who haven't deeply understand this dojo page yet.
2. 7n50T2wVB5w487nErXli7eKM5 25 Feb 2020
  
  in Public
  
  NB
  
  What does it mean?
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

handbook.datopian.com/dojo/analysis/
handbook.datopian.com handbook.datopian.com

Tech Leaders Program | Datopian Playbook

2
1. 7n50T2wVB5w487nErXli7eKM5 25 Feb 2020
  
  in Public
  
  The way I see it, it seems to be a role that not only is in the technical side, but in the product one. In this context: does a tech leader work with a product owner? If so, how's the interaction between them?
2. 7n50T2wVB5w487nErXli7eKM5 25 Feb 2020
  
  in Public
  
  These are the Datopian Tech Leaders.
  
  It would be interesting to have a directory of graduates here in this page. This way, people can refer to them in the future.
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

handbook.datopian.com/tech-leaders-program/
Dec 2019
handbook.datopian.com handbook.datopian.com

Analysis: Needs, Design and Planning | Datopian Playbook

4
1. 7n50T2wVB5w487nErXli7eKM5 30 Dec 2019
  
  in Public
  
  Needs Analysis
  
  Personally, I read it as "(it) needs analysis" for a few weeks while reading such doc about an existing Datopian project. Silly mistake, I know.
2. 7n50T2wVB5w487nErXli7eKM5 30 Dec 2019
  
  in Public
  
  need or want to requirements.
  
  TODO: change to
  
  "need" or "want" to "requirements."
3. 7n50T2wVB5w487nErXli7eKM5 30 Dec 2019
  
  in Public
  
  Plan: planning work to deliver that solution. This includes breaking down a design into tasks, clarifying their dependencies and estimating these (i.e. a roadmap).
  
  If the first steps are done by someone who's not going to be the developer, ideally, the estimation step should include the developers (e.g., engineers, designers).
4. 7n50T2wVB5w487nErXli7eKM5 30 Dec 2019
  
  in Public
  
  We emphasize that analysis is applicable both to large projects and to a single simple task. Just as test-driven development is worthwhile even for simple changes, so analyis will pay dividends even for small script or a minor change to a website.
  
  Reminds me of README-Driven Development, associated to Test-Driven Development.
Visit annotations in context

Annotators

7n50T2wVB5w487nErXli7eKM5

URL

handbook.datopian.com/dojo/analysis/

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL