224 Matching Annotations
  1. Oct 2025
    1. The system's potential is demonstrated through concrete validation in three biomedical areas: drug repurposing, novel target discovery, and explaining mechanisms of antimicrobial resistance. For instance, it proposed drug candidates for acute myeloid leukemia that showed tumor inhibition in vitro, showcasing its ability to produce genuinely valuable and original scientific insights. This work frames a new vision for AI: a system that can navigate the high-dimensional space of existing scientific knowledge to discover the unknown.

      align

    2. Proposes a multi-agent system based on Gemini 2.0 to automate hypothesis generation in scientific discovery, using a "generate, debate, and evolve" approach.

      align

    1. A crucial operational maxim is to "be stubborn on the vision and flexible on the details," acknowledging that this flexibility is necessary because the world is changing [1].

      Important

    1. Formally, it minimizes something like log loss or cross-entropy between predicted probabilities and actual observed outcomes — a metric that rewards calibrated truthfulness.

      elaborate
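
      A minimal sketch of the metric in plain Python (standard binary cross-entropy; the numbers are invented for illustration):

      ```python
      import math

      def log_loss(probs, outcomes):
          """Mean negative log-likelihood of observed binary outcomes."""
          nll = [-(y * math.log(p) + (1 - y) * math.log(1 - p))
                 for p, y in zip(probs, outcomes)]
          return sum(nll) / len(nll)

      # A calibrated forecaster scores well; an overconfident one is punished
      # hard the moment a "sure thing" fails to happen.
      print(log_loss([0.9, 0.9, 0.8], [1, 1, 1]))     # ~0.14
      print(log_loss([0.99, 0.99, 0.99], [1, 1, 0]))  # ~1.54
      ```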

    2. In standard RL or reward-maximizing setups, an agent can over-optimize the reward proxy (Goodhart’s law) and exploit loopholes. A Bayesian, inference-only system doesn’t optimize for outcomes—it merely models them. The ensemble of hypotheses provides natural regularization against runaway single-metric optimization.

      elaborate
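
      A toy contrast, with everything below invented for illustration: a single proxy model can be Goodharted by one inflated candidate, while an ensemble of hypotheses damps that same candidate because the models disagree exactly there:

      ```python
      import numpy as np

      rng = np.random.default_rng(0)
      n = 100
      quality = rng.normal(0, 1, size=n)

      # 20 noisy hypotheses about quality; on candidate 17 (a region the data
      # never covered) they disagree wildly, and hypothesis 0 is badly fooled.
      hypotheses = np.stack([quality + rng.normal(0, 0.3, n) for _ in range(20)])
      hypotheses[:, 17] = rng.normal(0, 6, size=20)
      hypotheses[0, 17] = 8.0

      # Optimizing a single proxy chases the loophole:
      print(int(np.argmax(hypotheses[0])))        # -> 17

      # Modeling with the whole ensemble regularizes it away: the mean is
      # small and the disagreement is large exactly at the loophole.
      mean, spread = hypotheses.mean(0), hypotheses.std(0)
      print(int(np.argmax(mean - spread)))        # -> not 17
      ```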

    1. Toronto team’s output is on general AI capabilities (LLMs, generative models, agent learning), rather than domain-specific science tools.

      Important

  2. Sep 2025
    1. the prevailing corporate governance models are fundamentally ill-suited to the unique, long-term safety requirements of AGI development.

      Important

    2. Adapting Geometry for Directedness

      I have a feeling that we can learn the “shape” of the DAG if we investigate it through the lens of Discrete Differential Geometry

    3. The GFlowNet "flow matching" loss condition can be interpreted using this language, relating the flow into a state to the flow out of it.

      elaborate

    4. The production rules of a grammar are the constraints that force the data to a low-dimensional manifold. A molecule's grammar, for instance, prevents the vast majority of random atom-and-bond combinations from ever being formed, concentrating all valid molecules onto a specific, structured surface.

      makes sense

    1. Effect: credit from terminal rewards can reach all ancestors in one step through the balancing equations — no need to wait for step-by-step temporal backups.

      this is so important

    2. So instead of waiting for a single reward at the end of a trajectory, each local edge is trained to satisfy a conservation law consistent with terminal rewards.

      Important

    3. GFlowNets use flow-matching (inflow = outflow) as a local conservation law — mathematically simpler, more stable, better credit assignment.

      elaborate
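
      A runnable miniature of the conservation law on a four-edge DAG. The published GFlowNet losses match flows in log space; this sketch uses plain squared errors for readability, and the graph and reward are invented:

      ```python
      import torch

      # DAG: s0 -> s1 -> s3 and s0 -> s2 -> s3, with terminal reward R(s3) = 4.
      # log_flow holds one learnable log edge flow per edge: 01, 02, 13, 23.
      log_flow = torch.nn.Parameter(torch.zeros(4))
      R = 4.0

      def flow_matching_loss():
          F = log_flow.exp()
          interior = (F[0] - F[2]) ** 2 + (F[1] - F[3]) ** 2  # inflow = outflow
          terminal = (F[2] + F[3] - R) ** 2                   # inflow = reward
          return interior + terminal

      opt = torch.optim.Adam([log_flow], lr=0.1)
      for _ in range(500):
          opt.zero_grad()
          flow_matching_loss().backward()
          opt.step()

      # Each edge flow converges to ~2, so inflow(s3) = 4 = R: credit from the
      # terminal reward has propagated to every ancestor edge via local losses.
      print(log_flow.exp())
      ```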

    1. interpolates/extrapolates reward structure: if many molecules with substructure A had high reward, the policy will bias flow toward other molecules that also contain substructure A — even if it never explicitly saw them.

      important

    1. The above two equations are forced to be consistent (i.e. there is an $F$ that gives rise to both $P_B$ and $P_F$) when $F$ satisfies the flow-matching constraint (the amount of entering flow equals the amount of outgoing flow), which is not necessarily true when $F$ is estimated by a neural network that is being trained (and we have not fully completed training and brought the training loss to 0 everywhere).

      can we force this constraint by design?
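
      One answer from the literature is the trajectory-balance objective (Malkin et al., 2022): skip the flow estimate $F$ and parameterize $\log Z$, $P_F$, and $P_B$ directly, training $Z \prod P_F = R(x) \prod P_B$ per trajectory, so consistency is built into what is learned rather than hoped for. A schematic sketch (the step log-probabilities below are placeholders, not a trained policy):

      ```python
      import torch

      def trajectory_balance_loss(log_Z, log_pf_steps, log_pb_steps, log_reward):
          # (log Z + sum log P_F) should equal (log R(x) + sum log P_B)
          lhs = log_Z + torch.stack(log_pf_steps).sum()
          rhs = log_reward + torch.stack(log_pb_steps).sum()
          return (lhs - rhs) ** 2

      log_Z = torch.nn.Parameter(torch.tensor(0.0))
      log_pf = [torch.log(torch.tensor(p)) for p in (0.5, 0.8)]  # placeholder steps
      log_pb = [torch.log(torch.tensor(p)) for p in (1.0, 0.5)]
      loss = trajectory_balance_loss(log_Z, log_pf, log_pb,
                                     torch.log(torch.tensor(4.0)))
      ```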

    1. The world is combinatorially large (endless possible combinations). Humans generalize well by reusing parts without needing to see every possible combo in training.

      Is this what happens with LLMs, and possibly Stable Diffusion?

    2. A small model might need symbolic rules to parse and generate. A large LLM trained on trillions of tokens can produce fluent sentences directly, implicitly encoding the grammar.

      important. need elaboration

    3. Represented symbolically, compositionally, or in step-by-step logic. Conscious and accessible: you can bring it into working memory, verbalize it, and explain it.

      elaborate

    1. Drug discovery models (like GFlowNets, graph diffusion, VAEs) exploit this structure: they learn to navigate and generate only chemically valid molecules rather than arbitrary graphs.

      important

    1. Instead of treating a design as one huge blob (like a voxel grid), CGID represents objects as a composition of functional parts.

      similar to the compositionality spirit of GFlowNet

    1. 1. Does a GFlowNet need training data points? Not necessarily. If you have a reward function that you can compute for any candidate object, the GFlowNet can train purely by sampling objects (even ones it has never seen before) and scoring them. This is why they’re attractive for drug/material design: you don’t need a huge dataset of known good molecules, just a scoring function (simulator, energy model, ML predictor) to evaluate new candidates.

      important
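
      A complete toy run of exactly this: no dataset, only a reward function evaluated on sampled objects. The domain (3-bit strings), reward, and hyperparameters are invented; training uses trajectory balance, and the backward policy is trivially 1 because each string has a unique build order:

      ```python
      import torch

      L = 3
      def reward(bits):              # stand-in for a simulator / ML scorer
          return 0.1 + sum(bits)     # strings with more 1s score higher

      def prefix_index(bits):
          # Unique table row per prefix: offsets 0, 1, 3 for lengths 0, 1, 2.
          val = 0
          for b in bits:
              val = val * 2 + b
          return (1 << len(bits)) - 1 + val

      logits = torch.nn.Parameter(torch.zeros(7, 2))   # forward policy table
      log_Z = torch.nn.Parameter(torch.tensor(0.0))
      opt = torch.optim.Adam([logits, log_Z], lr=0.05)

      for _ in range(3000):
          bits, log_pf = [], torch.tensor(0.0)
          for _ in range(L):                           # build object step by step
              probs = torch.softmax(logits[prefix_index(bits)], dim=0)
              b = int(torch.multinomial(probs, 1))
              log_pf = log_pf + torch.log(probs[b])
              bits.append(b)
          # Trajectory balance; P_B = 1 since each string has one build order.
          loss = (log_Z + log_pf
                  - torch.log(torch.tensor(float(reward(bits))))) ** 2
          opt.zero_grad()
          loss.backward()
          opt.step()

      # The sampler now draws each string with probability approximately
      # R(x) / Z, e.g. "111" at roughly 3.1 / 12.8, without any dataset.
      ```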

    1. Rather than finetuning the whole diffusion model, it optimizes the conditioning embeddings (e.g., CLIP embeddings) with feedback from simulation performance.

      ??

    1. But if they have an energy landscape (a surface with valleys and peaks corresponding to good vs bad configurations), then you don’t need to brute force. You just need to learn the gradient — the direction downhill toward stable/valid solutions.

      interesting. Need elaboration
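
      A toy version of "follow the gradient instead of brute-forcing" (the landscape is invented, and in practice the gradient would be learned, e.g. by a score model, rather than written down analytically):

      ```python
      import numpy as np

      # Energy landscape with two valleys (valid configurations) at x = -2, +2.
      def grad_energy(x):
          return x * (x**2 - 4.0) / 2.0   # d/dx of (x^2 - 4)^2 / 8

      rng = np.random.default_rng(0)
      x = rng.normal(0.0, 1.0)            # random start, no enumeration of space
      for _ in range(500):
          # Langevin-style update: a downhill step plus a little noise.
          x -= 0.05 * grad_energy(x) - 0.01 * rng.normal()
      print(round(x, 2))                  # settles near a valley, -2 or +2
      ```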

    1. This assumption is empirical: generative models (autoencoders, GANs, diffusion models, LLMs) succeed precisely because such structure exists and is learnable.

      evidence that generative models can capture this structure just by observation, without interacting with the physical world?

    2. In mathematical terms, if the total search space has size $N$, but the effective dimensionality of “plausible” or “stable” solutions is only of size $M \ll N$, then we say the domain has exploitable structure.

      manifold

    1. What’s strong about P

      It makes me think about the general & universal PS system with a core Transformer and different adapters for each task. Can we build a similar universal planning system that shares the same core LLM?

    1. i'm not sure if this is blackpill or whitepill, but there are a heap of new papers, along with my own experiences, showing "best of N is all you need" for most problems as long as:
       - sufficient core knowledge was included in the training data
       - the model is sufficiently large / you use more than 1 model to promote reasonable idea diversity
       - N is sufficiently large for the complexity of the problem at hand
       - you have some reasonable discrimination process at the end to determine / approximate the "best" result
       we really haven't come close to leveraging the full potential of existing models, and the antiquated sampling process / approaches are the single biggest culprit

      what if we include some sampling distribution on the output side?
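
      A bare-bones sketch of the recipe (the generator and scorer are stand-ins for a model call and a discrimination process). As for the note above, one option is a softmax over scores instead of a hard max, shown as soft_best_of_n:

      ```python
      import math, random

      def generate():                  # stand-in for one model sample
          return random.gauss(0.0, 1.0)

      def score(candidate):            # stand-in for the discrimination process
          return candidate

      def best_of_n(n):
          return max((generate() for _ in range(n)), key=score)

      def soft_best_of_n(n, temp=0.5):
          # "Sampling distribution on the output side": softmax over scores
          # instead of a hard argmax, trading peak quality for diversity.
          cands = [generate() for _ in range(n)]
          weights = [math.exp(score(c) / temp) for c in cands]
          return random.choices(cands, weights=weights)[0]

      for n in (1, 4, 16, 64):
          avg = sum(best_of_n(n) for _ in range(2000)) / 2000
          print(n, round(avg, 2))      # winner quality rises with N
      ```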

    1. If they achieve network effects (large user base, ecosystem of integrations, proprietary data flows), they could become the “operating system” for interacting with AI.

      important

    1. Both groups are in a race to the bottom of commoditization, where only brand, UX, or network effects (e.g., Perplexity’s model-switching) can provide some edge

      why?

    1. Even if scaling is more generally efficient than search, search allows for quicker intelligence in narrow domains. Training larger foundation models is slow. With search, you don’t have to wait.

      this is the key lesson

    1. I like research topics that are simple, general, and stand the test of time, and I try to avoid projects that are complicated, task-specific, or short-lived.

      elaborate. I guess general research topics are more fundamental and tend to have a longer lifespan?

    2. Most people (including me) would benefit greatly by spending more time on idea selection, since doing this well is a huge multiplier on research impact. Conversely, working on a narrow topic with little headroom caps the impact of the project, no matter how well it is executed.

      elaborate

    3. A good suggestion from a friend is to either (1) work on a hot topic and do it better than everyone else, or (2) work on something that might become the next hot topic. Strategy 1 is lower risk and requires working very hard. Strategy 2 is higher risk but has potentially very high reward.

      elaborate

    1. They scale with the frontier: as foundation models improve, broad topics grow in importance, while narrow ones fade.

      why do these scale with the frontier?

    1. It is an open question why exactly scaling works, but here are two hand-wavy reasons. One is that small language models can’t memorize as much knowledge in their parameters, whereas large language models can memorize a huge amount of factual information about the world. A second guess is that while small language models are capacity-constrained, they might only learn first-order correlations in data. Large language models on the other hand, can learn complex heuristics in data.

      important

    2. Intuition 3. Tokens can have very different information density, so give language models time to think.

      important. I guess he realized this by examining the behaviors and outputs of LLMs, and that in turn inspired the idea of intermediate reasoning via chains of thought.

    3. The solution to this is to give language models more compute by allowing them to perform natural language reasoning before giving the final answer.

      nice
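
      The mechanic reduces to a prompting choice; a sketch where llm is a placeholder for any text-completion call, not a real API:

      ```python
      def llm(prompt: str) -> str:
          raise NotImplementedError("plug any text-completion model in here")

      question = ("A bat and a ball cost $1.10 in total. The bat costs $1.00 "
                  "more than the ball. How much does the ball cost?")

      # Forced to answer immediately: every output token must already be the answer.
      direct_prompt = question + "\nAnswer with just the price."

      # Given room to reason: intermediate tokens buy extra compute before the
      # final answer is committed, which is the whole trick.
      cot_prompt = question + "\nLet's think step by step, then give the price."

      # answer = llm(cot_prompt)   # uncomment once llm is implemented
      ```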

    4. You can imagine that if you’re ChatGPT, and as soon as you see the prompt you have to immediately start typing, it would be pretty hard to get that question right.

      important

    5. This is an interesting example of how a simple objective, when combined with complex data can lead to highly intelligent behavior (assuming you agree that language models are intelligent).

      nice observation

    1. It is important to understand that this does not mean LLMs will be gods producing 100x code, because virtually no domain that software engineering is useful has a perfect oracle. A perfect oracle is a type of feedback where you are given a “correct/incorrect” answer every single time, and they almost only appear in games as real world typically doesn’t have perfect models of correctness. Winning or losing a game is a perfect oracle, as well as creating a program that can pass the judge in a competitive programming contest.

      important and impressive advice

    1. However, in scientific innovation, we are in a totally different realm where we only care about solving a single problem (train=test!) because it’s an unsolved problem and potentially extremely valuable.

      .

    1. Prefer methods that pass a scaling test: their delta is flat or increasing as you go from S→M→L models.

      maybe we can just plug in models of various scales and see how the performance projects?
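
      That is essentially the test, and it is cheap to script. A sketch with a fake benchmark harness (all numbers invented):

      ```python
      SCALES = ("S", "M", "L")

      def scaling_test(evaluate):
          """Keep a method only if its delta over baseline is flat or
          increasing as model scale grows."""
          deltas = [evaluate(s, use_method=True) - evaluate(s, use_method=False)
                    for s in SCALES]
          monotone = all(b >= a - 1e-9 for a, b in zip(deltas, deltas[1:]))
          return monotone, deltas

      # Fake harness: baseline accuracy rises with scale; this method's gain
      # happens to grow too, so it passes.
      acc = {"S": 0.50, "M": 0.62, "L": 0.71}
      gain = {"S": 0.02, "M": 0.03, "L": 0.05}
      def evaluate(scale, use_method):
          return acc[scale] + (gain[scale] if use_method else 0.0)

      print(scaling_test(evaluate))   # (True, [~0.02, ~0.03, ~0.05])
      ```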

    2. Method: anything you wrap around or plug into the core model: data curation, training tricks, inference procedures, retrieval, tools, constraints, rewards, routing, etc.

      maybe each AI startup out there is a different wrapper, specific to each domain?

    3. 🔹 What does “method” mean here?

      So there are 2 types of hammers:
      - architectures that scale well with compute & data
      - methods that scale well with compute & data
      It might be interesting to take these compute-scalable methods and integrate them into architectures of other domains? What are historic examples of these compute-scalable methods? Are they general-purpose?

    1. adjacent signals: lots of startups are already building agent frameworks (LangChain, AutoGPT, CrewAI). Their bottlenecks hint at where research could contribute

      great advice

    1. 2020: you can cast any language task as sequence prediction and learn it via pretrain + finetune
       2021: scaling to GPT-3 size enables doing arbitrary tasks specified via instructions
       2022: scaling to GPT-3.5/PaLM size unlocks reasoning via chain of thought
       2023: LLMs themselves can be a product a lot of people will use
       2024: to push capabilities past GPT-4, scale test-time compute

      there is a dominant trend and scaffolding structure here

    1. Old habit: Just grab a standard benchmark (e.g., GLUE, ImageNet, MMLU) and test your method there. Problem today: Those benchmarks might not stress the thing your method is designed for. You’ll conclude your idea “doesn’t work,” when in fact you just used the wrong test.

      important

    2. Now: Large language models are so capable and multi-task that whether a method works depends a lot on which dataset you test it on.

      elaborate on this

    1. Acknowledgements: This blog post features contributions from Gabriel Ilharco. I would like to thank Hattie Zhou, Nelson Liu, Noah Smith, Gabriel Ilharco, Mitchell Wortsman, Luke Zettlemoyer, Aditya Kusupati, Jungo Kasai, and Ofir Press for their valuable feedback on drafts of this blog post.

      this guy certainly doesn't feel shy about asking people for feedback on his work, no matter whether it's as complicated as doing research or as simple as writing a blog post

    2. He builds hacks, understands the deep relationships of how his hack affects the system, and then extracts this insight in the most minimalistic and well-formulated way possible along with his practical hack.

      This is such good advice

    3. Navigating this uncertainty is best done through fast iterations and balancing multiple projects to maximize the chances of a big success.

      important

  3. Aug 2025
    1. ✅ Correct in spirit: Solving “make X work for the first time” usually looks like jumping from essentially no working method to a viable solution (say 0% → 70%).

      this looks far more impressive than incremental work

    1. Pick a meaningful dimension of stress (tokens, latency, noise, compositional depth, generalization to new domains, etc.).

      how to select a "meaningful dimension" is an important follow-up question in its own right. I have several points to add:
      - the dimension should also involve "feasibility" that fits our research conditions
      - maybe to find those "meaningful dimensions", we can look very top-down from applications. We can ask: if this dimension is extended, which kinds of applications would benefit? With this, we quickly realize that "context length" is a super influential dimension, benefiting various applications.
      - how about borrowing from related domains, like Jinwoo did in TokenGT?

    2. Relevance to the field’s trajectory The capability should connect to active conversations in the community. Example: In 2020–2022, instruction-following was “in the air” because GPT-3 showed emergent abilities, but not controllability. So InstructGPT’s capability (follow human instructions) was both new and natural.

      I guess ChatGPT can help me with identifying the next natural new AI capabilities to work on

    3. Do you want me to break down how to tell, when reading a new paper, whether it’s a “first-time X” paper or a “make X better” paper? That skill will help you classify work quickly.

      so there are many dimensions for “working better”? e.g. more stable, more scalable, ...

    1. Alternatively, armed with the knowledge gained from working on the first idea, you might move on to a different idea aligned with the same goal, with a higher chance of success.

      so knowledge from working on the first idea leads to a higher chance of success with the second idea? So I guess it's all about failing fast and then iterating?

    2. If you are thinking in terms of ideas, you’d be easily frustrated and might give the idea a few more attempts before finally giving up and moving on to another, possibly unrelated idea, repeating the same process

      "unrelated idea" is an important point

    3. The main issue is that ideas have a very short lifespan; an idea is unlikely to work at first, might not be novel enough, might be easily scooped

      elaborate

    4. When you’re starting in a new area without a full understanding of the challenges or limitations, it is very tempting to run after a sole idea that you think will work.

      why?

    5. What Yang was talking about is reading enough papers to cover most of the literature in your area. Needless to say, it is not only about the paper count you read, although the paper count can serve as a good indicator of how well you are engaged with the literature in your research area.

      the final goal is to build full coverage and understanding of your literature

    1. new AI capabilities

      this is too broad. Are there any constraints to narrow down the set of new AI capabilities that we should envision within a 2-year scope?

    2. Goals also make it possible for a team of researchers to work together and attack different aspects of the problem, whereas idea-driven research is most effectively carried out by “teams” of 1-2 people.

      why??

    3. On the other hand, with goal-driven research, your goal will give you a perspective that’s differentiated from the rest of the community. It will lead you to ask questions that no one else is asking,

      why??

    4. To make breakthroughs with idea-driven research, you need to develop an exceptionally deep understanding of your subject, and a perspective that diverges from the rest of the community—some can do it, but it’s difficult.

      why??

    5. you test a variety of existing methods from the literature, and then you develop your own methods that improve on them

      (1) Which existing methods should we test? (2) Why do we have to test them before building our own methods? And what does it mean to "improve on them"?

    6. I’ll take goal-driven research to mean that your goal is more specific than your whole subfield’s goal, and it’s more like make X work for the first time than make X work better.

      what does this mean: "your goal is more specific than your whole subfield's goal"? How does that align with "make X work for the first time" rather than "make X work better"?

    1. This AI-driven auto-scheduling is a massive time-saver, eliminating the tedious manual process of trying to Tetris tasks into your day.

      important

    1. A 40 hour time-blocked work week, I estimate, produces the same amount of output as a 60+ hour work week pursued without structure.

      bringing structure into life always wins

    1. 5. Pareto 80/20
       Why #5: Still powerful — often a few papers/experiments/figures give most of the insight. But research is exploratory, so it’s easy to misjudge which 20% matters most until later. Works best if combined with checkpoints.
       Example: In experiment runs → 2–3 baseline setups cover 80% of insight; no need to sweep every hyperparameter.

      CS 197: Computer Science Research, Step 1 (Performing a literature search): keep track of how much you’re learning about the design axes as you consume additional papers. Typically, you’re learning the most at the very beginning, and the amount per paper starts going down after five papers or so.

    2. 6. Progressive Layering
       Idea: Build the output in layers: skeleton → basic fill → deeper detail → polish.
       How to apply: Do a quick pass that touches everything at a shallow level, then loop back to deepen.
       Good for: Ensuring balanced coverage and avoiding “holes.”
       Limits: Requires resisting the urge to perfect one section before moving on.

      important

    3. 4. Greedy Value-per-Time
       Idea: Always pick the next piece of work that gives the highest value for the time spent.
       How to apply: For each possible action, estimate “How much value will this add?” / “How long will it take?” → Do the one with the best ratio.
       Good for: Gradually building up value in the most efficient order.
       Limits: Estimation can be rough; might overlook long-term gains.

      need elaboration
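
      A few-line version of the heuristic (tasks and estimates invented), which also makes the stated limit concrete: the plan is only as good as the value/time guesses:

      ```python
      tasks = [  # invented estimates: (name, value, hours)
          ("run 2 baseline setups", 8, 4),
          ("polish figure 3",       2, 3),
          ("write related work",    5, 2),
          ("full hyperparam sweep", 3, 8),
      ]

      budget, plan = 8, []
      # Sort by value-per-hour, then take tasks greedily while time remains.
      for name, value, hours in sorted(tasks, key=lambda t: t[1] / t[2],
                                       reverse=True):
          if hours <= budget:
              plan.append(name)
              budget -= hours
      print(plan)   # ['write related work', 'run 2 baseline setups']
      ```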

    4. 3. Time-Boxed Checkpoints
       Idea: Divide your time budget into checkpoints (e.g., 25%, 50%, 75% of T).
       How to apply: At each checkpoint, stop and take stock. Freeze the current delivery so it’s already usable, then decide whether to add or polish.
       Good for: Avoiding the trap of chasing “better” until the very last minute.
       Limits: Requires discipline to actually stop and review.

      need elaboration

    1. Without early reality checks: you drift → big risks discovered late (wasted days). With L0/L1 reality checks: you “fail fast, small, and cheap,” so every later step builds on a working scaffold.

      fail fast

    1. Very sharp connection, Dat 👍 — you’re right: open-ended research tasks suffer from the same two problems as reinforcement learning (RL):

      both RL and research are hard. I guess feedback is the bottleneck for both.

    1. If you ship something usable by the deadline → ✅ pipeline works. If not → ❌ pipeline broke (e.g., stuck in collection or polish).

      important

    1. Closes the loop faster → You don’t wait months for evaluation.
       Prevents drift → Every day/week forces a checkpoint.
       Shapes behavior → You optimize for progress under time instead of perfect completeness.

      this is gold

    2. Each week is a reward event. It shapes your trajectory forward, instead of leaving you wandering until the “final boss” (PhD defense).

      this is gold

    3. 🔹 2. Daily Micro-Outputs as Rewards
       Reward Rule: +1 if by end of day you capture something tangible (a new cluster, a new theme, or one updated paragraph of outline).
       Penalty Rule: 0 if you only consumed/collected without producing.
       👉 This ensures every day has a checkpoint. Even if small, it signals whether the pipeline is alive.

      important

    4. That is feedback about the weak link in your pipeline: maybe your clustering method is too slow, or you’re over-expanding the input set. Without the deadline, you’d never notice the bottleneck — you’d just keep drifting.

      gold

    1. MVO: a plain input box with keyword search that returns results by exact match. → This lets you confirm: Do users even use the search bar? If yes, you iterate.

      one example for delivering fast to get feedback fast

    2. No natural stopping point
       Completeness instinct: A dev team building a new chat feature decides they must include typing indicators, message reactions, file sharing, and push notifications before launch.
       Result: Months pass before users even test the basic messaging experience.
       MVO version: Ship a bare-bones text-only chat. Once people actually use it, you’ll know if features like reactions are worth adding.

      very prone to over-planning

    3. Layered refinement → MVO outputs become scaffolds. You can always enrich them later, but at least you have something concrete.

      elaborate on this

    4. Barely acceptable ≠ low quality. It means the smallest unit of work that is valid enough to be tested, evaluated, or built upon. The spirit is: cross the acceptance line quickly, then refine if it proves worthwhile.

      So the point is that it allows fast delivery of an artifact, thus enabling rapid feedback, iteration, and building on top of it

    1. 3. Reinforcement Learning (Sparse, Delayed Feedback)

      this is so similar to my scenario, so it deserves lots of elaboration. Yes, I can only test whether my developed meta-research approach is valid by using it to develop a research question and implement a project, so the feedback is very delayed. However, if we use MVO for the process of developing the research question and the implementation, these can be done rapidly and give rapid feedback to the meta-research approach.

    2. If you reach the deadline and you have something coherent → ✔ that’s feedback that your process worked. If you reach the deadline and you’re still collecting without synthesis → ❌ that’s feedback that your process got stuck, and you must force closure.

      important

    3. Your first synthesis attempt is the feedback resource. If it produces something coherent, that’s your signal to stop collecting and move forward. If it produces obvious holes, those holes tell you precisely what to collect next.

      gold

    4. Force yourself to draft a 1-page proto-framework:
       Section 1: How I choose problems
       Section 2: How I generate ideas
       Section 3: How I design experiments
       Section 4: How I reflect and adapt
       Even if rough, you’ll quickly feel whether your collected advice is enough to fill these slots.

      super important

    5. If your 1-page proto-style already helps you reason, structure, or discuss, that’s feedback enough. You don’t need 100% collection before synthesis.

      important

    6. 🔹 D. Time-Bound Feedback
       Impose a milestone deadline: “If I can’t cluster 15 pieces of advice into a draft framework by the end of this week, I must ship an MVO draft anyway.”
       Here, the deadline itself is the feedback resource → it forces closure, telling you that completeness is no longer the metric; progress is.

      important

    1. Evidence-First Approach → once you have a minimal version, you can test if it holds water before investing more. Layered Refinement → each MVO creates a scaffold; if time allows, you can enrich it later.

      need elaboration

    1. Balance Depth vs. Breadth
       Early milestones should prioritize breadth (collect many inputs).
       Later milestones should prioritize depth (synthesize into frameworks).

      need elaboration

    2. Set Sprint-Style Boundaries
       Treat milestones as 1–2 week sprints (like in software dev).
       End each sprint with a “demo” artifact (outline, table, draft).

      need elaboration

    3. Work Backwards from Deadline
       Decide: “I want a working draft in 1 month.”
       Break backwards into weekly checkpoints (Week 1 = Collect, Week 2 = Cluster, Week 3 = Synthesize, Week 4 = Draft).

      need elaboration

    4. Planning everything down to the day months in advance is unrealistic in research. But if you don’t plan at all, you drift. The solution: think in layers of stability.

      important