Hypothesis

29 Matching Annotations

Feb 2024
people.ece.ubc.ca

Kleio: A Hybrid Memory Page Scheduler with Machine Intelligence

4
1. savitoj 09 Feb 2024
  
  in Public
  
  The difference between the predicted and actual values is captured through the loss function and back-propagated into the network,
  
  This is the difference between this approach and our delta LSTMs. Our delta LSTMs treat the target as a word in a vocabulary (because of the one-hot encoding), while this approach treats it as a numerical value and uses the difference to compute the loss.
2. savitoj 09 Feb 2024
  
  in Public
  
  Most importantly, they accept top-k predictions at a time, so as to increase the chances of a correct prediction
  
  In my understanding, a higher k means lower accuracy, higher coverage, and more timeliness issues.
3. savitoj 09 Feb 2024
  
  in Public
  
  when the output value space is significantly large (number of different pages), the RNN prediction accuracy tends to be low
  
  This seems to be an issue no matter which ML architecture we deploy.
4. savitoj 09 Feb 2024
  
  in Public
  
  hybrid memory systems (HMS)
  
  What's the difference between an HMS and a disaggregated memory system?
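The contrast drawn in the first annotation above (target as a vocabulary word versus target as a number) can be sketched in a few lines. This is only an illustration under made-up assumptions: the delta vocabulary, the probabilities, and the helper names are hypothetical and come from neither paper. It shows that a classification loss scores every wrong class the same way, while a regression loss grows with the numerical distance from the target.

```python
import math

def cross_entropy(probs, target_index):
    """Classification loss: depends only on the probability given to the true class."""
    return -math.log(probs[target_index])

def squared_error(predicted, actual):
    """Regression loss: grows with the numerical distance from the target."""
    return (predicted - actual) ** 2

# Hypothetical vocabulary of address deltas (assumption for illustration).
vocab = [-2, -1, 1, 2, 64]
true_delta = 2
target_index = vocab.index(true_delta)

# Two wrong classifiers: one favours delta 1 (numerically close),
# the other favours delta 64 (numerically far).
probs_near = [0.05, 0.05, 0.80, 0.05, 0.05]
probs_far = [0.05, 0.05, 0.05, 0.05, 0.80]

ce_near = cross_entropy(probs_near, target_index)
ce_far = cross_entropy(probs_far, target_index)

# The same two guesses seen as numerical predictions.
se_near = squared_error(1, true_delta)
se_far = squared_error(64, true_delta)

print(ce_near, ce_far)  # identical: the vocabulary has no notion of distance
print(se_near, se_far)  # the far guess is penalised much more
```

Under the one-hot view, predicting delta 1 and predicting delta 64 are equally wrong; under the numerical view, they are not.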

Annotators

savitoj

URL

people.ece.ubc.ca/shauryapatel/data/kleio.pdf
Sep 2023
www.usenix.org

atc20-maruf.pdf

6
1. savitoj 26 Sep 2023
  
  in Public
  
  Boyer-Moore
  
  A pattern-searching algorithm for strings. Boyer and Moore also devised a majority vote algorithm, which may be what is meant here.
2. savitoj 26 Sep 2023
  
  in Public
  
  This extra wait-time due to lazy cache eviction policy adds to the overall latency, especially in a high memory pressure scenario
  
  A previous paper we read (The Working Set Model for Program Behavior, Peter J. Denning) suggested that we should not replace pages until we absolutely have to, because aggressively preloading pages can be futile.
3. savitoj 26 Sep 2023
  
  in Public
  
  Data path latencies for two access patterns. Memory disaggregation systems have some constant implementation overheads that cap their minimum latency to around 1 μs
  
  Sequential prefetching is performing worse than regular disk accesses?
4. savitoj 26 Sep 2023
  
  in Public
  
  CDF
  
  What metric is this?
5. savitoj 26 Sep 2023
  
  in Public
  
  Linux ABIs
  
  An ABI (Application Binary Interface) defines how data structures or computational routines are accessed in machine code, which is a low-level, hardware-dependent format.
6. savitoj 21 Sep 2023
  
  in Public
  
  RDMA
  
  Remote Direct Memory Access
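On the CDF question in annotation 4 above: a CDF (cumulative distribution function) is not a metric in itself but a way of presenting one. Each point gives the fraction of samples at or below a given value. A minimal sketch, using made-up latency samples and a hypothetical helper name:

```python
def empirical_cdf(samples):
    """Return (value, fraction of samples <= value) pairs in ascending order.

    Repeated values produce one point each; the last of them carries
    the true cumulative fraction for that value.
    """
    ordered = sorted(samples)
    n = len(ordered)
    return [(value, (i + 1) / n) for i, value in enumerate(ordered)]

# Made-up latencies in microseconds (assumption for illustration).
latencies_us = [1.2, 0.9, 4.0, 1.1, 30.0]
cdf = empirical_cdf(latencies_us)

for value, fraction in cdf:
    print(f"{fraction:.0%} of accesses finished within {value} us")
```

Read this way, a curve that rises early means most accesses are fast, and a long flat tail means a few are very slow.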

Annotators

savitoj

URL

usenix.org/system/files/atc20-maruf.pdf
www.micahlerner.com

Towards an Adaptable Systems Architecture for Memory Tiering at Warehouse-Scale

4
1. savitoj 13 Sep 2023
  
  in Public
  
  NUMA nodes
  
  Non-uniform memory access is a computer memory design used in multiprocessing, where the memory access time depends on the memory location relative to the processor. Under NUMA, a processor can access its own local memory faster than non-local memory.
2. savitoj 13 Sep 2023
  
  in Public
  
  zswap
  
  compressed write-back cache for swapped pages
3. savitoj 13 Sep 2023
  
  in Public
  
  A/B testing methodology
  
  A/B testing (also known as split testing or bucket testing) is a methodology for comparing two versions of a webpage or app against each other to determine which one performs better.
4. savitoj 13 Sep 2023
  
  in Public
  
  multi-tenant
  
  shared by multiple users and/or workloads which are referred to as "tenants"

Annotators

savitoj

URL

micahlerner.com/assets/pdf/adaptable.pdf
dl.acm.org

Towards an Adaptable Systems Architecture for Memory Tiering at Warehouse-Scale

3
1. savitoj 12 Sep 2023
  
  in Public
  
  A/B testing methodology
  
  A/B testing (also known as split testing or bucket testing) is a methodology for comparing two versions of a webpage or app against each other to determine which one performs better.
2. savitoj 12 Sep 2023
  
  in Public
  
  multi-tenant
  
  shared by multiple users and/or workloads which are referred to as "tenants"
3. savitoj 12 Sep 2023
  
  in Public
  
  zswap
  
  compressed write-back cache for swapped pages

Annotators

savitoj

URL

dl.acm.org/doi/pdf/10.1145/3582016.3582031
Jul 2023
people.ece.ubc.ca

HoPP-HPCA23.pdf

2
1. savitoj 05 Jul 2023
  
  in Public
  
  in-LLC accesses.
  
  Are these not useful at all in making prefetching predictions (down the line, when they get evicted)?
2. savitoj 04 Jul 2023
  
  in Public
  
  interference pages that do not belong to any page stream
  
  What does it mean to not belong to any page stream?

Annotators

savitoj

URL

people.ece.ubc.ca/~sasha/TMP/HoPP-HPCA23.pdf
Jun 2023
www.cs.utexas.edu

A Hierarchical Neural Model of Data Prefetching

2
1. savitoj 21 Jun 2023
  
  in Public
  
  compulsory misses
  
  Compulsory miss: a miss caused by the first access to a block
2. savitoj 21 Jun 2023
  
  in Public
  
  the number of experts would equal to the number of pages
  
  This would mean no generality in the model, i.e., overfitting.
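The compulsory-miss definition in the first annotation above can be made concrete by counting first-touch accesses in an address trace. A minimal sketch: the trace, the 64-byte block size, and the function name are assumptions for illustration, not taken from the paper.

```python
def count_compulsory_misses(addresses, block_size=64):
    """Count accesses that touch a block for the first time.

    These miss in any cache, however large, because the block
    has never been brought in before.
    """
    seen_blocks = set()
    misses = 0
    for addr in addresses:
        block = addr // block_size
        if block not in seen_blocks:
            seen_blocks.add(block)
            misses += 1
    return misses

# Made-up byte addresses; they fall into blocks 0, 0, 1, 0, 2, 1.
trace = [0, 8, 64, 0, 128, 70]
compulsory = count_compulsory_misses(trace)
print(compulsory)
```

Only a prefetcher, not a bigger cache, can remove these misses, which is why they matter for the prefetching papers collected here.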

Annotators

savitoj

URL

cs.utexas.edu/~lin/papers/asplos21.pdf
dl.acm.org

Classifying Memory Access Patterns for Prefetching

4
1. savitoj 07 Jun 2023
  
  in Public
  
  We can do this by following the pushes and pops of the stack through the dataflow graph showing that they are balanced between subsequent executions of the graph kernel.
  
  So, let's say our next 10 instructions are 5 POPs and 5 PUSHes. Can we then avoid changing the stack pointer after every instruction?
2. savitoj 07 Jun 2023
  
  in Public
  
  If the memory latency given by the load chain is higher than the independent work executed between subsequent delinquent loads, hiding the memory latency is impossible, even with infinite run-ahead.
  
  Why would this be a problem if we have an infinite run-ahead (which I am assuming means we can look far ahead into the pattern and know what needs to be fetched)?
3. savitoj 07 Jun 2023
  
  in Public
  
  WSC
  
  Washington Systems Center? More likely warehouse-scale computer, given the context.
4. savitoj 07 Jun 2023
  
  in Public
  
  aggressiveness
  
  Is aggressiveness the frequency with which we prefetch?

Annotators

savitoj

URL

dl.acm.org/doi/pdf/10.1145/3373376.3378498
proceedings.mlr.press

Learning Memory Access Patterns

4
1. savitoj 01 Jun 2023
  
  in Public
  
  Finally, dealing with rarely occurring deltas is non-trivial.
  
  I imagine the cache misses caused by rarely occurring deltas won't be that expensive because of their rarity. Why would this case be non-trivial then?
  
  PS: I understand the concern about rare words in NLP, but I imagine that for prefetching we won't need accuracies that high and can thus neglect the rare ones.
2. savitoj 01 Jun 2023
  
  in Public
  
  we need high resolution in every area where addresses are used
  
  We need high resolution because we cannot prefetch large chunks of memory, so we can't use this quantization approach.
3. savitoj 01 Jun 2023
  
  in Public
  
  unimodal regression
  
  Is multimodal regression problematic? Possibly because of the sparse addressing and loss of information?
4. savitoj 01 Jun 2023
  
  in Public
  
  Stride prefetchers
  
  These observe the strides between successive memory accesses and predict that the same pattern will continue in the future.
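The stride-prefetcher description above can be sketched as a small function. This is a simplified illustration, not the design from the paper: the two-stride confirmation rule, the `degree` parameter, and the function name are assumptions.

```python
def stride_predict(history, degree=2):
    """Predict the next `degree` addresses if the last two strides agree.

    Returns an empty list when there is too little history, when the
    strides differ, or when the stride is zero (nothing new to fetch).
    """
    if len(history) < 3:
        return []
    last_stride = history[-1] - history[-2]
    prev_stride = history[-2] - history[-3]
    if last_stride != prev_stride or last_stride == 0:
        return []
    return [history[-1] + last_stride * i for i in range(1, degree + 1)]

# Regular accesses with a stride of 64 bytes: the pattern is extended.
regular = stride_predict([100, 164, 228])

# Irregular accesses: no stable stride, so nothing is prefetched.
irregular = stride_predict([100, 164, 300])

print(regular, irregular)
```

Raising `degree` prefetches further ahead, which is one plausible reading of the "aggressiveness" asked about in an earlier annotation.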

Annotators

savitoj

URL

proceedings.mlr.press/v80/hashemi18a/hashemi18a.pdf
