Hypothesis

32 Matching Annotations

Sep 2022
arxiv.org arxiv.org

Compressing Pre-trained Models of Code into 3 MBCompressing Pre-trained Models of Code into 3 MB

1
1. sunzhensu 19 Sep 2022
  
  in Public
  
  To tackle the search problem, two main challenges need to be ad-dressed. The first challenge is the huge search space with numerousplausible combinations. A model stacks many layers, each of whichcontains different numbers of parameters. Small changes to anyelement of the architecture may result in a new neural network thatcould produce largely different performance even when trainedon the same dataset. Model developers usually put laborious engi-neering effort into finding an appropriate architecture for the tinymodel, which is time-consuming and computing resource-hungry.The second challenge is that the objective of this search problem,i.e., the performance of the tiny model after distillation, is veryexpensive to compute. It is impractical and infeasible to train andevaluate each model we find in the searching process. Therefore,an easy-to-compute and effective predictive metric is desired to betailored for this difficult search problem
  
  estimator可以参考的表述
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2208.07120.pdf
arxiv.org arxiv.org

2102.04906.pdf

1
1. sunzhensu 06 Sep 2022
  
  in Public
  
  Networks with dynamic architectures not only saveredundant computation for canonical (”easy”) samples, butalso preserve their representation power when recognizingnon-canonical (”hard”) sampl
  
  Motivation of early stop
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2102.04906.pdf
Jul 2022
arxiv.org arxiv.org

A Hazard Analysis Framework for Code Synthesis Large Language ModelsA Hazard Analysis Framework for Code Synthesis Large Language Models

1
1. sunzhensu 29 Jul 2022
  
  in Public
  
  For example, one consequential wordis often the difference between Codex producing correct or incorrect results. Other factors such as:• the context of existing code by a user,• defined function and variable names,• existing comments and documentation by a user,• training data distribution, and• conciseness and length of prompt,
  
  codex 表现制约因素
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2207.14157.pdf
arxiv.org arxiv.org

2207.08466.pdf

1
1. sunzhensu 22 Jul 2022
  
  in Public
  
  It stands to reason that thelower layers have the most information about linear word order, and the higher layers may have moreinformation about semantic knowledge and task-specific knowledge.
  
  transformer不同层之间的功效
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2207.08466.pdf
arxiv.org arxiv.org

Attention: Not Just Another Dataset for Patch-Correctness CheckingAttention: Not Just Another Dataset for Patch-Correctness Checking

1
1. sunzhensu 15 Jul 2022
  
  in Public
  
  visit all state-of-the-art PCC techniques
  
  改进evaluate方法然后revisit是个蛮不错的思路
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2207.06590.pdf
arxiv.org arxiv.org

Grounded Copilot: How Programmers Interact with Code-Generating ModelsGrounded Copilot: How Programmers Interact with Code-Generating Models

3
1. sunzhensu 14 Jul 2022
  
  in Public
  
  我发现其实读用户评论也是个很有价值的渠道，用来挖掘新idea
2. sunzhensu 14 Jul 2022
  
  in Public
  
  Further, in our interviews, multiple people described their usual commenting workflow as beingpost-hoc: they add comments after completing code. So, comments put in before completing thecode is out-of-place for these participants
  
  copilot将先code后comment的模式变为先comment后code
3. sunzhensu 14 Jul 2022
  
  in Public
  
  Comment cleaning. Cleaning up their comments after completing a Copilot interaction was acommon occurrence. Many participants, P3, P4, P7, and, P8 would repeatedly delete comments thatwere meant for Copilot. P19 said that cleaning up comments written for Copilot is essential:
  
  这是个很有趣的点，但就是太小了，不太值得发论文
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2206.15000.pdf
arxiv.org arxiv.org

2108.07732.pdf

2
1. sunzhensu 12 Jul 2022
  
  in Public
  
  Table 4: Qualitative analysis of highest- and lowest-performing problems
  
  对评价结果的定性分析
2. sunzhensu 12 Jul 2022
  
  in Public
  
  Synthesis Performance Correlates Poorly with BLEU Score
  
  BLEU不是个好指标的论据
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2108.07732.pdf
arxiv.org arxiv.org

2207.04285.pdf

1
1. sunzhensu 12 Jul 2022
  
  in Public
  
  TABLE 1: Semantic-preserving code transformation in our experiment
  
  很全的SPT
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2207.04285.pdf
arxiv.org arxiv.org

GitHub Copilot AI pair programmer: Asset or Liability?

1
1. sunzhensu 08 Jul 2022
  
  in Public
  
  Copilot has difficulty understandingsome requirements in the description of tasks
  
  copilot 难于实现某些问题
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2206.15331.pdf
tianyi-zhang.github.io tianyi-zhang.github.io

Expectation vs. Experience: Evaluating the Usability of Code Generation Tools Powered by Large Language ModelsExpectation vs. Experience: Evaluating the Usability of Code Generation Tools Powered by Large Language Models

1
1. sunzhensu 07 Jul 2022
  
  in Public
  
  One way to help users understand the generated code is to provideexplanations using inline comments.
  
  这也许是个可行的idea
Visit annotations in context

Annotators

sunzhensu

URL

tianyi-zhang.github.io/files/chi2022-lbw-copilot.pdf
arxiv.org arxiv.org

An Exploratory Study on Regression VulnerabilitiesAn Exploratory Study on Regression Vulnerabilities

3
1. sunzhensu 06 Jul 2022
  
  in Public
  
  Table 1: Categories of the regression vulnerabilities accord-ing to the Common Weakness Enumeration (CWE)
  
  bug分类
2. sunzhensu 06 Jul 2022
  
  in Public
  
  Security was rarely a concern among the com-ments in the issue reports of bugs whose fixing introducedvulnerability regressions.
  
  fix的安全问题不被concern
3. sunzhensu 06 Jul 2022
  
  in Public
  
  For each interview participant, we donated 30 USD tothe Mozilla Foundation or a charity chosen by the interviewee as atoken of appreciation for their time and effort.
  
  招募participant
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2207.01942.pdf
arxiv.org arxiv.org

2012.12324.pdf

1
1. sunzhensu 05 Jul 2022
  
  in Public
  
  Cyclomatic complexity [4] is the most widely used com-plexity metric. McCabe computed the complexity usingv(G) = e − n + 2 where e and n refer to number of edges andnodes in a control flow graph
  
  Cyclomatic complexity
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2012.12324.pdf
Jun 2022
arxiv.org arxiv.org

Using Pre-Trained Models to Boost Code Review AutomationUsing Pre-Trained Models to Boost Code Review Automation

1
1. sunzhensu 30 Jun 2022
  
  in Public
  
  Figure 2: Examples of perfect and alternative predictions
  
  论文里放example的实例
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2201.06850.pdf
arxiv.org arxiv.org

2206.13690.pdf

2
1. sunzhensu 29 Jun 2022
  
  in Public
  
  Below, we briefly describe all the SRS datasets used in the numerical study.
  
  需求文档数据集
2. sunzhensu 29 Jun 2022
  
  in Public
  
  SRS docu-ments describe the functionality and expected performance for software products, naturally affecting all the subsequent phasesin the process. The requirement set defined in SRS documents are analyzed and refined in the design phase, which results invarious design documents. Then, the developers proceed with these documents to build the code for the software system3.
  
  需求文档的表述
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2206.13690.pdf
arxiv.org arxiv.org

NatGen: Generative pre-training by ``Naturalizing'' source codeNatGen: Generative pre-training by ``Naturalizing'' source code

1
1. sunzhensu 22 Jun 2022
  
  in Public
  
  Applying Transformation. Assume a set of transformationrules Φ = {𝜙1, 𝜙2, 𝜙3, ...}. Given original code 𝑐𝑖 , 𝜙 𝑗 (𝑐𝑖 ) transformsthe code, changing the structure while preserving semantics. Fig-ure 3 shows how to apply such transformation to 𝑐𝑖 . It works inthree steps:• Find Transformation Location. Given a piece of source code (𝑐𝑖 ),we first use tree-sitter3 to parse out the AST (𝑇𝑐𝑖 ). From theAST, we extract potential locations for de-naturalization. Theselocations are nodes (𝑛𝑘 ) in 𝑇𝑐𝑖 . While choosing location 𝑛𝑘 from𝑇𝑐𝑖 , we consult Φ – we extract the nodes where at least one of𝜙 𝑗 ∈ Φ is applicable.• Select Transformation Rule. Once we have a set of such nodes,we filter out the transformation rules that cannot be appliedto any node of in 𝑇𝑐𝑖 . After such a filtration, we have a set oftransformations Φ𝑎 ⊆ Φ. At this stage, we randomly select onetransformation pattern 𝜙 𝑗 ∈ Φ𝑎 to apply at an application loca-tion (AST node) 𝑛𝑘 .• Apply Transformation. We apply 𝜙 𝑗 to 𝑛𝑘 to get the transformednode 𝑛′𝑘 . We then structurally match 𝑛′𝑘 with the original AST𝑇𝑐𝑖 , specifically 𝑛𝑘 . We adapt the context of 𝑛𝑘 to the transformednode’s (𝑛′𝑘 ) context. In that way, we get the transformed AST(𝑇 ′𝑐𝑖 ), which we then translate to get the transformed code 𝑐 ′𝑖 .We designed the transformation function 𝜙 𝑗 and subsequentcontext adaptation in such a way that preserves the meaning orfunctionality of the original code. We use AST analysis and (ap-proximated) data flow analysis on code AST
  
  SPT的应用表述
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2206.07585.pdf
arxiv.org arxiv.org

Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LMEfficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM

1
1. sunzhensu 10 Jun 2022
  
  in Public
  
  the high numberof compute operations required can result in unrealistically longtraining times (e.g., training GPT-3 with 175 billion parameters [11 ]would require approximately 288 years with a single V100 NVIDIAGPU).
  
  GPT3训练成本
  
  GreenAI
Visit annotations in context

Tags

GreenAI

Annotators

sunzhensu

URL

arxiv.org/pdf/2104.04473
www.semanticscholar.org www.semanticscholar.org

Semantic Scholar

1
1. sunzhensu 10 Jun 2022
  
  in Public
  
  As a concrete measure, we suggest reporting the total number of floating point operations (FPO) required togenerate a result.13 FPO provides an estimate to the amount of work performed by a computational process. It iscomputed analytically by defining a cost to two base operations, ADD and MUL. Based on these operations, the FPOcost of any machine learning abstract operation (e.g., a tanh operation, a matrix multiplication, a convolution operation,or the BERT model) can be computed as a recursive function of these two operations. FPO has been used in the pastto quantify the energy footprint of a model [26, 42, 12, 41], but is not widely adopted in AI
  
  FLOPs的介绍
  
  GreenAI
Visit annotations in context

Tags

GreenAI

Annotators

sunzhensu

URL

semanticscholar.org/reader/fb73b93de3734a996829caf31e4310e0054e9c6b
arxiv.org arxiv.org

2104.10350.pdf

1
1. sunzhensu 06 Jun 2022
  
  in Public
  
  For example, NVIDIA estimated that 80–90% of the ML workload is inference processing [Leo19]. Similarly,Amazon Web services claimed that 90% of the ML demand in the cloud is for inference [Bar19].
  
  inference整体能耗论据
  
  GreenAI
Visit annotations in context

Tags

GreenAI

Annotators

sunzhensu

URL

arxiv.org/ftp/arxiv/papers/2104/2104.10350.pdf
ar5iv.labs.arxiv.org ar5iv.labs.arxiv.org

Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training

1
1. sunzhensu 01 Jun 2022
  
  in Public
  
  A clear recent trend in the AI community is that models are getting significantly larger. It only took 3 months to shift the title of the largest model from BERT-Large to GPT-2 (Radford et al. 2019) in 2020 while the number of parameters of GPT-2 is around 5 times larger than that of BERT-Large. Moreover, GPT-2 further evolves into GPT-3 (Brown et al. 2020) with 175 Billion parameters. More recently, GLM (Du et al. 2021) has clinched the title with surprisingly 1.75 Trillion parameters. These large models consume more data and have better performance than their smaller counterparts
  
  AI模型不断变大的发展趋势
  
  GreenAI
Visit annotations in context

Tags

GreenAI

Annotators

sunzhensu

URL

ar5iv.labs.arxiv.org/html/2110.14883
May 2022
arxiv.org arxiv.org

2205.13522.pdf

1
1. sunzhensu 31 May 2022
  
  in Public
  
  Code Abstraction
  
  代码抽象工具和表述
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2205.13522.pdf
dl.acm.org dl.acm.org

TOSEM2601-03

1
1. sunzhensu 26 May 2022
  
  in Public
  
  Table VII. Classification of the Quality of Candidate LinkLists Produced by Automated Methods
  
  古人使用的标准来划分query质量
Visit annotations in context

Annotators

sunzhensu

URL

dl.acm.org/doi/pdf/10.1145/3078841
arxiv.org arxiv.org

Untitled document

2
1. sunzhensu 25 May 2022
  
  in Public
  
  Categorization of the 18 SE tasks to which CodePTMs have been applied.
  
  代码任务的种类汇总，还是比较全的
2. sunzhensu 25 May 2022
  
  in Public
  
  Forinstance, source code is not as homogeneous as NL: it is com-posed of both the code in a function body, which is written inprogramming language (PL), as well as optional commentswritten in NL
  
  代码的异质性
Visit annotations in context

Annotators

sunzhensu

URL

arxiv.org/pdf/2205.11739.pdf
www.semanticscholar.org www.semanticscholar.org

Multi-task Learning based Pre-trained Language Model for Code Completion

1
1. sunzhensu 19 May 2022
  
  in Public
  
  Python programs (typically a single function), and evaluates overall functional accuracy(pass rate) across examples using several test cases for each program
  
  可以用test case来测试多行代码生成的准确率
Visit annotations in context

Annotators

sunzhensu

URL

semanticscholar.org/reader/a8fc183c089bd596ccc48b3d666f8814e1b41e55
tianyi-zhang.github.io tianyi-zhang.github.io

Expectation vs. Experience: Evaluating the Usability of Code Generation Tools Powered by Large Language ModelsExpectation vs. Experience: Evaluating the Usability of Code Generation Tools Powered by Large Language Models

1
1. sunzhensu 19 May 2022
  
  in Public
  
  Table 1: Individual and average task completion times. Cells with an orange cell background indicate that the participant neversucceeded because they were stopped after approximately 20 minutes of trying. DNF implies the participant did not finish ontime.
  
  低质量的suggestion反而会降低开发效率
Visit annotations in context

Annotators

sunzhensu

URL

tianyi-zhang.github.io/files/chi2022-lbw-copilot.pdf
www.semanticscholar.org www.semanticscholar.org

Semantic Scholar

1
1. sunzhensu 18 May 2022
  
  in Public
  
  Both the ethical and security problems of DL code models mani-fest an emerging appeal from the open-source community: To es-tablish an effective protection mechanism against the unau-thorized usage of their open-source code in deep learningtasks
  
  Motivations of this paper
Visit annotations in context

Annotators

sunzhensu

URL

semanticscholar.org/reader/02183e69f1dfd6e9b2d0fb876153299bab4bb82b

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL