We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SE

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Software Engineering

Title: Code2Snapshot: Using Code Snapshots for Learning Representations of Source Code

Abstract: There are several approaches to encode source code in the input vectors of neural models. These approaches attempt to include various syntactic and semantic features of input programs in their encoding. In this paper, we investigate Code2Snapshot, a novel representation of the source code that is based on the snapshots of input programs. We evaluate several variations of this representation and compare its performance with state-of-the-art representations that utilize the rich syntactic and semantic features of input programs. Our preliminary study on the utility of Code2Snapshot in the code summarization task suggests that simple snapshots of input programs have comparable performance to the state-of-the-art representations. Interestingly, obscuring the input programs have insignificant impacts on the Code2Snapshot performance, suggesting that, for some tasks, neural models may provide high performance by relying merely on the structure of input programs.
Comments: 8 pages, 2 figures, 1 table
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG); Programming Languages (cs.PL)
Cite as: arXiv:2111.01097 [cs.SE]
  (or arXiv:2111.01097v2 [cs.SE] for this version)

Submission history

From: Md Rafiqul Islam Rabin [view email]
[v1] Mon, 1 Nov 2021 17:07:02 GMT (184kb,D)
[v2] Fri, 24 Dec 2021 20:14:42 GMT (417kb,D)

Link back to: arXiv, form interface, contact.