Making Parsed Source Code Data Available Externally #314

daomcgill · 2024-10-10T02:05:38Z

Purpose

This issue is an extension of issue #313. The purpose here is to create configurable /exec scripts that make data tables available externally. The new scripts will add usability to the syntax extraction process by providing a usable way to perform source code annotations and XML querying.

Process

Create script for annotating source code using srcML.
Create script for querying the annotated data. This will accept a predefined query or a user-defined XPath query.
Documentation

New Scripts

/exec/syntaxextractor.R: Script for running the syntax extractor using existing functions in R/src.R. The functionality for this is split into two parts:

Annotation: Takes in a source code folder and uses srcML to generate an annotated XML file.
Querying: Accept predefined XPath queries to extract syntactic elements from the XML files. Allows custom XPath queries to be specified by the user. Outputs the query results.

Task List

Prerequisite: completion of issue Expanding the Syntax Extractor #313
Create a new script in /exec
Implement functionality for annotation
Implement functionality for queries
Documentation: explain how to use exec scripts, configuration and parameters

daomcgill · 2024-10-10T02:05:48Z

@carlosparadis part II

carlosparadis · 2024-10-10T02:24:24Z

@daomcgill For this one I would consider making two execs, one that annotates, and the other that can query the file. Annotating can take a long time depending on the size of the project, hence the split.

Otherwise, I think this is good! We can take another pass once #313 is done.

Thanks!

carlosparadis assigned daomcgill Oct 10, 2024

daomcgill mentioned this issue Oct 10, 2024

File representation as commit and issue messages #316

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Making Parsed Source Code Data Available Externally #314

Making Parsed Source Code Data Available Externally #314

daomcgill commented Oct 10, 2024

daomcgill commented Oct 10, 2024

carlosparadis commented Oct 10, 2024

Making Parsed Source Code Data Available Externally #314

Making Parsed Source Code Data Available Externally #314

Comments

daomcgill commented Oct 10, 2024

Purpose

Process

New Scripts

Task List

daomcgill commented Oct 10, 2024

carlosparadis commented Oct 10, 2024