Engineering a Scalable Laboratory Infrastructure for Assembly Language Scaffolding: Design and Deployment of a Locally Optimized GenAI Assistant for the CODE-2 Educational Architecture

García Crespí, Federico

Please use this identifier to cite or link to this item: https://hdl.handle.net/11000/39810

Engineering a Scalable Laboratory Infrastructure for Assembly Language Scaffolding: Design and Deployment of a Locally Optimized GenAI Assistant for the CODE-2 Educational Architecture

Ver/Abrir:
caee_paper_umh.pdf

187,83 kB
Adobe PDF
Compartir:

Title:
Engineering a Scalable Laboratory Infrastructure for Assembly Language Scaffolding: Design and Deployment of a Locally Optimized GenAI Assistant for the CODE-2 Educational Architecture

Authors:
García Crespí, Federico

Editor:
Universidad Miguel Hernández de Elche

Department:
Departamentos de la UMH::Ingeniería de Computadores

Issue Date:
2026

URI:
https://hdl.handle.net/11000/39810

Abstract:
The transition from high-level programming to assembly language constitutes a welldocumented pedagogical bottleneck in computer engineering curricula, particularly in largecohort laboratory settings where individualized scaffolding cannot scale. This paper presents the design, implementation, and technical evaluation of a locally deployable generative AI assistant engineered specifically for the CODE-2 educational processor architecture. The system is intended as laboratory infrastructure, not as a replacement for human instruction; its primary contribution is enabling scalable, privacy-preserving syntax scaffolding without dependency on cloud services or internet connectivity. A synthetic task bank of 50,000 instruction pairs was procedurally generated to cover the full CODE-2 curriculum. Three fine-tuning strategies were compared on a consumer GPU: Prompt Tuning, Low-Rank Adaptation (LoRA), and Full Fine-Tuning of a T5-Small encoder-decoder model. Full Fine- Tuning achieved 94.10% Exact Match on the held-out evaluation set, demonstrating that rigid assembly syntax requires full parameter adaptation. Post-training INT8 quantization via ONNX Runtime reduced inference latency by 69% (from 1,689 ms to 526 ms) on standard laboratory hardware (Intel i5, 8 GB RAM), with a precision loss below 1%. The resulting system operates entirely offline, precluding data exfiltration by design. The system is integrated into laboratory workflows as a supervised scaffolding tool, requiring mandatory emulator-based verification of all AI-generated code. Pedagogical implications are discussed as plausible benefits; no controlled learning-gains study is reported. The work demonstrates a replicable pipeline for building domain-specific language model infrastructure tailored to CPU-only educational environments.

Keywords/Subjects:
engineering education
assembly programming
fine-tuning
quantization
laboratory infrastructure

Knowledge area:
CDU: Ciencias aplicadas: Ingeniería. Tecnología

Type of document:
info:eu-repo/semantics/article

Access rights:
info:eu-repo/semantics/openAccess
Attribution-NonCommercial-NoDerivatives 4.0 Internacional

DOI:
https://doi.org/10.1002/cae.70196

Published in:
Computer Applications in Engineering Education

Aparece en las colecciones:
Artículos Ingeniería de computadores

Mostrar el registro Dublin Core completo del ítem Ver estadísticas

La licencia se describe como: Atribución-NonComercial-NoDerivada 4.0 Internacional.