Engineering a Scalable Laboratory Infrastructure for Assembly Language Scaffolding: Design and Deployment of a Locally Optimized GenAI Assistant for the CODE-2 Educational Architecture

García Crespí, Federico

Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/11000/39810

Engineering a Scalable Laboratory Infrastructure for Assembly Language Scaffolding: Design and Deployment of a Locally Optimized GenAI Assistant for the CODE-2 Educational Architecture

Ver/Abrir:
caee_paper_umh.pdf

187,83 kB
Adobe PDF
Compartir:

Título :
Engineering a Scalable Laboratory Infrastructure for Assembly Language Scaffolding: Design and Deployment of a Locally Optimized GenAI Assistant for the CODE-2 Educational Architecture

Autor :
García Crespí, Federico

Editor :
Universidad Miguel Hernández de Elche

Departamento:
Departamentos de la UMH::Ingeniería de Computadores

Fecha de publicación:
2026

URI :
https://hdl.handle.net/11000/39810

Resumen :
The transition from high-level programming to assembly language constitutes a welldocumented pedagogical bottleneck in computer engineering curricula, particularly in largecohort laboratory settings where individualized scaffolding cannot scale. This paper presents the design, implementation, and technical evaluation of a locally deployable generative AI assistant engineered specifically for the CODE-2 educational processor architecture. The system is intended as laboratory infrastructure, not as a replacement for human instruction; its primary contribution is enabling scalable, privacy-preserving syntax scaffolding without dependency on cloud services or internet connectivity. A synthetic task bank of 50,000 instruction pairs was procedurally generated to cover the full CODE-2 curriculum. Three fine-tuning strategies were compared on a consumer GPU: Prompt Tuning, Low-Rank Adaptation (LoRA), and Full Fine-Tuning of a T5-Small encoder-decoder model. Full Fine- Tuning achieved 94.10% Exact Match on the held-out evaluation set, demonstrating that rigid assembly syntax requires full parameter adaptation. Post-training INT8 quantization via ONNX Runtime reduced inference latency by 69% (from 1,689 ms to 526 ms) on standard laboratory hardware (Intel i5, 8 GB RAM), with a precision loss below 1%. The resulting system operates entirely offline, precluding data exfiltration by design. The system is integrated into laboratory workflows as a supervised scaffolding tool, requiring mandatory emulator-based verification of all AI-generated code. Pedagogical implications are discussed as plausible benefits; no controlled learning-gains study is reported. The work demonstrates a replicable pipeline for building domain-specific language model infrastructure tailored to CPU-only educational environments.

Palabras clave/Materias:
engineering education
assembly programming
fine-tuning
quantization
laboratory infrastructure

Área de conocimiento :
CDU: Ciencias aplicadas: Ingeniería. Tecnología

Tipo de documento :
info:eu-repo/semantics/article

Derechos de acceso:
info:eu-repo/semantics/openAccess
Attribution-NonCommercial-NoDerivatives 4.0 Internacional

DOI :
https://doi.org/10.1002/cae.70196

Publicado en:
Computer Applications in Engineering Education

Aparece en las colecciones:
Artículos Ingeniería de computadores

Mostrar el registro Dublin Core completo del ítem Ver estadísticas

La licencia se describe como: Atribución-NonComercial-NoDerivada 4.0 Internacional.