Automating Simulations

Setting up a simulation for materials science and engineering requires not only to understand the employed simulation technique but also proficient knowledge about the file format used to configure the simulation run. Large Language Models (LLMs) can provide support for that.

As part of the FULL-MAP project, co-researchers to support the simulation setup will be developed. The considered simulation tools are DAMASK and LAMMPS.

Questions

aside from fine-tuning a code LLM, what could be other techniques:
- RAG - less expensive.
if fine-tuning then dataset generation?
- we have the intended outputs from the simulation software examples
- we dont have the natural language description of the scripts for example: “write an lammps script to compute the melting point of Copper” - stuff like this is missing // prompts (problem description) to output (input scripts) mapping is missing
- we also do not have any dataset available for DAMASK - we have to create that
benchmarking?
- performance of vanilla llm vs llm + RAG vs llm vs fine-tuned llm from custom dataset

Automating Simulations

Questions

Technical Aspects

MCP

ollama

Training strategies

Further links