OpenEvolve
- Since evaluator should make use of full system resources, we may need to:
- Run only one island
- Have a global locking mechanism, but anything running in parallel would still be consuming system resources, thus contributing noise and leading to inaccurate evaluation
- If the host system has more resources than target system, then run evaluation in cgroup
- Run evaluation remotely
- Even if we run in one island, there seems to be noise (might be due to codex waiting for command to complete), and the codex measured evaluation results and manually measured evaluation results are different
This post is licensed under CC BY 4.0 by the author.



