Run the baseline:
PYTHONPATH=. python experiments/run_mmlu.py --mode=DirectAnswer

Run fully-connected swarm ablation:
PYTHONPATH=. python experiments/run_mmlu.py --num-truthful-agents=3 --mode=FullConnectedSwarm

Run randomly-connected swarm ablation:
PYTHONPATH=. python experiments/run_mmlu.py --num-truthful-agents=3 --mode=RandomSwarm

Run the main experiment with optimization and eventual evaluation:
PYTHONPATH=. python experiments/run_mmlu.py --num-truthful-agents=3 --mode=OptimizedSwarm

Run the REINFORCE algorithm for edge optimization with three agents as described in the paper.
PYTHONPATH=. python experiments/run_crosswords.py

Run node optimization that improves the demonstration examples of each node.
PYTHONPATH=. python experiments/run_humaneval.py --learn_demonstration True

Run the general assistant tasks.
PYTHONPATH=. python experiments/run_gaia.py
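
The four MMLU runs above differ only in their mode and in whether an agent count is passed, so they can be driven from one loop. The following is a minimal sketch, not part of the repository: the `run_mmlu` helper and the `DRY_RUN` variable are hypothetical names introduced here for illustration. By default it only prints each command; set `DRY_RUN=0` to actually execute them from the repository root.

```shell
#!/usr/bin/env bash
# Sketch: run the four MMLU modes listed above in sequence.
# DRY_RUN and run_mmlu are hypothetical helpers, not part of the repository.
set -euo pipefail

# 1 (default) prints each command; 0 executes it.
DRY_RUN="${DRY_RUN:-1}"

run_mmlu() {
  local mode="$1"
  local args=("--mode=${mode}")
  # The baseline command takes no agent count; the swarm modes use three agents.
  if [ "${mode}" != "DirectAnswer" ]; then
    args=("--num-truthful-agents=3" "${args[@]}")
  fi
  if [ "${DRY_RUN}" = "1" ]; then
    echo "PYTHONPATH=. python experiments/run_mmlu.py ${args[*]}"
  else
    PYTHONPATH=. python experiments/run_mmlu.py "${args[@]}"
  fi
}

for mode in DirectAnswer FullConnectedSwarm RandomSwarm OptimizedSwarm; do
  run_mmlu "${mode}"
done
```

Because of `set -e`, a failing run stops the loop, so later modes are not started after an error.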