Have a personal or library account? Click to login
Advancing Large Language Model Agent via Iterative Contrastive Trajectory Optimization Cover

Advancing Large Language Model Agent via Iterative Contrastive Trajectory Optimization

By: Chengang Jing,  Xin Jing and  Kun Li  
Open Access
|Dec 2024

Figures & Tables

Figure 1.

Iterative Contrastive Trajectory Optimization (ICTO) Framework
Iterative Contrastive Trajectory Optimization (ICTO) Framework

Figure 2.

Iterative Learning Progress of ICTO
Iterative Learning Progress of ICTO

Figure 3.

Case Study of WebShop
Case Study of WebShop

Figure 4.

Iterative Learning Progress of ICTO
Iterative Learning Progress of ICTO

Comparison of ICTO and Baseline Performances

MethodWebShopScienceWorldALFWorld
SFT63.170.0%12.5
ETO67.472.3%11.2
IPR68.373.8%10.8
RLCD65.871.5%11.5
NAT66.572.0%11.0
ICTO (ours)70.275.6%9.7

Experimental environment

ComponentDetails
CPUIntel Core i9-10900K
GPUNVIDIA Tesla V100 PCIe 32GB
LLM Agent ModelLlama2-7B Chat
OptimizerAdamW Optimizer
Experiment Management ToolDeepSpeed

Generalization Performance of ICTO on OOD Tasks

MethodWebShopScienceWorldALFWorld
SFT52.360.0%15.0
ETO55.862.0%14.2
IPR57.163.5%13.8
RLCD54.261.0%14.5
NAT56.062.5%14.0
ICTO (ours)59.566.0%12.5

Ablation Study of ICTO Modules

Training SchemeWebShopScienceWorldALFWorld
w/o Contrastive Learning64.267.8%11.6
w/o Behavioral Cloning60.762.5%13.1
Iteration=166.169.2%12.8
Iteration=268.570.6%12.3
Iteration=370.972.3%11.7
Iteration=472.373.1%11.0
Iteration=572.072.8%10.5
Language: English
Page range: 19 - 27
Published on: Dec 31, 2024
Published by: Xi’an Technological University
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2024 Chengang Jing, Xin Jing, Kun Li, published by Xi’an Technological University
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.