Hi,
Thanks for releasing such a high-quality and large-scale dataset. For the Two-Agent Task Completion (TATC) benchmark, the paper proposes a rule-based approach for both agents as the baseline. Do you have the plan to release the implementations of such rule-based agents? It would be very helpful to work on this benchmark with such baseline implementations.
Thanks!
Hi,
Thanks for releasing such a high-quality and large-scale dataset. For the Two-Agent Task Completion (TATC) benchmark, the paper proposes a rule-based approach for both agents as the baseline. Do you have the plan to release the implementations of such rule-based agents? It would be very helpful to work on this benchmark with such baseline implementations.
Thanks!