Note
(1) Abramson, J., Ahuja, A., Barr, I., Brussee, A., Carnevale, F., Cassin, M., Chhaparia, R., Clark, S., Damoc, B., Dudzik, A. and Georgiev, P., 2020. Imitating interactive intelligence. arXiv preprint arXiv:2012.05672.
(2) Abramson, J., Ahuja, A., Brussee, A., Carnevale, F., Cassin, M., Fischer, F., Georgiev, P., Goldin, A., Harley, T. and Hill, F., 2021. Creating multimodal interactive agents with imitation and self-supervised learning. arXiv preprint arXiv:2112.03763.
(3) Abramson, J., Ahuja, A., Carnevale, F., Georgiev, P., Goldin, A., Hung, A., Landon, J., Lillicrap, T., Muldal, A., Richards, B. and Santoro, A., 2022. Evaluating multimodal interactive agents. arXiv preprint arXiv:2205.13274.
(4) Bai, Y., Jones, A., Ndousse, K., Askell, A., Chen, A., DasSarma, N., Drain, D., Fort, S., Ganguli, D., Henighan, T. and Joseph, N., 2022. Training a helpful and harmless assistant with reinforcement learning from human feedback. arXiv preprint arXiv:2204.05862.
(5) Christiano, P.F., Leike, J., Brown, T., Martic, M., Legg, S. and Amodei, D., 2017. Deep reinforcement learning from human preferences. Advances in Neural Information Processing Systems, 30.