NeurIPS 2022: Controllable Text Generation with Neurally-Decomposed Oracle

Related Works

InsNet-v2: The GPT Moment for Insertion-based Language models
ICML 2024: NADOv2: Improved Training and Low-Rank Adaptation of Neurally-Decomposed Oracles for Controlling Language Models
ICML 2024: Open-Domain Text Evaluation via Contrastive Distribution Methods
NeurIPS 2022: Controllable Text Generation with Neurally-Decomposed Oracle
NeurIPS 2022: InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model
ICML 2019: Neurally-Guided Structure Inference
ICML 2019: CoT: Cooperative Training for Generative Modeling of Discrete Data
IJCAI-2018: Neural Text Generation: Past, Present and Beyond
SIGIR-2018: Texygen: A Benchmarking Platform for Text Generation Models
AAAI-2018: Long Text Generation via Adversarial Training with Leaked Information

Abstract

We propose a general and efficient framework to control auto-regressive generation models with NeurAlly-Decomposed Oracle (NADO). Given a pre-trained base language model and a sequence-level boolean oracle function, we aim to decompose the oracle function into token-level guidance to steer the base model in text generation. Specifically, the token-level guidance is provided by NADO, a neural model trained with examples sampled from the base model, requiring no additional auxiliary labeled data. Based on posterior regularization, we present the closed-form optimal solution for incorporating the decomposed token-level guidance into the base model for controllable generation. We further discuss how the neural approximation affects the quality of the solution. Experiments on two different applications, (1) text generation with lexical constraints and (2) machine translation with formality control, demonstrate that our framework efficiently guides the base model towards the given oracle while maintaining high generation quality.
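
To make the idea of decomposing a sequence-level oracle into token-level guidance concrete, the following is a minimal toy sketch, not the paper's implementation. It assumes a three-token vocabulary, a fixed sequence length, a hand-written base model, and an oracle that requires the token "c" to appear; every name in it (`VOCAB`, `LENGTH`, `base_prob`, `oracle`, `success_prob`, `guided_prob`) is hypothetical. The success probability of each prefix is computed exactly by enumeration, whereas the abstract describes approximating this quantity with a neural model trained on samples from the base model. The reweighting rule used here (base probability times the ratio of success probabilities after and before the token) is one way to write token-level guidance whose product recovers the base distribution conditioned on the oracle.

```python
import itertools

# Hypothetical toy setup: tiny vocabulary and fixed-length sequences.
VOCAB = ("a", "b", "c")
LENGTH = 3


def base_prob(prefix, token):
    """Toy base model p(token | prefix): favours repeating the previous token."""
    if not prefix:
        return 1.0 / len(VOCAB)
    return 0.5 if token == prefix[-1] else 0.25


def oracle(seq):
    """Sequence-level boolean oracle: the sequence must contain 'c'."""
    return "c" in seq


def success_prob(prefix):
    """Probability, under the base model, that a completion of `prefix`
    satisfies the oracle. Computed exactly by recursion (an assumption of
    this sketch; a neural approximation would replace this in practice)."""
    if len(prefix) == LENGTH:
        return 1.0 if oracle(prefix) else 0.0
    return sum(base_prob(prefix, t) * success_prob(prefix + (t,)) for t in VOCAB)


def guided_prob(prefix, token):
    """Token-level guided distribution: base probability reweighted by the
    ratio of success probabilities after and before emitting `token`."""
    r_prefix = success_prob(prefix)
    if r_prefix == 0.0:
        return 0.0  # prefix can no longer satisfy the oracle
    return base_prob(prefix, token) * success_prob(prefix + (token,)) / r_prefix


def sequence_prob(step_prob, seq):
    """Probability of a full sequence under a per-token distribution."""
    prob = 1.0
    for i, tok in enumerate(seq):
        prob *= step_prob(seq[:i], tok)
    return prob


def all_sequences():
    """Every sequence of length LENGTH over VOCAB, as tuples."""
    return list(itertools.product(VOCAB, repeat=LENGTH))


if __name__ == "__main__":
    for seq in all_sequences():
        print("".join(seq), round(sequence_prob(guided_prob, seq), 4))
```

In this toy setting the guided model assigns zero probability to every sequence that violates the oracle, and the remaining sequences keep the same relative probabilities they had under the base model, which is the sense in which generation quality is preserved while the constraint is enforced.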

Paper Link

https://proceedings.neurips.cc/paper_files/paper/2022/hash/b40d5797756800c97f3d525c2e4c8357-Abstract-Conference.html

Status

Accepted (Oral/Spotlight)