Sequence Generation/Beam Search produce better text

I have combined the LSTM-based language models with sequence sampling and beam search. The output shown is the input prefix from the Sherlock Holmes text followed by the generated continuation, and it would be nice to see whether it can produce better text. I know perplexity is the standard measure for language models, since they can't be compared fairly by the text they produce, but the generated text is more interesting to look at. I have used the d2l and Straight Dope examples for text generation, but I know the LSTM LM and Transformer models can do much better.
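For context, the beam-search decoding loop boils down to something like the sketch below. It is framework-agnostic (NumPy only); `step`, `bos_id`, and `eos_id` are placeholder names for whatever interface the LM exposes, not actual d2l or gluon-nlp APIs:

```python
import numpy as np

def beam_search(step, state, bos_id, eos_id, beam_size=5, max_len=50):
    """Minimal beam-search sketch.

    `step(token, state)` is assumed to return (log_probs, new_state),
    where log_probs is a 1-D NumPy array over the vocabulary.
    """
    # Each beam entry is (cumulative log-prob, token sequence, model state).
    beams = [(0.0, [bos_id], state)]
    finished = []
    for _ in range(max_len):
        candidates = []
        for score, seq, st in beams:
            log_probs, new_st = step(seq[-1], st)
            # Keep only the beam_size best continuations of this hypothesis.
            top = np.argsort(log_probs)[-beam_size:]
            for tok in top:
                candidates.append(
                    (score + float(log_probs[tok]), seq + [int(tok)], new_st))
        # Prune to the globally best beam_size hypotheses.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for cand in candidates[:beam_size]:
            if cand[1][-1] == eos_id:
                finished.append(cand)   # hypothesis ended with <eos>
            else:
                beams.append(cand)
        if not beams:
            break
    finished.extend(beams)
    # Length-normalize so longer sequences are not unfairly penalized.
    finished.sort(key=lambda c: c[0] / len(c[1]), reverse=True)
    return finished[0][1]
```

Greedy decoding is just the `beam_size=1` special case; sequence sampling replaces the top-k selection with a draw from the softmax distribution.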

To get a generative model to produce high-quality text, a few tricks are useful:

  • Use long sentences / paragraphs / documents when training your language model. That's mainly why GPT-2 gave such good results with the WebText dataset, and why text generated from models trained on the Google 1 Billion Word dataset is so poor (it contains only shuffled short sentences).
  • Use a very large model (200M+ parameters).