83 Commits

Author SHA1 Message Date
wassname 3e7fdd5a3b More examples 2018-11-05 08:15:23 +08:00
wassname ed0c76ce82 Save/load 2018-11-04 22:45:47 +08:00
wassname c9f8cb90ff readme, clean up 2018-11-04 19:19:27 +08:00
wassname 1db17076fe using language model from horoscope_language_model (https://github.com/rodgzilla/pytorch-openai-transformer-lm/blob/horoscope_language_model) 2018-11-04 17:38:07 +08:00
wassname 2f9faab044 with decoding language model 2018-11-04 15:57:28 +08:00
wassname 87b738c089 clean 2018-11-04 13:23:29 +08:00
wassname 208daec592 working with gutenberg data set but multiheaded 2018-11-04 13:22:49 +08:00
Thomas Wolf eafc28abdf Merge pull request #37 from rodgzilla/removing_unused_decoder: Removing unused decoder for TransformedModel class. 2018-08-27 13:25:52 +02:00
Grégory Châtel 91b84b0acd Removing unused decoder for TransformedModel class. 2018-08-27 13:13:58 +02:00
Thomas Wolf a287899b79 Merge pull request #27 from sharpsy/sharpsy/add-position-info-comment: Add comment explaining the encoding of the position information. 2018-07-31 15:46:14 +02:00
Marko Pranjic 4e73dc3e00 Add comment explaining the encoding of the position information. 2018-07-18 14:35:17 +02:00
Thomas Wolf 561d4096be Merge pull request #25 from rodgzilla/multiple_choice_head: Simplifying the use of the model to perform different tasks 2018-07-18 10:43:44 +02:00
Thomas Wolf d9142286d3 Merge pull request #23 from rodgzilla/master: Removing unused field of the Transformer class. 2018-07-18 10:35:42 +02:00
Grégory Châtel cbccdb0004 Fixing a syntax problem in SimilarityHead. 2018-07-16 11:18:52 +02:00
Grégory Châtel cf65569610 Adding TODO and fixing indentation. 2018-07-16 11:09:37 +02:00
Grégory Châtel 08b83b4036 Moving loss function to a new module. 2018-07-16 10:40:46 +02:00
Grégory Châtel e1adeda208 Fixing indentation error. 2018-07-16 10:24:55 +02:00
Grégory Châtel df88759637 Adding inference option to DoubleHeadModel constructor. 2018-07-13 17:51:54 +02:00
Grégory Châtel 87b4901a81 Modifying the code of DoubleHeadModel to allow different task heads. 2018-07-13 17:27:33 +02:00
Grégory Châtel ac2250881a Adding the addition of the two sentence orderings in similarity head. 2018-07-13 17:26:32 +02:00
Grégory Châtel ea7f5006d5 Fixing similarity head to output a (batch, 3) dimensional tensor. 2018-07-13 17:25:26 +02:00
Grégory Châtel 4e6775287d Small changes to task head code. 2018-07-13 16:45:27 +02:00
Grégory Châtel 7270ba812e Similarity Head for the transformer. 2018-07-13 16:45:02 +02:00
Grégory Châtel e049859323 More generic DoubleHeadModel (WIP). 2018-07-12 14:43:11 +02:00
Grégory Châtel 5f3348430d New ClfHead module. 2018-07-12 14:34:22 +02:00
Grégory Châtel ed8bb28b50 Renaming ClfHead in MultipleChoiceHead. 2018-07-12 14:23:09 +02:00
Grégory Châtel 93522a3b59 Removing unused field of the Transformer class. 2018-07-12 14:14:17 +02:00
Thomas Wolf ea08605319 Merge pull request #22 from rodgzilla/noise_shape_dropout: Noise shape dropout 2018-07-12 11:18:36 +02:00
Grégory Châtel e40f8811fc Adding commentaries and fixing a syntax mistake. 2018-07-11 18:01:48 +02:00
Grégory Châtel 1c6c802545 Replacating the noise shape behavior. 2018-07-11 17:57:15 +02:00
Thomas Wolf 253ca422bb Merge pull request #20 from rodgzilla/fixing_travis: Removing python 2.7 check from Travis. 2018-07-09 14:04:21 +02:00
Grégory Châtel 90bbbf055e Removing python 2.7 check from Travis. 2018-07-09 13:57:48 +02:00
Thomas Wolf 55ba4d7840 Merge pull request #17 from rodgzilla/data_encoding_cleaning: Cleaning the data encoding code. 2018-07-05 15:43:07 +02:00
Thomas Wolf 7adb2a0091 Merge pull request #16 from rodgzilla/master: Updating README.md 2018-07-05 15:42:52 +02:00
Grégory Châtel c88d4656a1 Cleaning the data encoding code. 2018-07-05 14:46:18 +02:00
Grégory Châtel c149818fba Cleaner version of the encoding code. 2018-07-05 14:32:00 +02:00
Grégory Châtel a1c3b384ac Updating refenrences from model_py.py to model_pytorch.py. 2018-07-05 11:09:38 +02:00
Grégory Châtel 615ac6c05d Updating README.md with new module and class names. 2018-07-05 11:07:04 +02:00
Thomas Wolf ec0c86da2e Merge pull request #15 from rodgzilla/master: Fixing issue #13 and #14. 2018-07-04 13:57:01 +02:00
Grégory Châtel be407cdd37 Solving missing variable issue. 2018-07-04 13:50:19 +02:00
Grégory Châtel 2b7e97e307 Creating the model no matter what the number of GPU is. 2018-07-04 13:49:50 +02:00
thomwolf 37e77aff19 small fix cpu only version 2018-06-28 14:47:01 +02:00
thomwolf 03c9e74b1a clean up multi gpu logic 2018-06-28 13:41:15 +02:00
thomwolf aded2b058a Clean up multi-processing logic -- Switch to PyTorch 0.4 style 2018-06-28 10:43:15 +02:00
thomwolf 49ff9b5355 Merge pull request #10 from nottombrown/multi-gpu: [WIP] Add multi-gpu support 2018-06-27 21:11:20 +02:00
thomwolf 6a40c7d296 Merge pull request #9 from nottombrown/master: Remove globals to make code easier to follow 2018-06-27 09:03:34 +02:00
nottombrown db2f819d2a Complete todo in readme \o/ 2018-06-26 20:31:47 -07:00
nottombrown 09d28722a7 Multi-GPU fine-tuning works correctly 2018-06-26 20:23:56 -07:00
nottombrown 3906aa6801 rm setup.py 2018-06-26 19:51:48 -07:00
nottombrown a2820f3970 rm extra README.md 2018-06-26 19:51:02 -07:00