nottombrown
2439b3c971
Remove globals
2018-06-26 19:49:29 -07:00
nottombrown
9e5c14dd48
Refactor
2018-06-26 19:44:23 -07:00
nottombrown
1199c206b0
Cleanup
2018-06-26 19:31:46 -07:00
thomwolf
b0cdf6bf60
Merge pull request #8 from rodgzilla/master
...
Using negative index to reshape the input tensor.
2018-06-26 09:11:57 +02:00
Grégory Châtel
71bcad84f9
Using negative index to reshape the input tensor.
2018-06-22 15:33:29 +02:00
thomwolf
d74d00c8b5
Updated test accuracy to 85.84 with batch size 20
2018-06-19 22:20:09 +02:00
thomwolf
7b1d4775d9
small tweaks
2018-06-19 22:02:52 +02:00
thomwolf
f2a1dcf22f
updated test accuracy
2018-06-19 21:31:27 +02:00
thomwolf
f413a81700
clean up tied weights
2018-06-19 17:55:20 +02:00
thomwolf
cca20335bf
update optimizer scope in training loop
2018-06-19 17:48:44 +02:00
thomwolf
7702cc141b
Merge pull request #5 from cclauss/patch-1
...
Create .travis.yml
2018-06-19 17:45:01 +02:00
thomwolf
a0c2b61fd2
Merge pull request #6 from soskek/patch-2
...
fix the scope of optimizer
2018-06-19 17:44:41 +02:00
Sosuke Kobayashi
39891aae3c
fix the scope of optimizer
...
The target of an optimizer should contain `clf_head` (new task-specific output matrix) in addition to `model` (Transformer encoder).
The code might fail to do that, right?
2018-06-19 21:29:16 +09:00
cclauss
a9055bd370
Create .travis.yml
...
Output: https://travis-ci.org/huggingface/neuralcoref/jobs/393000348
2018-06-16 09:32:00 +02:00
thomwolf
e339361770
Merge pull request #4 from soskek/patch-1
...
fix arguments for initializing a model
2018-06-16 08:53:30 +02:00
Sosuke Kobayashi
3c485eb1d1
fix arguments for initializing a model
...
In the sample ROCStory script, the line for model construction seems wrong. This will fix it.
2018-06-16 08:26:08 +09:00
thomwolf
2ce292bfcb
typos
2018-06-15 21:13:29 +02:00
thomwolf
3388d7e1d2
Update readme
2018-06-15 21:13:29 +02:00
Mark Neumann
5b9cbd7468
fixes to load the model easier
2018-06-15 21:13:29 +02:00
thomwolf
aa800ef759
more details in readme
2018-06-14 17:29:36 +02:00
thomwolf
d05519131b
instruction on how to download openai weights
2018-06-14 17:27:19 +02:00
thomwolf
19cbffe6f8
updated readme
2018-06-14 17:24:17 +02:00
thomwolf
3d8d70937c
fix model and training
2018-06-14 16:40:00 +02:00
thomwolf
6a20d66253
updated readme
2018-06-14 15:09:50 +02:00
thomwolf
8d44206590
updated readme
2018-06-14 15:03:06 +02:00
thomwolf
49d628f68a
fixed modified Adam + added evaluation code
2018-06-14 14:29:53 +02:00
thomwolf
0704c84ab9
added openAIAdam optimizer
2018-06-14 13:19:07 +02:00
thomwolf
0b730579c5
model train run - no eval
2018-06-14 12:10:05 +02:00
thomwolf
89ab479aad
refactoring
2018-06-14 03:50:41 +02:00
thomwolf
1f209c42b1
first version of model + weights transfer
2018-06-14 00:18:48 +02:00
thomwolf
5eb7937f07
loss computation
2018-06-13 16:48:44 +02:00
thomwolf
8858f99438
code
2018-06-13 16:07:58 +02:00
thomwolf
83202ae27e
gitignore
2018-06-13 16:07:26 +02:00