You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for this interesting work. I contributed TAPAS some time ago to HuggingFace Transformers and was curious to see how well it performed on this dataset. Fun to see that you used it in your experiments :D
However, when checking out the experiments notebook, I saw that you instantiate the model as follows:
However, this will instantiate a randomly initialized TAPAS model, i.e. it will not instantiate the base of the model with the pre-trained weights. This is because you instantiate the model based on a configuration, rather than using the from_pretrained method.
Hence, fine-tuning this particular one will result in random results (as illustrated by your paper).
TAPAS seems to be the only one where you don't instantiate the model using from_pretrained, so curious to hear your reply!
And btw, we recently merged a new table-based model by Microsoft into the library called TAPEX. It claims to outperform TAPAS on several benchmarks, including TabFact. So would be interesting to see how well this one performs on PubHealthTab.
Kind regards,
Niels
ML engineer @ HuggingFace
The text was updated successfully, but these errors were encountered:
Hi,
Thanks for this interesting work. I contributed TAPAS some time ago to HuggingFace Transformers and was curious to see how well it performed on this dataset. Fun to see that you used it in your experiments :D
However, when checking out the experiments notebook, I saw that you instantiate the model as follows:
However, this will instantiate a randomly initialized TAPAS model, i.e. it will not instantiate the base of the model with the pre-trained weights. This is because you instantiate the model based on a configuration, rather than using the
from_pretrained
method.Hence, fine-tuning this particular one will result in random results (as illustrated by your paper).
TAPAS seems to be the only one where you don't instantiate the model using
from_pretrained
, so curious to hear your reply!And btw, we recently merged a new table-based model by Microsoft into the library called TAPEX. It claims to outperform TAPAS on several benchmarks, including TabFact. So would be interesting to see how well this one performs on PubHealthTab.
Kind regards,
Niels
ML engineer @ HuggingFace
The text was updated successfully, but these errors were encountered: