Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/129158
Full metadata record
DC Field: Value
dc.contributor.author: Teney, D.
dc.contributor.author: Abbasnejad, M.
dc.contributor.author: Van Den Hengel, A.
dc.contributor.editor: Vedaldi, A.
dc.contributor.editor: Bischof, H.
dc.contributor.editor: Brox, T.
dc.contributor.editor: Frahm, J.-M.
dc.date.issued: 2020
dc.identifier.citation: Lecture Notes in Artificial Intelligence, 2020 / Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (ed./s), vol.12355, pp.580-599
dc.identifier.isbn: 3030586065
dc.identifier.isbn: 9783030586065
dc.identifier.issn: 0302-9743
dc.identifier.issn: 1611-3349
dc.identifier.uri: http://hdl.handle.net/2440/129158
dc.description.abstract: One of the primary challenges limiting the applicability of deep learning is its susceptibility to learning spurious correlations rather than the underlying mechanisms of the task of interest. The resulting failure to generalise cannot be addressed by simply using more data from the same distribution. We propose an auxiliary training objective that improves the generalization capabilities of neural networks by leveraging an overlooked supervisory signal found in existing datasets. We use pairs of minimally-different examples with different labels, a.k.a. counterfactual or contrasting examples, which provide a signal indicative of the underlying causal structure of the task. We show that such pairs can be identified in a number of existing datasets in computer vision (visual question answering, multi-label image classification) and natural language processing (sentiment analysis, natural language inference). The new training objective orients the gradient of a model's decision function with pairs of counterfactual examples. Models trained with this technique demonstrate improved performance on out-of-distribution test sets.
dc.description.statementofresponsibility: Damien Teney, Ehsan Abbasnejad, and Anton van den Hengel
dc.language.iso: en
dc.publisher: Springer
dc.relation.ispartofseries: Lecture Notes in Computer Science; 12355
dc.rights: © Springer Nature Switzerland AG 2020
dc.source.uri: https://link.springer.com/book/10.1007/978-3-030-58607-2
dc.title: Learning what makes a difference from counterfactual examples and gradient supervision
dc.type: Conference paper
dc.contributor.conference: European Conference on Computer Vision Workshops (ECCV) (23 Aug 2020 - 28 Aug 2020 : virtual online)
dc.identifier.doi: 10.1007/978-3-030-58607-2_34
dc.publisher.place: Switzerland
pubs.publication-status: Published
dc.identifier.orcid: Teney, D. [0000-0003-2130-6650]
dc.identifier.orcid: Van Den Hengel, A. [0000-0003-3027-8364]
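The abstract describes orienting the gradient of a model's decision function using pairs of counterfactual examples. A minimal sketch of that idea is below: it penalises misalignment between the input gradient and the vector from an example to its counterfactual, illustrated with a toy linear scorer whose input gradient is its weight vector. All names (`gs_loss`, etc.) are illustrative assumptions, not taken from the authors' code, and the exact loss used in the paper may differ.

```python
import numpy as np

def gs_loss(grad_fx, x, x_cf, eps=1e-8):
    """Gradient-supervision sketch (assumed form, not the paper's code):
    1 - cosine similarity between the input gradient of the decision
    function at x and the displacement to the counterfactual x_cf."""
    d = x_cf - x
    cos = np.dot(grad_fx, d) / (np.linalg.norm(grad_fx) * np.linalg.norm(d) + eps)
    return 1.0 - cos

# Toy linear scorer f(x) = w . x, whose input gradient is simply w.
x = np.array([0.2, 0.5])
x_cf = np.array([0.9, 0.5])        # minimally-different counterfactual

w_aligned = np.array([1.0, 0.0])   # gradient points along x_cf - x: small loss
loss_aligned = gs_loss(w_aligned, x, x_cf)

w_orth = np.array([0.0, 1.0])      # gradient orthogonal to x_cf - x: large loss
loss_orth = gs_loss(w_orth, x, x_cf)
```

In training, such a term would be added to the usual task loss so that the model's sensitive directions match the features that actually change the label between counterfactual pairs.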
Appears in Collections:Aurora harvest 4
Australian Institute for Machine Learning publications

Files in This Item:
There are no files associated with this item.

