Talk:Autoencoder

Training section
Much of the training section does not seem to be directly related to auto-encoders in particular but about neural networks in general. No? BrokenSegue 09:17, 29 August 2011 (UTC)

Clarification of "An output layer, where each neuron has the same meaning as in the input layer"
I don't understand what "has the same meaning as in the input layer" means in the output layer definition in the article. Can someone explain or clarify in the article, please. Many thanks, p.r.newman (talk) 09:53, 10 October 2012 (UTC)

==Answer: The outputs are the same as the inputs, i.e. y_i = x_i. The autoencoder tries to learn the identity function. Although it might seem that if the number of hidden units >= the number of input units (/output units) the resulting weights would be the trivial identity, in practice this does not turn out to be the case (probably due to the fact that the weights start so small). Sparse autoencoders, where a limited number of hidden units can be activated at once, avoid this problem even in theory. 216.169.216.1 (talk) 16:47, 17 September 2013 (UTC) Dave Rimshnick

Where is the structure section taken from?
I was wondering if there were any book sources that could be added as reference where a similar approach in describing the autoencoder is taken. Also, what are the W and b terms? It's not very clear what role W and b play on the decoding and enconding process.

Hi there, for anyone struggling to get the correct scientific quote for Autoencoder and where the argmin stuff can be found, the source you're looking for "Threaded Ensembles of Supervised and Unsupervised Neural Networks for Stream Learning" & anyone who unlike me cares enough could add that quote to the article, glhf — Preceding unsigned comment added by 2003:EB:6724:3F08:B8F2:3F33:F768:C858 (talk) 16:46, 9 November 2019 (UTC)

Split proposed
I think it would make sense to split out the "variational autoencoder" section, given that they are generative models and their purpose differs significantly from classic autoencoders. Thoughts? Skjn (talk) 15:48, 19 May 2020 (UTC)
 * I feel like the current content in that section is already too WP:TEXTBOOK and if anything should be trimmed or gutted, rather than expanded into its own article. Rolf H Nelson (talk) 04:54, 21 May 2020 (UTC)
 * I second this proposal. The sheer magnitude of variational autoencoder based methods that have been developed in the past two years is immense. Definitely worth an independent article. — Preceding unsigned comment added by Parthzoozoo (talk • contribs) 17:22, 18 June 2020 (UTC)
 * Also agree with the proposal - they really are a quite different concept, as asserted in the text — Preceding unsigned comment added by 193.129.26.79 (talk) 15:18, 17 August 2020 (UTC)
 * I agree, they use variational inference which is very different from standard autoencoders. — Preceding unsigned comment added by 62.226.49.10 (talk) 22:34, 30 August 2020 (UTC)

Autoencoders variational equation
The equation that yields the parametrization of the auto encoder and its conjugate auto decoder appears to be wrong. The minimum extends over all x in X and all sampled parametrizations of phi and psi and the "arg" that realizes the minimum yields the optimized phi and psi parametrization.RutiWinkler (talk) 14:37, 3 December 2021 (UTC)

India Education Program course assignment
This article was the subject of an educational assignment at College Of Engineering Pune supported by Wikipedia Ambassadors through the India Education Program&#32;during the 2011 Q3 term.&#32;Further details are available on the course page.

The above message was substituted from by PrimeBOT (talk) on 20:09, 1 February 2023 (UTC)