Supersymmetric Artificial Neural Network


Template:Original research

Thought Curvature, or the "Supersymmetric Artificial Neural Network" hypothesis (accepted to the 2019 String Theory and Cosmology GRC conference[1]), is a Lie superalgebra-bound algorithmic learning model, motivated by emerging evidence pertaining to supersymmetry in the biological brain.[2]

It was introduced by Jordan Micah Bennett on May 10, 2016.

"Thought Curvature" or the "Supersymmetric Artificial Neural Network (2016)" is reasonably observable as a new branch or field of Deep Learning in Artificial Intelligence, called Supersymmetric Deep Learning, by Bennett. Supersymmetric Artificial Intelligence (though not Deep Gradient Descent-like machine learning) can be traced back to work by Czachor et al, concerning a single section/four paragraph thought experiment via segment "Supersymmetry and dimensional Reduction" on a so named "Supersymmetric Latent Semantic Analysis (2004)" based thought experiment; i.e. supersymmetry based single value decomposition, absent neural/gradient descent. Most of that paper apparently otherwise focusses on comparisons between non supersymmetric LSA/Single Value Decomposition, traditional Deep Neural Networks and Quantum Information Theory.[3] Biological science/Neuroscience saw application of supersymmetry, as far back as 2007 by Perez et al. (See reference 3 from Bennett's paper [4])

Method

Notation 1 - Manifold Learning: ϕ(x; θ)w[5]

Notation 2 - Supermanifold Learning: ϕ(x; θ, θ̄)w[4]

Instead of a representation parameterized by θ alone, as is typical in mean field theory or manifold learning models,[6][7][8] the Supersymmetric Artificial Neural Network is parameterized by the supersymmetric directions θ, θ̄.
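A minimal, purely illustrative sketch of the two notations above follows (it is not from Bennett's paper). The function names phi_manifold and phi_supermanifold are invented here, and since genuine superspace directions θ̄ are anticommuting (Grassmann-valued) quantities that ordinary floating-point arrays cannot represent, θ̄ is stood in for by a second, paired parameter array.

```python
# Illustrative sketch only: contrasts the manifold-learning parameterization
# phi(x; theta) w with a toy stand-in for phi(x; theta, theta_bar) w.
# theta_bar here is just a second parameter array, NOT a true Grassmann-valued
# (anticommuting) direction, which ordinary floats cannot represent.
import numpy as np

rng = np.random.default_rng(0)

def phi_manifold(x, theta):
    """Ordinary feature map phi(x; theta): a single tanh layer."""
    return np.tanh(x @ theta)

def phi_supermanifold(x, theta, theta_bar):
    """Toy stand-in for phi(x; theta, theta_bar): features from both
    parameter directions are concatenated before the readout w."""
    return np.concatenate([np.tanh(x @ theta), np.tanh(x @ theta_bar)], axis=-1)

x = rng.normal(size=(5, 3))            # 5 inputs, 3 features each
theta = rng.normal(size=(3, 4))        # ordinary parameters
theta_bar = rng.normal(size=(3, 4))    # paired "conjugate" parameters (illustrative only)
w = rng.normal(size=(8,))              # readout weights

y = phi_supermanifold(x, theta, theta_bar) @ w   # phi(x; theta, theta_bar) w
print(y.shape)                                   # (5,)
```

A faithful treatment would instead encode θ̄ with elements of a Grassmann algebra (for example, nilpotent matrix representations), rather than a second real-valued array.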

An informal proof of the representation power gained by deeper abstractions of the “Supersymmetric Artificial Neural Network”

Machine learning non-trivially concerns the application of families of functions that guarantee richer and richer variations in weight space. In other words, machine learning researchers study which functions best transform the weights of an artificial neural network, such that the weights come to represent values from which the network can produce correct hypotheses or guesses.

The 'Supersymmetric Artificial Neural Network' is yet another way to represent richer values in the weights of the model, because supersymmetric values can allow more information to be captured about the input space. For example, supersymmetric systems can capture potential "partner" signals, which lie beyond the feature space of the magnitude and phase signals learnt in typical real-valued neural networks and deep complex neural networks respectively (a short contrast is sketched in code after the list below). As such, a brief historical progression of geometric solution spaces for varying neural network architectures follows:

Template:Ordered list
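To make the magnitude-versus-phase contrast above concrete, the following sketch (illustrative only, not from the source) compares a real-valued layer, whose activations carry only magnitude-like information, with a complex-valued layer, whose activations additionally carry phase. A supersymmetric layer would add further "partner" directions beyond both; no standard numeric library provides such a layer, so it is not attempted here.

```python
# Illustrative sketch: real-valued activations expose magnitude-like signals,
# while complex-valued activations also expose phase, the extra signal that
# deep complex networks exploit.
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(size=(4,))

# Real-valued layer: activations carry magnitude information only.
W_real = rng.normal(size=(2, 4))
real_out = W_real @ x

# Complex-valued layer: activations carry both magnitude and phase.
W_complex = rng.normal(size=(2, 4)) + 1j * rng.normal(size=(2, 4))
complex_out = W_complex @ x.astype(complex)

print("real magnitudes:   ", np.abs(real_out))
print("complex magnitudes:", np.abs(complex_out))
print("complex phases:    ", np.angle(complex_out))
```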

Naive Architecture for the “Supersymmetric Artificial Neural Network”

Following is another view of “solution geometry” history, which may offer a clear way to see the reasoning behind the subsequent naive architecture sequence:

Template:Ordered list

The “Edward Witten/String theory powered artificial neural network” is simply an artificial neural network that learns supersymmetric[9] weights.

Looking at the above progression of ‘solution geometries’, going from SO(n)[10] representation to SU(n)[11] representation has guaranteed richer and richer representations in the weight space of the artificial neural network, and hence better and better hypotheses could be generated. It is then somewhat natural to look to SU(m|n) representation, i.e. the “Edward Witten/String theory powered artificial neural network” (“Supersymmetric Artificial Neural Network”).

To construct an “Edward Witten/String theory powered artificial neural network”, it may be feasible to compose a system which includes a Grassmann manifold artificial neural network,[12] then generate ‘charts’[13] until scenarios occur[9] where the “Edward Witten/String theory powered artificial neural network” is achieved, in the following way:

See points 1 to 5 in this reference[14]

It seems feasible that a C-bound atlas-based learning model, where said C is in the family of supermanifolds from supersymmetry, may be obtained from a system which includes charts (ϕ_I, U_I) of Grassmann manifold networks Gr(k, n) built from Stiefel manifolds St(k, n): taking V_I = {u ∈ ℝ^(n×k) : det(u_I) ≠ 0} and U_I = π(V_I), where π is the submersion mapping from the Stiefel manifold St(k, n) onto the Grassmann manifold Gr(k, n), each transition map ϕ_I ∘ ϕ_J⁻¹ is given by multiplication with an invertible submatrix on ϕ_I(U_I ∩ U_J), thereby yielding a differentiable Grassmann manifold Gr(k, n).[15]
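The chart construction referenced above is the standard one from differential geometry. The following sketch (assumptions: plain NumPy, an invented helper grassmann_chart, a row-index set I chosen by hand) computes the chart coordinates ϕ_I(span(u)) = u_{I^c} u_I⁻¹ and checks that they depend only on the subspace, not on the particular basis u.

```python
# Sketch of the Grassmann chart construction (standard differential geometry,
# not code from the source). A point of Gr(k, n) is the column span of a
# full-rank n x k matrix u; on V_I = {u : det(u_I) != 0}, the chart phi_I
# sends span(u) to the (n-k) x k matrix u_{I^c} u_I^{-1}.
import numpy as np

def grassmann_chart(u, I):
    """Chart coordinates phi_I(span(u)) for row-index set I with len(I) == k."""
    n, k = u.shape
    I = list(I)
    Ic = [i for i in range(n) if i not in I]          # complementary rows I^c
    u_I, u_Ic = u[I, :], u[Ic, :]
    if abs(np.linalg.det(u_I)) < 1e-12:
        raise ValueError("u lies outside the chart domain V_I (det(u_I) ~ 0)")
    return u_Ic @ np.linalg.inv(u_I)                  # (n-k) x k coordinates

rng = np.random.default_rng(2)
n, k = 5, 2
u = rng.normal(size=(n, k))
g = rng.normal(size=(k, k))                           # invertible change of basis

# The chart depends only on the subspace: u and u @ g give the same coordinates.
c1 = grassmann_chart(u, I=[0, 1])
c2 = grassmann_chart(u @ g, I=[0, 1])
print(np.allclose(c1, c2))                            # True
```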



Artificial Neural Network/Symmetry group landscape visualization

1. O(n) structure – Orthogonal; is not connected (it has two components), and is therefore not amenable to gradient descent in machine learning. (Paper: see note 2 at the end of page 2 in reference [16].)

2. SO(n) structure – Special orthogonal; is connected and gradient-descent compatible while preserving orthogonality, concerning normal space-time. (Paper: see the paper in item 1; a parameterization sketch follows this list.)

3. SU(n) structure – Special unitary; is connected and gradient-descent compatible; a complex generalization of SO(n), but only a subspace of the larger unitary space, concerning normal space-time. (The Unitary Evolution Recurrent Neural Network[17] is related to the complex unit circle U(1) in physics; see pages 2 and 7 in [18].)

4. U(n) structure – Unitary; is connected and gradient-descent compatible; a larger unitary landscape than SU(n), concerning normal space-time.[19]

5. SU(m|n) structure – Supersymmetric; is connected, and thereby reasonably gradient-descent compatible; an even larger landscape than U(n), permitting sparticle invariance as an extension of the Poincaré group (see page 7 in [20]), containing both normal space-time and anti-commuting components, as seen in the Supersymmetric Artificial Neural Network which this page proposes.
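As a rough illustration of why the connected structures in items 2 to 4 are gradient-descent compatible, the sketch below (not part of the source) parameterizes SO(n) and U(n) weights by an unconstrained matrix through the exponential map, so plain gradient descent on that matrix never leaves the group. The SU(m|n) case of item 5 would additionally require anticommuting (odd) parameters, which this sketch does not attempt.

```python
# Sketch: connected groups admit unconstrained parameterizations via the
# exponential map, which is what makes them friendly to gradient descent.
import numpy as np
from scipy.linalg import expm

rng = np.random.default_rng(3)
n = 4

# SO(n): the exponential of a skew-symmetric matrix is special orthogonal.
A = rng.normal(size=(n, n))
skew = A - A.T
Q = expm(skew)
print(np.allclose(Q.T @ Q, np.eye(n)), np.isclose(np.linalg.det(Q), 1.0))

# U(n): the exponential of a skew-Hermitian matrix is unitary.
B = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
skew_h = B - B.conj().T
U = expm(skew_h)
print(np.allclose(U.conj().T @ U, np.eye(n)))
```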


Ending Remarks

Pertinently, the “Edward Witten/String theory powered supersymmetric artificial neural network” is one wherein supersymmetric weights are sought. Many machine learning algorithms have not been empirically shown to be exactly biologically plausible; i.e. deep neural network algorithms have not been observed to occur in the brain, but regardless, such algorithms work in practice in machine learning.

Likewise, regardless of supersymmetry's elusiveness at the LHC, as seen above it may be quite feasible to borrow formal methods from strategies in physics, even if such strategies have yet to show that the related physical phenomena exist; thus it may be pertinent and feasible to try to construct a model that learns supersymmetric weights, as proposed throughout this paper, following the progression of solution geometries from SO(n) to SU(n) and onwards to SU(m|n).[21]

References

Template:Reflist