We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Combining optimal path search with task-dependent learning in a neural network

Abstract: Finding optimal paths in connected graphs requires determining the smallest total cost for traveling along the graph's edges. This problem can be solved by several classical algorithms where, usually, costs are predefined for all edges. Conventional planning methods can, thus, normally not be used when wanting to change costs in an adaptive way following the requirements of some task. Here we show that one can define a neural network representation of path finding problems by transforming cost values into synaptic weights, which allows for online weight adaptation using network learning mechanisms. When starting with an initial activity value of one, activity propagation in this network will lead to solutions, which are identical to those found by the Bellman Ford algorithm. The neural network has the same algorithmic complexity as Bellman Ford and, in addition, we can show that network learning mechanisms (such as Hebbian learning) can adapt the weights in the network augmenting the resulting paths according to some task at hand. We demonstrate this by learning to navigate in an environment with obstacles as well as by learning to follow certain sequences of path nodes. Hence, the here-presented novel algorithm may open up a different regime of applications where path-augmentation (by learning) is directly coupled with path finding in a natural way.
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as: arXiv:2201.11104 [cs.LG]
  (or arXiv:2201.11104v1 [cs.LG] for this version)

Submission history

From: Tomas Kulvicius [view email]
[v1] Wed, 26 Jan 2022 18:29:00 GMT (7361kb,D)
[v2] Thu, 27 Jan 2022 12:07:34 GMT (7361kb,D)
[v3] Tue, 24 May 2022 13:53:42 GMT (7520kb,D)
[v4] Wed, 26 Jul 2023 16:13:33 GMT (8727kb,D)
[v5] Sat, 21 Oct 2023 13:29:55 GMT (8727kb,D)
[v6] Thu, 2 Nov 2023 09:14:45 GMT (8727kb,D)

Link back to: arXiv, form interface, contact.