AN OPTIMIZATION NETWORK FOR MATRIX INVERSION

Similar documents
LU Decomposition Method

Ones Assignment Method for Solving Traveling Salesman Problem

Redundancy Allocation for Series Parallel Systems with Multiple Constraints and Sensitivity Analysis

Fuzzy Minimal Solution of Dual Fully Fuzzy Matrix Equations

Polynomial Functions and Models. Learning Objectives. Polynomials. P (x) = a n x n + a n 1 x n a 1 x + a 0, a n 0

Pattern Recognition Systems Lab 1 Least Mean Squares

Exact Minimum Lower Bound Algorithm for Traveling Salesman Problem

Cluster Analysis. Andrew Kusiak Intelligent Systems Laboratory

BOOLEAN MATHEMATICS: GENERAL THEORY

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

Lecture 18. Optimization in n dimensions

Chapter 3 Classification of FFT Processor Algorithms

ANN WHICH COVERS MLP AND RBF

Solving Fuzzy Assignment Problem Using Fourier Elimination Method

How do we evaluate algorithms?

Civil Engineering Computation

Orientation. Orientation 10/28/15

Parabolic Path to a Best Best-Fit Line:

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le

XIV. Congress of the International Society for Photogrammetry Hamburg 1980

Fuzzy Membership Function Optimization for System Identification Using an Extended Kalman Filter

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem

BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM

Consider the following population data for the state of California. Year Population

THIN LAYER ORIENTED MAGNETOSTATIC CALCULATION MODULE FOR ELMER FEM, BASED ON THE METHOD OF THE MOMENTS. Roman Szewczyk

The Closest Line to a Data Set in the Plane. David Gurney Southeastern Louisiana University Hammond, Louisiana

x x 2 x Iput layer = quatity of classificatio mode X T = traspositio matrix The core of such coditioal probability estimatig method is calculatig the

APPLICATION NOTE PACE1750AE BUILT-IN FUNCTIONS

Bezier curves. Figure 2 shows cubic Bezier curves for various control points. In a Bezier curve, only

Force Network Analysis using Complementary Energy

A Comparative Study on Modeling Methods for Switched Reluctance Machines

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs

An Efficient Algorithm for Graph Bisection of Triangularizations

ISSN (Print) Research Article. *Corresponding author Nengfa Hu

Optimum Solution of Quadratic Programming Problem: By Wolfe s Modified Simplex Method

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence

Numerical Methods Lecture 6 - Curve Fitting Techniques

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

Lecture 1: Introduction and Strassen s Algorithm

EE260: Digital Design, Spring /16/18. n Example: m 0 (=x 1 x 2 ) is adjacent to m 1 (=x 1 x 2 ) and m 2 (=x 1 x 2 ) but NOT m 3 (=x 1 x 2 )

A Study on the Performance of Cholesky-Factorization using MPI

MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fitting)

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control

Optimization for framework design of new product introduction management system Ma Ying, Wu Hongcui

EM375 STATISTICS AND MEASUREMENT UNCERTAINTY LEAST SQUARES LINEAR REGRESSION ANALYSIS

Filter design. 1 Design considerations: a framework. 2 Finite impulse response (FIR) filter design

A Note on Least-norm Solution of Global WireWarping

Python Programming: An Introduction to Computer Science

Computational Geometry

Higher-order iterative methods free from second derivative for solving nonlinear equations

New Fuzzy Color Clustering Algorithm Based on hsl Similarity

Lecture 5. Counting Sort / Radix Sort

Fast Fourier Transform (FFT) Algorithms

Parallel Polygon Approximation Algorithm Targeted at Reconfigurable Multi-Ring Hardware

PARALLEL AND DISTRIBUTED MULTI-ALGORITHM CIRCUIT SIMULATION. A Thesis RUICHENG DAI

FFT: An example of complex combinational circuits

An Efficient Algorithm for Graph Bisection of Triangularizations

Algorithm. Counting Sort Analysis of Algorithms

The Research Based on Fuzzy Sliding Mode Control for Linear Double. Inverted Pendulum

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8)

Neural Networks A Model of Boolean Functions

Learning Sensor Data Characteristics in Unknown Environments.

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS)

Second-Order Domain Decomposition Method for Three-Dimensional Hyperbolic Problems

Linearising Calibration Methods for a Generic Embedded Sensor Interface (GESI)

Image Segmentation EEE 508

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP

Data-Driven Nonlinear Hebbian Learning Method for Fuzzy Cognitive Maps

Bayesian approach to reliability modelling for a probability of failure on demand parameter

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

ECE4050 Data Structures and Algorithms. Lecture 6: Searching

NON-LINEAR MODELLING OF A GEOTHERMAL STEAM PIPE

1 Enterprise Modeler

6.854J / J Advanced Algorithms Fall 2008

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

EE123 Digital Signal Processing

The Counterchanged Crossed Cube Interconnection Network and Its Topology Properties

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

2D Isogeometric Shape Optimization considering both control point positions and weights as design variables

EE 435. Lecture 26. Data Converters. Architectures. Characterization

Reversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits

FINITE DIFFERENCE TIME DOMAIN METHOD (FDTD)

A log n lower bound to compute any function in parallel Reduction and broadcast in O(log n) time Parallel prefix (scan) in O(log n) time

The measurement of overhead conductor s sag with DLT method

An Algorithm of Mobile Robot Node Location Based on Wireless Sensor Network

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation

Major CSL Write your name and entry no on every sheet of the answer script. Time 2 Hrs Max Marks 70

Cubic Polynomial Curves with a Shape Parameter

COARSE ANGLE ROTATION MODE CORDIC BASED SINGLE PROCESSING ELEMENT QR-RLS PROCESSOR

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19

Classification of binary vectors by using DSC distance to minimize stochastic complexity

Nodal Circuit Analysis Using KCL. Most useful for when we have mostly current sources Node analysis uses KCL to establish the currents

BASED ON ITERATIVE ERROR-CORRECTION

Dimensionality Reduction PCA

Algorithms for Disk Covering Problems with the Most Points

New HSL Distance Based Colour Clustering Algorithm

Eigenimages. Digital Image Processing: Bernd Girod, 2013 Stanford University -- Eigenimages 1

Structuring Redundancy for Fault Tolerance. CSE 598D: Fault Tolerant Software

Transcription:

397 AN OPTIMIZATION NETWORK FOR MATRIX INVERSION Ju-Seog Jag, S~ Youg Lee, ad Sag-Yug Shi Korea Advaced Istitute of Sciece ad Techology, P.O. Box 150, Cheogryag, Seoul, Korea ABSTRACT Iverse matrix calculatio ca be cosidered as a optimizatio. We have demostrated that this problem ca be rapidly solved by highly itercoected simple euro-like aalog processors. A etwork for matrix iversio based o the cocept of Hopfield's eural etwork was desiged, ad implemeted with electroic hardware. With slight modificatios, the etwork is readily applicable to solvig a liear simultaeous equatio efficietly. Notable features of this circuit are potetial speed due to parallel processig, ad robustess agaist variatios of device parameters. INTRODUCTION Highly itercoected simple aalog processors which mmc a biological eural etwork are kow to excel at certai collective computatioal tasks. For example, Hopfield ad Tak desiged a etwork to solve the travelig salesma problem which is of the p -complete class,l ad also desiged a AID coverter of ovel architecture 2 based o the Hopfield's eural etwork model?' 4 The etwork could provide good or optimum solutios durig a elapsed time of oly a few characteristic time costats of the circuit. The essece of collective computatio is the dissipative dyamics i which iitial voltage cofiguratios of euro-like aalog processors evolve simultaeously ad rapidly to steady states that may be iterpreted as optimal solutios. Hopfield has costructed the computatioal eergy E (Liapuov fuctio), ad has show that the eergy fuctio E of his etwork decreases i time whe couplig coefficiets are symmetric. At the steady state E becomes oe of local miima. I this paper we cosider the matrix iversio as a optimizatio problem, ad apply the cocept of the Hopfield eural etwork model to this problem. CONSTRUCTION OF THE ENERGY FUNCTIONS Cosider a matrix equatio AV=I, where A is a iput X matrix, V is the ukow iverse matrix, ad I is the idetity matrix. Followig Hopfield we defie eergy fuctios E Ie' k = 1, 2,...,, E 1 = (1I2)[(~ A 1j Vj1-1)2 + (~A2) Vj1 )2 +... + (~Aj Vj1)2] )-1 j-1 E2 = (1/2)[(~A1)V)2l + (~A2)V)2-1)2 + )=1 )=1 )-1 + (~A)V}2)2] }-1 America Istitute of Physics 1988

398 E = (1/2)[(~ A1J VJ)2 + (~A2J Vj )2 +... + (~A) VJ _1)2] (1) j=l }=1 J-1 where AiJ ad ViL.are the elemets of ith row ad jth colum of matrix A ad V, respectively. whe A is a osigular matrix, the miimum value (=zero) of each eergy fuctio is uique ad is located at a poit i the correspodig hyperspace whose coordiates are { V u:, V 2k ' "', V k }, k = 1, 2, "',. At this miimum value of each eergy fuctio the values of V 11' V 12'..., V become the elemets of the iverse matrix A -1. Whe A is a sigular matrix the miimum value (i geeral, ot zero) of each eergy fuctio is ot uique ad is located o a cotour lie of the miimum value. Thus, if we costruct a model etwork i which iitial voltage cofiguratios of simple aalog processors, called euros, coverge simultaeously ad rapidly to the miimum eergy poit, we ca say the etwork have foud the optimum solutio of matrix iversio problem. The optimum solutio meas that whe A is a osigular matrix the result is the iverse matrix that we wat to kow, ad whe A is a sigular matrix the result is a solutio that is optimal i a least-square sese of Eq. (1). DESIGN OF THE NETWORK AND THE HOPFIELD MODEL Desigig the etwork for matrix iversio, we use the Hopfield model without iheret loss terms, that is, --= dt a ---Ek(V 11' V 2k'..., Vk ) av ik i,k=1,2,..., (2) where u ik is the iput voltage of ith euro i the kth etwork, Vik is its output, ad the fuctio gik is the iput-output relatioship. But the euros of this scheme operate i all the regios of gik differetly from Hopfield's oliear 2- state euros of associative memory models. 3 4 From Eq. (1) ad Eq. (2), we ca defie couplig coefficiets Tij betwee ith ad jth euros ad rewrite Eq. (2) as --= dt - ~ TiJ V)k + Aki, j=l TiJ = ~ AliAIJ = Tji ' 1=1 It may be oted that T i is idepedet of k ad oly oe set of hardware is eeded for all k. The implemeted etwork is show i Fig. 1. The same set of hardware with bias levels, ~ A Ji h), ca be used to solve a liear simultaeous )=1 (3)

399 equatio represeted by Ax=b for a give vector b. INPUT OUTPUT Fig. 1. Implemeted etwork for matrix iversio with exterally cotrollable couplig coefficiets. Noliearity betwee the iput ad the output of euros is assumed to be distributed i the adder ad the itegrator. The applicatio of the gradiet Hopfield model to this problem gives the result that is similar to the steepest descet method.s But the oliearity betwee the iput ad the output of euros is itroduced. Its effect to the computatioal capability will be cosidered ext. CHARACTERISTICS OF THE NETWORK For a simple case of 3 x3 iput matrices the etwork is implemeted with electroic hardware ad its dyamic behavior is simulated by itegratio of the Eq. (3). For osigular iput matrices, exact realizatio of Tij coectio ad bias Ali is a importat factor for calculatio accuracy, but the iitial coditio ad other device parameters such as steepess, shape ad uiformity of gil are ot. Eve a complex gik fuctio show i Fig. 2 ca ot affect the computatioal capability. Covergece time of the output state is determied by the characteristic time costat of the circuit. A example of experimetal results is show i Fig. 3. For sigular iput matrices, the coverged output voltage cofiguratio of the etwork is depedet upo the iitial state ad the shape of gil'

400,...- Vm-t- --::==----r A ik > 1 = 1 < 1 Vm Ui\< Fig. 2. gile fuctios used i computer simulatios where Aile is the steepess of sigmoid fuctio tah (Aile uile)' iput matrix [ 12 I] A = -I r 1 1 0-1 (cf) 0.5 -I A-I = [ 0 1 0.5 -I -o.~] -1.5 output matrix 0.50-0.98-0.49J [ V = 0.02 0.99 1.00 0.53-0.98-1.50 o 0.5 Fig. 3. A example of experimetal results

401 COMPLEXITY ANALYSIS By coutig operatios we compare the eural et approach with other wellkow methods such as Triagular-decompositio ad Gauss-Jorda elimiatio. 6 (1) Triagular-decompositio or Gauss-Jorda elimiatio method takes 0 (8 3 /3) multiqlicatios/divisios ad additios for large X matrix iversio, ad o (2 /3) multiplicatios/divisios ad additios for solvig the liear simultaeous equatio Ax=b. (2) The eural et approach takes the umber of operatios required to calculate Tij (othig but matrix-matrix multiplicatio), that is, 0 ( 3 /2) multiplicatios ad additios for both matrix iversio ad solvig the liear simultaeous equatio. Ad the time required for output stablizatio is about a few times the characteristic time costat of the etwork. The calculatio of couplig coefficiets ca be directly executed without multiple iteratios by a specially desiged optical matrix-matrix multiplier,' while the calculatio of bias values i solvig a liear simultaeous equatio ca be doe by a optical vector-matrix multiplier. 8 Thus, this approach has a defiite advatage i potetial calculatio speed due to global itercoectio of simple parallel aalog processors, though its calculatio accuracy may be limited by the ature of aalog computatio. A large umber of cotrollable Tij itercoectios may be easily realized with optoelectroic devices. 9 CONCLUSIONS We have desiged ad implemeted a matrix iversio etwork based o the cocept of the Hopfield's eural etwork model. 1bis etwork is composed of highly itercoected simple euro-like aalog processors which process the iformatio i parallel. The effect of sigmoid or complex oliearities o the computatioal capability is uimportat i this problem. Steep sigmoid fuctios reduce oly the covergece time of the etwork. Whe a osigular matrix is give as a iput, the etwork coverges spotaeously ad rapidly to the correct iverse matrix regardless of iitial coditios. Whe a sigular matrix is give as a iput, the etwork gives a stable optimum solutio that depeds upo iitial coditios of the etwork. REFERENCES 1. J. J. Hopfield ad D. W. Tak, BioI. Cyber. 52, 141 (1985). 2. D. W. Tak ad J. J. Hopfield, IEEE Tras. Circ. Sys. CAS-33, 533 (1986). 3. J. J. Hopfield, Proc. Natl. Acad. Sci. U.S.A. 79, 2554 (1982). 4. J. J. Hopfield, Proc. Natl. Acad. Sci. U.S.A. 81, 3088 (1984). 5. G. A. Bekey ad W. J. Karplus, Hybrid Computatio (Wiley, 1968), P. 244. 6. M. J. Maro, Numerical Aalysis: A Practical Approach (Macmilla, 1982), p. 138. 7. H. Nakao ad K. Hotate, Appl. Opt. 26, 917 (1987). 8. J. W. Goodma, A. R. Dias, ad I. M. Woody, Opt. Lett. ~ 1 (1978). 9. J. W. Goodma, F. J. Leoberg, S-Y. Kug, ad R. A. Athale, IEEE Proc. 72, 850 (1984).