Deep Neural Network Inverse Modeling for Integrated Photonics

Similar documents
Dynamic Window Decoding for LDPC Convolutional Codes in Low-Latency Optical Communications

ratio of the volume under the 2D MTF of a lens to the volume under the 2D MTF of a diffraction limited

SILICON PHOTONICS WAVEGUIDE AND ITS FIBER INTERCONNECT TECHNOLOGY. Jeong Hwan Song

Wide-angle and high-efficiency achromatic metasurfaces for visible light

Visible-frequency dielectric metasurfaces for multi-wavelength achromatic and highly-dispersive holograms

Supporting Information: Highly tunable elastic dielectric metasurface lenses

SUPPLEMENTARY INFORMATION

Supplementary Figure 1 Optimum transmissive mask design for shaping an incident light to a desired

A Single Grating-lens Focusing Two Orthogonally Polarized Beams in Opposite Direction

NEAR-IR BROADBAND POLARIZER DESIGN BASED ON PHOTONIC CRYSTALS

Coupling of surface roughness to the performance of computer-generated holograms

Calculating the Distance Map for Binary Sampled Data

How to Simulate and Optimize Integrated Optical Components. Lumerical Solutions, Inc.

Mid-wave infrared metasurface microlensed focal plane array for optical crosstalk suppression

Metasurfaces. Review: A.V. Kildishev, A. Boltasseva, V.M. Shalaev, Planar Photonics with Metasurfaces, Science 339, 6125 (2013)

Optimization-based Dielectric Metasurfaces for Angle-Selective Multifunctional Beam Deflection

Predicting resonant properties of plasmonic structures by deep learning

Mitigating Chromatic Dispersion with Hybrid Optical Metasurfaces

Study of 1x4 Optical Power Splitters with Optical Network

Systematic design process for slanted graing couplers,

Channel Locality Block: A Variant of Squeeze-and-Excitation

Heriot-Watt University

arxiv: v2 [physics.optics] 19 Jun 2017

SUPPLEMENTARY INFORMATION

Design of mechanically robust metasurface lenses for RGB colors

Flat Optics based on Metasurfaces

Numerical Study of Grating Coupler for Beam Steering. Wei Guo

Full-angle Negative Reflection with An Ultrathin Acoustic Gradient Metasurface: Floquet-Bloch Modes Perspective and Experimental Verification

THz Transmission Properties of Metallic Slit Array

Experiment 8 Wave Optics

Broadband metasurfaces for anomalous transmission and spectrum splitting at visible frequencies

Modeling of Elliptical Air Hole PCF for Lower Dispersion

Liquid Crystal Displays

A Viewer for PostScript Documents

Broadband Multifocal Conic-Shaped Lens

Xuechang Ren a *, Canhui Wang, Yanshuang Li, Shaoxin Shen, Shou Liu

Neural Network Weight Selection Using Genetic Algorithms

CMOS compatible highly efficient grating couplers with a stair-step blaze profile

HOLOEYE Photonics. HOLOEYE Photonics AG. HOLOEYE Corporation

Analysis of waveguide-splitter-junction in highindex Silicon-On-Insulator waveguides

specular diffuse reflection.

Advanced modelling of gratings in VirtualLab software. Site Zhang, development engineer Lignt Trans

Real-time Object Detection CS 229 Course Project

Title: Authors: Accepted: Posted Doc. ID:

A Direct Simulation-Based Study of Radiance in a Dynamic Ocean

Under My Finger: Human Factors in Pushing and Rotating Documents Across the Table

E x Direction of Propagation. y B y

Comparison of Beam Shapes and Transmission Powers of Two Prism Ducts

Study of Residual Networks for Image Recognition

Basic Polarization Techniques and Devices 1998, 2003 Meadowlark Optics, Inc

Structural color printing based on plasmonic. metasurfaces of perfect light absorption

Diffraction Efficiency

Fast Image Registration via Joint Gradient Maximization: Application to Multi-Modal Data

Achieve more with light.

Learning local evidence for shading and reflectance

Modeling and Optimization of Thin-Film Optical Devices using a Variational Autoencoder

The Steerable Pyramid: A Flexible Architecture for Multi-Scale Derivative Computation

UBCx Phot1x: Silicon Photonics Design, Fabrication and Data Analysis

1 Introduction j3. Thicknesses d j. Layers. Refractive Indices. Layer Stack. Substrates. Propagation Wave Model. r-t-φ-model

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

ORIGINAL ARTICLE Fiber Optic Based Pressure Sensor Using Comsol Multiphysics Software

Article. Reflection tuning via destructive interference in metasurface. Opto-Electronic Engineering. 1 Introduction

Classifying a specific image region using convolutional nets with an ROI mask as input

All-angle Negative Reflection with An Ultrathin Acoustic Gradient Metasurface: Floquet-Bloch Modes Perspective and Experimental Verification

Toward Spatial Queries for Spatial Surveillance Tasks

RicWaA v RicWaA User s Guide. Introduction. Features. Study with an example.

Some fast and compact neural network solutions for artificial intelligence applications

3D model classification using convolutional neural network

DISTRIBUTION A: Distribution approved for public release.

Fast Region-of-Interest Transcoding for JPEG 2000 Images

Surface Plasmon and Nano-metallic layout simulation with OptiFDTD

Three-dimensional imaging of 30-nm nanospheres using immersion interferometric lithography

Reproducing the hierarchy of disorder for Morpho-inspired, broad-angle color reflection

Electromagnetic wave manipulation using layered. systems

Diffraction. Single-slit diffraction. Diffraction by a circular aperture. Chapter 38. In the forward direction, the intensity is maximal.

SEMANTIC COMPUTING. Lecture 8: Introduction to Deep Learning. TU Dresden, 7 December Dagmar Gromann International Center For Computational Logic

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

COMPUTATIONAL INTELLIGENCE

Optimization of integrated optic components by refractive index profile measurements

MMI reflectors with free selection of reflection to transmission ratio Kleijn, E.; Vries, de, T.; Ambrosius, H.P.M.M.; Smit, M.K.; Leijtens, X.J.M.

Supplementary Figure 1: Schematic of the nanorod-scattered wave along the +z. direction.

Robust Face Recognition Based on Convolutional Neural Network

Supplementary Information. Phase recovery and holographic image reconstruction using deep learning in neural networks

Diffraction Gratings

COMSOL Multiphysics Software as a Metasurfaces Design Tool for Plasmonic-Based Flat Lenses

Four-zone reflective polarization conversion plate

Coherent Gradient Sensing Microscopy: Microinterferometric Technique. for Quantitative Cell Detection

Extending reservoir computing with random static projections: a hybrid between extreme learning and RC

HENet: A Highly Efficient Convolutional Neural. Networks Optimized for Accuracy, Speed and Storage

Diffraction Gratings as Anti Reflective Coatings Noah Gilbert. University of Arizona ngilbert .arizona.edu Phone: (520)

Convolutional Neural Networks: Applications and a short timeline. 7th Deep Learning Meetup Kornel Kis Vienna,

The role of light source in coherence scanning interferometry and optical coherence tomography

WAVELENGTH MANAGEMENT

Specification of Diffraction Orders for Grating Regions

Robust design of topology-optimized metasurfaces

Experimental demonstration of metasurface-based ultrathin carpet cloaks for millimetre waves

Copyright Shane Colburn

Fiber Fourier optics

Theoretical Treatments for the Butt-Coupling Characteristics of Fibers

Y-shaped beam splitter by graded structure design in a photonic crystal

Transcription:

MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Deep Neural Network Inverse Modeling for Integrated Photonics TaherSima, M.; Kojima, K.; Koike-Akino, T.; Jha, D.; Wang, B.; Lin, C.; Parsons, K. TR2018-183 December 29, 2018 Abstract We propose a deep neural network model that instantaneously predicts the optical response of nanopatterned silicon photonic power splitter topologies, and inversely approximates compact (2.6 x 2.6 um2) and efficient (above 92%) power splitters for target splitting ratios. Optical Fiber Communication Conference and Exposition and the National Fiber Optic Engineers Conference (OFC/NFOEC) This work may not be copied or reproduced in whole or in part for any commercial purpose. Permission to copy in whole or in part without payment of fee is granted for nonprofit educational and research purposes provided that all such whole or partial copies include the following: a notice that such copying is by permission of Mitsubishi Electric Research Laboratories, Inc.; an acknowledgment of the authors and individual contributions to the work; and all applicable portions of the copyright notice. Copying, reproduction, or republishing for any other purpose shall require a license with payment of fee to Mitsubishi Electric Research Laboratories, Inc. All rights reserved. Copyright c Mitsubishi Electric Research Laboratories, Inc., 2018 201 Broadway, Cambridge, Massachusetts 02139

Deep Neural Network Inverse Modeling for Integrated Photonics Mohammad H. Tahersima, Keisuke Kojima*, Toshiaki Koike-Akino, Devesh Jha, Bingnan Wang, Chungwei Lin, Kieran Parsons Mitsubishi Electric Research Labs. (MERL), 201 Broadway, Cambridge, MA 02139, USA. *kojima @ merl.com Abstract: We propose a deep neural network model that instantaneously predicts the optical response of nanopatterned silicon photonic power splitter topologies, and inversely approximates compact (2.6 2.6 µm 2 ) and efficient (above 92%) power splitters for target splitting ratios. OCIS codes: 250.5300, 160.3918, 100.4996. 1. Introduction Subwavelength nanopatterned devices can be used to control incident electromagnetic fields into specific transmitted and reflected wavefronts [1 4]. However, optimization of such nanostructures, with a large design space is computationally costly. For example, computing the electromagnetic field profile via finite-difference time-domain (FDTD) methods may require long simulation time, several minutes to hours depending on the area of photonic device. To resolve the issue, we previously developed an artificial intelligence integrated optimization process using neural networks (NN) that can accelerate optimization by reducing the required number of numerical simulations [5, 6]. To design a photonic power splitter with arbitrary power ratios, photonic designers often begin with an overall structure based on intuition or analytical models, and fine tune the structure using a parameter sweep in numerical simulations [7, 8]. We demonstrate an alternative approach that uses deep learning methods to learn the large design space of a broadband integrated photonic power splitter in a compact deep neural network (DNN) model. To achieve this, we 1) investigate how to construct and train a DNN (8 layers), 2) apply an inverse design method to instantaneously obtain desirable performance, and 3) show that it is possible to design power splitters with multiple splitting ratios from the same trained DNN. Fig. 1: By optimizing binary sequence of position of etch holes (white circles in the left figure) it is possible to manipulate light propagation towards either of ports. The DNN can take device topology as input and spectral response of the metadevice as label or vice versa. 2. Forward Prediction and Inverse Design The goal of a nanostructured integrated photonics power splitter is to organize optical interaction events, such that the collective effect of the ensemble of scattering events guides the beam to a target port within target power intensity. To design the power ratio splitter using DNN we chose a simple three port structure on a standard fully etched SOI platform with a 220 nm-thick silicon layer. One input and two output 0.5µm waveguides are connected using an

adiabatic taper to the 2.6µm wide square power splitter design region with a connection width of 1.3µm (Fig. 1). We use transverse electric (TE) mode as an input, and its conversion to transverse magnetic (TM) mode is minimal (< 10 5 ). To solve both forward and inverse design problems, we develop an eight layer deep and 100 neurons wide residual neural network (ResNet) [9]. The forward problem is approached as a regression problem while the inverse problem is solved as a classification problem, where we predict a binary vector representing the hole locations. A Gaussian log-likelihood function is used to train the DNN modeling the forward design problem. A Bernoulli log-likelihood classifier is used as the loss function for training the inverse problem. The Adam algorithm [10] is used for a training parameter optimization. We then generated nearly 20,000 3D-FDTD simulation data. Some data are randomly generated in parallel, while others are generated sequentially through optimization algorithms such as Direct Binary Search. Each input data consists of 20 20 hole vectors (HV), each labeled by its spectral transmission response (SPEC) at port 1 and 2 and reflection from the input port. Each pixel is a circle with a radius of 45 nm and can have a binary state of 1 for etched (n = n Silicon ) and 0 for not etched (n = n Silica ). We split the data into 80 % for the training, and 20 % for testing. The training typically takes several hours using a desktop PC with a GPU board. First we test the forward computation of the network to see prediction of spectral response of a topology data that the network is not trained on (Fig. 2 (a)). Interestingly, the network could predict transmission with above 99 % correlation coefficient (Fig. 2 (b)). This is a significant improvement from our previous results with only three layers [5]. Note that a conventional fully connected NN does not improve the performance beyond four layers, while the use of ResNet enabled us to increase the number of layers up to eight. Fig. 2: (a) Spectral response of non-optimized nano-patterned power splitters, simulated by FDTD (solid lines) and predicted by DNN (dashed lines) at each output port for two device topologies, and (b) correlation between the target values (FDTD simulation) and the predicted values by DNN. Gray circle symbol size is proportional to gradient uncertainty. Next, we test the inverse modeling on the same data as above by using SPECs as data and HVs as label and reversing and optimizing the inverse network. To test the generalization capabilities of the network, we investigate the network s inverse design performance on arbitrary and unfamiliar cases. To do this, we generate a reference table containing broadband constant transmission values for each port and use them as the input data batch for the Inverse Design DNN model. The predicted HVs can take any value from 0 to 1 from a Bernoulli distribution classifier. The classification converges to 0 or 1 as the loss reduces by increasing the number of training epochs. The predicted binary sequence is then fed back into the FTDT solver to confirm the validity of the response (Fig. 3). This inverse design process takes less than a second. The simulated total transmission efficiency is 96 %, 92 %, and 92 % for the splitting ratio of 1:1, 1:2, and 1:3, respectively. The spectral response is very flat for the whole wavelength range of 1450 1650 nm. The reflection is below 20 db for most of the wavelength range.

Fig. 3: Demonstration of inverse design using DNN with splitting ratios of 1:1, 1:2, and 1:3. Spectral response for transmission at port 1, 2, and reflection to the input port. FDTD-simulated values for predicted hole vectors on the right match the target values well. 3. Conclusion DNNs can use device structure data (shape, depth, and refractive indices) as an input to predict the optical response of the nanostructure (forward network). In this case DNN can be used as a method for fast approximation of the optical response instead of using computationally heavy numerical methods. Another way to use DNNs, which is not available in the numerical method, is taking an optical response and providing the user with an approximate solution of nanostructure (inverse design). We demonstrate the application of DNNs in design of nanostuctured integrated photonic components. Although the design space for this problem is very large (2 400 possible combinations), by training DNN with nearly 16,000 simulation data, we created a network that can approximate the spectral response of the an arbitrary hole vector within this design space. In addition, we could use the inverse network to design a nearly optimized power splitter topology for any user specific spectral responses, achieving > 92 % transmission efficiency and typically < 20 db reflection for 1450 1650 nm. References 1. Ni, X., Wong, Z. J., Mrejen, M., Wang, Y., Zhang, X., An ultrathin invisibility skin cloak for visible light, Science 349, 1310-1314 (2015). 2. Arbabi, E., Arbabi, A., Kamali, S. M., Horie, Y., Faraon, A., Multiwavelength polarization-insensitive lenses based on dielectric metasurfaces with meta-molecules, Optica 3, 628-633 (2016). 3. Krasnok, A., Tymchenko, M., Alú, A., Nonlinear metasurfaces: a paradigm shift in nonlinear optics, Materials Today 21, 8-21 (2017). 4. Khorasaninejad, M., Chen, W. T., Devlin, R. C., Oh, J., Zhu, A. Y., Capasso, F. Metalenses at visible wavelengths: Diffraction-limited focusing and subwavelength resolution imaging, Science, 352, 1190-1194 (2016). 5. Kojima, K., Wang, B., Kamilov, U., Koike-akino, T., Parsons, K. Acceleration of FDTD-based Inverse Design Using a Neural Network Approach, In Integrated Photonics Research, Silicon and Nanophotonics (Optical Society of America, 2017), paper ITu1A-4. 6. Teng, M., Kojima, K., Koike-Akino, T., Wang, B., Lin, C., Parsons, K. Broadband SOI mode order converter based on topology optimization, In Optical Fiber Communications Conference (Optical Society of America, 2017), paper Th2A.8. 7. Tian, Y., Qiu, J., Yu, M., Huang, Z., Qiao, Y., Dong, Z., Wu, J., Broadband 1 3 Couplers With Variable Splitting Ratio Using Cascaded Step-Size MMI, IEEE Photonics Journal 10, 1-8 (2018). 8. Xu, K., Liu, L., Wen, X., Sun, W., Zhang, N., Yi, N., Song, Q., Integrated photonic power divider with arbitrary power ratios, Optics Letters 42, 855-858 (2017). 9. He, K., Zhang, X., Ren, S., Sun, J., Deep residual learning for image recognition, In Proceedings of the IEEE conference on computer vision and pattern recognition, 770-778 (2016). 10. Kingma, D. and Ba, J. A.: A method for stochastic optimization, arxiv preprint arxiv:1412.6980 (2014).