--- title: "4. Einsum operation by DelayedTensor" author: - name: Koki Tsuyuzaki affiliation: Laboratory for Bioinformatics Research, RIKEN Center for Biosystems Dynamics Research - name: Itoshi Nikaido affiliation: Laboratory for Bioinformatics Research, RIKEN Center for Biosystems Dynamics Research email: k.t.the-answer@hotmail.co.jp graphics: no package: DelayedTensor output: BiocStyle::html_document: toc_float: true vignette: | %\VignetteIndexEntry{Einsum} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r style, echo = FALSE, results = 'asis', message=FALSE} BiocStyle::markdown() ``` **Authors**: `r packageDescription("DelayedTensor")[["Author"]] `
**Last modified:** `r file.info("DelayedTensor_4.Rmd")$mtime`
**Compiled**: `r date()` # What is `einsum` `einsum` is an easy and intuitive way to write tensor operations. It was originally introduced by `Numpy`^[https://numpy.org/doc/stable/reference/generated/numpy.einsum.html] package of Python but similar tools have been implemented in other languages (e.g. R, Julia) inspired by `Numpy`. In this vignette, we will use CRAN `r CRANpkg("einsum")` package first. `einsum` is named after Einstein summation^[https://en.wikipedia.org/wiki/Einstein_notation] introduced by Albert Einstein, which is a notational convention that implies summation over a set of indexed terms in a formula. Here, we consider a simple example of `einsum`; *matrix multiplication*. If we naively implement the matrix multiplication, the calculation would look like the following in a for loop. ```{r Matrix Multiplication 1, echo=TRUE} A <- matrix(runif(3*4), nrow=3, ncol=4) B <- matrix(runif(4*5), nrow=4, ncol=5) C <- matrix(0, nrow=3, ncol=5) I <- nrow(A) J <- ncol(A) K <- ncol(B) for(i in 1:I){ for(j in 1:J){ for(k in 1:K){ C[i,k] = C[i,k] + A[i,j] * B[j,k] } } } ``` Therefore, any programming language can implement this. However, when analyzing tensor data, such operations tend to be more complicated and increase the possibility of causing bugs because the order of tensors is larger or more tensors are handled simultaneously. In addition, several programming languages, especially R, are known to significantly slow down the speed of computation if the code is written in for loop. Obviously, in the case of the R language, it should be executed using the built-in matrix multiplication function (%\*%) prepared by the R, as shown below. ```{r Matrix Multiplication 2, echo=TRUE} C <- A %*% B ``` However, more complex operations than matrix multiplication are not always provided by programming languages as standard. `einsum` is a function that solves such a problem. To put it simply, `einsum` is a wrapper for the for loop above. Like the Einstein summation, it omits many notations such as for, array size (e.g. I, J, and K), brackets (e.g. {}, (), and []), and even addition operator (+) and extracts the array subscripts (e.g. i, j, and k) to concisely express the tensor operation as follows. ```{r Matrix Multiplication 3, echo=TRUE} suppressPackageStartupMessages(library("einsum")) C <- einsum('ij,jk->ik', A, B) ``` # Einsum of `DelayedTensor` CRAN `r CRANpkg("einsum")` is easy to use because the syntax is almost the same as that of `Numpy`'s `einsum`, except that it prohibits the implicit modes that do not use '->'. It is extremely fast because the internal calculation is actually performed by C++. When the input tensor is huge, however, it is not scalable because it assumes that the input is R's standard array. Using `einsum` of `r Biocpkg("DelayedTensor")`, we can augment the CRAN `einsum`'s functionality; in `r Biocpkg("DelayedTensor")`, the input `r Biocpkg("DelayedArray")` objects are divided into multiple block tensors and the CRAN `r CRANpkg("einsum")` is incremently applied in the block processing. # Typical operations of einsum A surprisingly large number of tensor operations can be handled uniformly in `einsum`. In more detail, `einsum` is capable of performing any tensor operation that can be described by a combination of the following three operations^[https://ajcr.net/Basic-guide-to-einsum/]. 1. **Multiplication**: the element values of the tensors on the left side of -> are multiplied by each other 2. **Summation**: when comparing the left and right sides of ->, if there is a missing subscript on the right, the summation is done in the direction of the subscript 3. **Permutation**: the subscripts to the right of -> can be rearranged in any order Some typical operations are introduced below. Here we use the arrays and `r Biocpkg("DelayedArray")` objects below. ```{r Setting, echo=TRUE} suppressPackageStartupMessages(library("DelayedTensor")) suppressPackageStartupMessages(library("DelayedArray")) arrA <- array(runif(3), dim=c(3)) arrB <- array(runif(3*3), dim=c(3,3)) arrC <- array(runif(3*4), dim=c(3,4)) arrD <- array(runif(3*3*3), dim=c(3,3,3)) arrE <- array(runif(3*4*5), dim=c(3,4,5)) darrA <- DelayedArray(arrA) darrB <- DelayedArray(arrB) darrC <- DelayedArray(arrC) darrD <- DelayedArray(arrD) darrE <- DelayedArray(arrE) ``` ## No Operation ### print, show If the same subscript is written on both sides of ->, `einsum` will simply output the object without any calculation. ```{r Print 1, echo=TRUE} einsum::einsum('i->i', arrA) DelayedTensor::einsum('i->i', darrA) ``` ```{r Print 2, echo=TRUE} einsum::einsum('ij->ij', arrC) DelayedTensor::einsum('ij->ij', darrC) ``` ```{r Print 3, echo=TRUE} einsum::einsum('ijk->ijk', arrE) DelayedTensor::einsum('ijk->ijk', darrE) ``` ### diag We can also extract the diagonal elements as follows. ```{r Diag 1, echo=TRUE} einsum::einsum('ii->i', arrB) DelayedTensor::einsum('ii->i', darrB) ``` ```{r Diag 2, echo=TRUE} einsum::einsum('iii->i', arrD) DelayedTensor::einsum('iii->i', darrD) ``` ## Multiplication By using multiple arrays or `r Biocpkg("DelayedArray")` objects as input and writing "," on the right side of ->, multiplication will be performed. ### Hadamard Product Hadamard Product can also be implemented in `einsum`, multiplying by the product of each element. ```{r Hadamard Product 1, echo=TRUE} einsum::einsum('i,i->i', arrA, arrA) DelayedTensor::einsum('i,i->i', darrA, darrA) ``` ```{r Hadamard Product 2, echo=TRUE} einsum::einsum('ij,ij->ij', arrC, arrC) DelayedTensor::einsum('ij,ij->ij', darrC, darrC) ``` ```{r Hadamard Product 3, echo=TRUE} einsum::einsum('ijk,ijk->ijk', arrE, arrE) DelayedTensor::einsum('ijk,ijk->ijk', darrE, darrE) ``` ### Outer Product The outer product can also be implemented in `einsum`, in which the subscripts in the input array are all different, and all of them are kept. ```{r Outer Product 1, echo=TRUE} einsum::einsum('i,j->ij', arrA, arrA) DelayedTensor::einsum('i,j->ij', darrA, darrA) ``` ```{r Outer Product 2, echo=TRUE} einsum::einsum('ij,klm->ijklm', arrC, arrE) DelayedTensor::einsum('ij,klm->ijklm', darrC, darrE) ``` ## Summation If there is a vanishing subscript on the left or right side of ->, the summation is done for that subscript. ```{r Summation 1, echo=TRUE} einsum::einsum('i->', arrA) DelayedTensor::einsum('i->', darrA) ``` ```{r Summation 2, echo=TRUE} einsum::einsum('ij->', arrC) DelayedTensor::einsum('ij->', darrC) ``` ```{r Summation 3, echo=TRUE} einsum::einsum('ijk->', arrE) DelayedTensor::einsum('ijk->', darrE) ``` ### Row-wise / Column-wise Summation ```{r rowSums, echo=TRUE} einsum::einsum('ij->i', arrC) DelayedTensor::einsum('ij->i', darrC) ``` ```{r colSums, echo=TRUE} einsum::einsum('ij->j', arrC) DelayedTensor::einsum('ij->j', darrC) ``` ### Mode-wise Vectorization ```{r Summation in each Mode 1, echo=TRUE} einsum::einsum('ijk->i', arrE) DelayedTensor::einsum('ijk->i', darrE) ``` ```{r Summation in each Mode 2, echo=TRUE} einsum::einsum('ijk->j', arrE) DelayedTensor::einsum('ijk->j', darrE) ``` ```{r Summation in each Mode 3, echo=TRUE} einsum::einsum('ijk->k', arrE) DelayedTensor::einsum('ijk->k', darrE) ``` ### Mode-wise Summation These are the same as what the `modeSum` function does. ```{r Mode Sum 1, echo=TRUE} einsum::einsum('ijk->ij', arrE) DelayedTensor::einsum('ijk->ij', darrE) ``` ```{r Mode Sum 2, echo=TRUE} einsum::einsum('ijk->jk', arrE) DelayedTensor::einsum('ijk->jk', darrE) ``` ```{r Mode Sum 3, echo=TRUE} einsum::einsum('ijk->jk', arrE) DelayedTensor::einsum('ijk->jk', darrE) ``` ### Trace If we take the diagonal elements of a matrix and add them together, we get `trace`. ```{r Trace, echo=TRUE} einsum::einsum('ii->', arrB) DelayedTensor::einsum('ii->', darrB) ``` ## Permutation By changing the order of the indices on the left and right side of ->, we can get a sorted array or `r Biocpkg("DelayedArray")`. ```{r Permutation 1, echo=TRUE} einsum::einsum('ij->ji', arrB) DelayedTensor::einsum('ij->ji', darrB) ``` ```{r Permutation 2, echo=TRUE} einsum::einsum('ijk->jki', arrD) DelayedTensor::einsum('ijk->jki', darrD) ``` ## Multiplication + Summation Some examples of combining Multiplication and Summation are shown below. ### Inner Product (Squared Frobenius Norm) Inner Product first calculate Hadamard Product and collapses it to 0D tensor (norm). ```{r Inner Product 1, echo=TRUE} einsum::einsum('i,i->', arrA, arrA) DelayedTensor::einsum('i,i->', darrA, darrA) ``` ```{r Inner Product 2, echo=TRUE} einsum::einsum('ij,ij->', arrC, arrC) DelayedTensor::einsum('ij,ij->', darrC, darrC) ``` ```{r Inner Product 3, echo=TRUE} einsum::einsum('ijk,ijk->', arrE, arrE) DelayedTensor::einsum('ijk,ijk->', darrE, darrE) ``` ### Contracted Product The inner product is an operation that eliminates all subscripts, while the outer product is an operation that leaves all subscripts intact. In the middle of the two, the operation that eliminates some subscripts while keeping others by summing them is called contracted product. ```{r Contracted Product, echo=TRUE} einsum::einsum('ijk,ijk->jk', arrE, arrE) DelayedTensor::einsum('ijk,ijk->jk', darrE, darrE) ``` ### Matrix Multiplication Matrix Multiplication is considered a contracted product. ```{r Matrix Multiplication, echo=TRUE} einsum::einsum('ij,jk->ik', arrC, t(arrC)) DelayedTensor::einsum('ij,jk->ik', darrC, t(darrC)) ``` ## Multiplication + Permutation Some examples of combining Multiplication and Permutation are shown below. ```{r Multiplication and Permutation 1, echo=TRUE} einsum::einsum('ij,ij->ji', arrC, arrC) DelayedTensor::einsum('ij,ij->ji', darrC, darrC) ``` ```{r Multiplication and Permutation 2, echo=TRUE} einsum::einsum('ijk,ijk->jki', arrE, arrE) DelayedTensor::einsum('ijk,ijk->jki', darrE, darrE) ``` ## Summation + Permutation Some examples of combining Summation and Permutation are shown below. ```{r Summation and Permutation, echo=TRUE} einsum::einsum('ijk->ki', arrE) DelayedTensor::einsum('ijk->ki', darrE) ``` ## Multiplication + Summation + Permutation Finally, we will show a more complex example, combining Multiplication, Summation, and Permutation. ```{r Multiplication, Summation, and Permutation, echo=TRUE} einsum::einsum('i,ij,ijk,ijk,ji->jki', arrA, arrC, arrE, arrE, t(arrC)) DelayedTensor::einsum('i,ij,ijk,ijk,ji->jki', darrA, darrC, darrE, darrE, t(darrC)) ``` # Create your original function by `einsum` By using `einsum` and other `r Biocpkg("DelayedTensor")` functions, it is possible to implement your original tensor calculation functions. It is intended to be applied to Delayed Arrays, which can scale to large-scale data since the calculation is performed internally by block processing. For example, `kronecker` can be easily implmented by `eimsum` and other `r Biocpkg("DelayedTensor")` functions^[https://stackoverflow.com/ questions/56067643/speeding-up-kronecker-products-numpy] (the `kronecker` function inside `r Biocpkg("DelayedTensor")` has a more efficient implementation though). ```{r Kronecker Function, echo=TRUE} darr1 <- DelayedArray(array(1:6, dim=c(2,3))) darr2 <- DelayedArray(array(20:1, dim=c(4,5))) mykronecker <- function(darr1, darr2){ stopifnot((length(dim(darr1)) == 2) && (length(dim(darr2)) == 2)) # Outer Product tmpdarr <- DelayedTensor::einsum('ij,kl->ikjl', darr1, darr2) # Reshape DelayedTensor::unfold(tmpdarr, row_idx=c(2,1), col_idx=c(4,3)) } identical(as.array(DelayedTensor::kronecker(darr1, darr2)), as.array(mykronecker(darr1, darr2))) ``` # Session information {.unnumbered} ```{r sessionInfo, echo=FALSE} sessionInfo() ```