Modulus Bootstrapping

D-3.13 Modulus Bootstrapping

- Reference: Bootstrapping for Approximate Homomorphic Encryption [18]

In CKKS, each ciphertext is associated with a particular multiplicative level, which decreases by 1 upon each ciphertext-to-ciphertext multiplication (due to its internal modulus rescaling operation). Reaching multiplicative level 0 is equivalent to reaching the end of the ciphertext’s modulus chain, at which point no more ciphertext-to-ciphertext multiplication can be performed. To continue with further ciphertext-to-ciphertext multiplications, CKKS provides a special operation called bootstrapping, which is a process of resetting the ciphertext’s end-of-chain modulus $q_{0}$ to the initial maximum modulus $q_{L}$ (which is either $q_{0} \cdot Δ^{L}$ in the vanilla rescaling scheme, or $\prod_{m = 0}^{L} w_{m}$ in the case of using CRT, as explained in §D-3.5.4).

Suppose we have a ciphertext $(A, B)$ at multiplicative level 0 (i.e., the ciphertext’s modulus is $q_{0}$ ). Decrypting it without the final reduction modulo $q_{0}$ would output:

${𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B))$

$= B + A \cdot S = Δ M + E + q_{0} \cdot K$ # since $B + A \cdot S mod q_{0} = Δ M + E$

, where $q_{0} \cdot K$ accounts for the wrap-around modulo $q_{0}$ : each coefficient of the polynomial $q_{0} K$ is some multiple of $q_{0}$ . CKKS’s bootstrapping procedure is equivalent to safely transforming a ciphertext’s modulus from $q_{0}$ to $q_{L}$ (where $q_{L} ≫ q_{0}$ ).
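The wrap-around term $q_{0} K$ can be observed in a toy example. The sketch below is not a secure or complete CKKS implementation: the parameters ( $n = 8$ , $q_{0} = 2^{20}$ , $Δ = 2^{10}$ ), the helper names `negacyclic_mul` and `centered`, and the noise ranges are all illustrative assumptions. It builds $(A, B)$ so that $B + A \cdot S mod q_{0} = Δ M + E$ , then evaluates $B + A \cdot S$ over the integers (with centered representatives) and reads off $K$ .

```python
import random

def negacyclic_mul(a, b):
    """Multiply two polynomials in Z[X]/(X^n + 1) over the integers (no modular reduction)."""
    n = len(a)
    out = [0] * n
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            if i + j < n:
                out[i + j] += ai * bj
            else:
                out[i + j - n] -= ai * bj  # X^n = -1
    return out

def centered(x, q):
    """Representative of x mod q in the range [-q/2, q/2)."""
    r = x % q
    return r - q if r >= q // 2 else r

rng = random.Random(0)              # fixed seed; toy parameters only (assumption)
n, q0, delta = 8, 2**20, 2**10
S = [rng.choice([-1, 0, 1]) for _ in range(n)]   # ternary secret key
M = [rng.randint(-4, 4) for _ in range(n)]       # small plaintext coefficients
E = [rng.randint(-3, 3) for _ in range(n)]       # small noise
A = [rng.randrange(q0) for _ in range(n)]
AS = negacyclic_mul(A, S)
B = [(delta * m + e - p) % q0 for m, e, p in zip(M, E, AS)]

# Decrypt WITHOUT reducing modulo q0: the result is Delta*M + E + q0*K.
A_c = [centered(a, q0) for a in A]
B_c = [centered(b, q0) for b in B]
raw = [b + p for b, p in zip(B_c, negacyclic_mul(A_c, S))]
target = [delta * m + e for m, e in zip(M, E)]
K = [(r - t) // q0 for r, t in zip(raw, target)]
print("K =", K)
```

Because $S$ is ternary and $A$ , $B$ are taken as centered representatives, each $|k_{i}|$ in this toy setting is bounded by roughly $\frac{n}{2}$ ; a sparser $S$ would tighten that bound.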

D-3.13.1 High-level Idea

As the first step of bootstrapping, we forcibly change the modulus of the ciphertext $(A, B)$ from $q_{0}$ to $q_{L}$ . Then, its decryption with reduction modulo $q_{L}$ would output:

${𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B)) mod q_{L}$

$= B + A \cdot S mod q_{L}$

$= Δ M + E + q_{0} K mod q_{L}$

Here, we assume that $q_{L}$ is large enough that every coefficient of $Δ M + E + q_{0} K$ is much smaller than $q_{L}$ in magnitude. This holds because the coefficients of $S$ are small (each is in $\{- 1, 0, 1\}$ ), and thus the coefficients of $B + A \cdot S$ do not grow much.

In the $Δ M + E + q_{0} K mod q_{L}$ term, the $q_{0} K$ part is no longer removed by reduction modulo $q_{0}$ . As a result, each $i$ -th coefficient of the decrypted plaintext polynomial becomes the corrupted value $Δ m_{i} + e_{i} + q_{0} \cdot k_{i} mod q_{L}$ instead of $Δ m_{i} + e_{i} mod q_{L}$ . So, we now need to eliminate the garbage term $q_{0} \cdot k_{i}$ in each coefficient and distill the pure plaintext coefficient $Δ m_{i} + e_{i}$ .

Figure 16: Sine function $f (x) = \frac{q_{0}}{2 π} \cdot \sin (\frac{2 𝜋𝑥}{q_{0}})$ such that $f (Δ m_{i} + e_{i} + q_{0} k_{i}) \approx Δ m_{i} + e_{i}$ (provided $Δ m_{i} + e_{i} ≪ q_{0}$ ) (Source)

To do so, we will take an approximated approach by using a sine function described in Figure 16, which has a period of $q_{0}$ with the amplitude $\frac{q_{0}}{2 π}$ . This sine function has the following two useful properties:

1. When $f (x)$ is evaluated at $x$ values near a multiple of $q_{0}$ , its graph is approximately a straight line with slope 1; in particular, $f (x) \approx x$ near $x = 0$ . This is because the derivative (slope) of $\sin x$ is $y^{'} = \cos x$ , and if $x$ is a multiple of $2 π$ , the slope is $y^{'} = \cos 2 π = 1$ .
2. The evaluation of $f (x)$ eliminates the multiples of $q_{0}$ from the input (i.e., it acts like reduction modulo $q_{0}$ ), because $f$ is periodic with period $q_{0}$ .
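The two properties above can be checked numerically. The following is a minimal sketch with assumed toy values ( $q_{0} = 2^{20}$ and a stand-in value `t` for $Δ m_{i} + e_{i}$ ); it evaluates $f$ in plain floating point, not homomorphically.

```python
import math

def f(x, q0):
    """Scaled sine with period q0 and amplitude q0 / (2*pi)."""
    return q0 / (2 * math.pi) * math.sin(2 * math.pi * x / q0)

q0 = 2**20
t = 1000.0  # stands for Delta*m_i + e_i, assumed much smaller than q0

# Property 2: adding any multiple of q0 to the input does not change the output.
# Property 1: near a multiple of q0 the output is close to the small part t.
errors = [abs(f(t + q0 * k, q0) - t) for k in range(-3, 4)]
print("max error for small t:", max(errors))

# The approximation degrades once the small part is no longer << q0.
big_t = 100000.0
print("error for larger t:", abs(f(big_t, q0) - big_t))
```

The residual error for small `t` comes from the cubic term of the sine series, which is why the condition $Δ m_{i} + e_{i} ≪ q_{0}$ matters.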

Combining these two properties, given input $x = Δ m_{i} + e_{i} + q_{0} k_{i}$ ,

$f (Δ m_{i} + e_{i} + q_{0} k_{i}) = \frac{q_{0}}{2 π} \cdot \sin (\frac{2 π \cdot (Δ m_{i} + e_{i} + q_{0} k_{i})}{q_{0}}) = \frac{q_{0}}{2 π} \cdot \sin (\frac{2 π \cdot (Δ m_{i} + e_{i})}{q_{0}}) \approx Δ m_{i} + e_{i}$

, provided $Δ m_{i} + e_{i}$ is very close to 0 relative to $q_{0}$ (i.e., $Δ m_{i} + e_{i} ≪ q_{0}$ ). This holds because, by construction of the CKKS scheme, the plaintext (even after scaling it up by $Δ$ and adding the noise) is significantly smaller than the ciphertext modulus. Therefore, to remove $q_{0} K$ from $Δ M + E + q_{0} K$ , we can update each coefficient of the polynomial $Δ M + E + q_{0} K$ by evaluating it with the sine function $f (x)$ .

However, we cannot directly update the coefficients of the polynomial, because the CKKS scheme (and RLWE-based schemes in general) only supports slot-wise $(+,⋅)$ operations on the input vector. Therefore, to update the polynomial coefficients, we need to express the update logic in terms of slot-wise input vector arithmetic $(+,⋅)$ . Considering all these, CKKS’s overall bootstrapping procedure is described in Table 6.


1	ModRaise: Given the ciphertext $(A, B) mod q_{0}$ , we forcibly change its modulus from $q_{0}$ to $q_{L}$ .
	It then ends up encrypting $Δ M + E + q_{0} K$ instead of $Δ M + E$ .
2	CoeffToSlot: Based on step 1’s ciphertext $(A, B) mod q_{L}$ , we generate a new ciphertext
	that encrypts an input vector whose $i$ -th slot stores $Δ m_{i} + e_{i} + q_{0} k_{i}$ .
	This is equivalent to moving the coefficients of the polynomial $Δ M + E + q_{0} K$ to
	the input vector slots.
3	EvalExp: We convert the sine function into an approximating polynomial by using
	the Taylor series, as well as other optimizations such as
	Euler’s formula ( $e^{𝑖𝜃} = \cos (𝜃) + i \cdot \sin (𝜃)$ ). Then, we generate a CKKS plaintext that encodes
	this approximated sine function, and use it to homomorphically evaluate
	step 2’s encrypted vector elements (to homomorphically remove every $q_{0} k_{i}$ ).
4	SlotToCoeff: Based on the resulting ciphertext from step 3, we generate a new ciphertext
	whose encrypted polynomial’s $i$ -th coefficient is (approximately) $Δ m_{i} + e_{i}$ .
	This is equivalent to moving the $q_{0} k_{i}$ -eliminated values stored in the input vector slots in
	step 3 back to the positions of the polynomial coefficients. The final ciphertext is
	our goal ciphertext that (approximately) encrypts $Δ M + E$ under modulus $q_{L}$ .

Table 6: High-level Description of CKKS’s Bootstrapping Procedure

D-3.13.2 Mathematical Expansion of the High-level Idea

We will mathematically walk through how the bootstrapping procedure (Table 6) correctly updates the modulus of the input ciphertext from $q_{0}$ to $q_{L}$ .

For ease of understanding, we will first explain how we would do modulus bootstrapping for a ciphertext with multiplicative level 0 (i.e., its modulus is $q_{0}$ ) in case we have access to the secret key $S (X)$ . Using this key, we can decrypt the ciphertext as follows:

${𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B))$ # where $𝖼𝗍 = (A, B) = {𝖱𝖫𝖶𝖤}_{S, σ} (Δ M)$

$= B + A \cdot S = Δ M + E mod q_{0}$

$= Δ M + E + q_{0} K$ # where $q_{0} K$ accounts for any potential wrap-around modulo $q_{0}$ values

Our initial goal is to bootstrap the modulus of the ciphertext from $q_{0}$ to $q_{L}$ by using only the following three tools:

- Secret key $S$
- Batch-encoding ( $σ^{- 1}$ ) and decoding ( $σ$ ) formulas
- Batched slot-wise $(+,⋅)$ operation of input vectors based on their batch-encoded polynomials

After explaining the above, we will then explain how to achieve the same bootstrapping without having access to the secret key $S$ .

ModRaise: This step forcibly changes the ciphertext’s modulus from $q_{0}$ to $q_{L}$ and then decrypts the ciphertext as follows:

${𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B)) = B + A \cdot S = Δ M + E + q_{0} K mod q_{L}$

Notice that each $i$ -th coefficient of the decrypted plaintext polynomial gets corrupted to $Δ m_{i} + e_{i} + q_{0} \cdot k_{i} mod q_{L}$ . So, we now need to eliminate the garbage term $q_{0} k_{i}$ in each coefficient and distill the pure plaintext coefficient $Δ m_{i} + e_{i}$ .

CoeffToSlot: This step generates a new plaintext polynomial whose $i$ -th input vector slot stores the corrupted coefficient $(Δ m_{i} + e_{i} + q_{0} k_{i})$ . The trick is to apply CKKS’s batch-encoding mapping $σ^{- 1}$ (which represents the transformation $\vec{m} = \frac{\tilde{W} \cdot I_{n}^{R} \cdot {\vec{v}}^{'}}{n}$ as explained in §D-3.1) to the input vector slots that encode the polynomial $Δ M + E + q_{0} K mod q_{L}$ . Let ${\vec{v}}_{c}$ be the input vector that corresponds to the polynomial $Δ M + E + q_{0} K$ . Then, ${\vec{v}}_{c}$ and $Δ M + E + q_{0} K$ satisfy the following relation over the encoding mapping $σ^{- 1}$ :

$σ^{- 1} ({\vec{v}}_{c}) = M_{c} = \sum_{i = 0}^{n - 1} (Δ m_{i} + e_{i} + q_{0} k_{i}) \cdot X^{i}$ # i.e., polynomial $Δ M + E + q_{0} K$

This implies that if we homomorphically apply the $σ^{- 1}$ transformation to the elements of the input vector ${\vec{v}}_{c}$ , then the resulting input vector ${\vec{v}}_{s}$ will store ${\vec{v}}_{c}$ ’s encoded polynomial coefficient values as follows:

$σ^{- 1} \circ {\vec{v}}_{c} = {\vec{v}}_{s} = (Δ m_{0} + e_{0} + q_{0} k_{0}, Δ m_{1} + e_{1} + q_{0} k_{1}, \dots, Δ m_{n - 1} + e_{n - 1} + q_{0} k_{n - 1})$

# where $\circ$ represents a linear transformation operation comprising $(+,⋅)$

However, remember that at the end of the ModRaise step, we get the decrypted (but corrupted by $q_{0} K$ ) polynomial $M_{c} = {𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B)) = Δ M + E + q_{0} K$ and we are not allowed to decode it into ${\vec{v}}_{c}$ . Therefore, we will instead encode the matrix $\tilde{W} \cdot I_{n}^{R}$ of the encoding transformation $σ^{- 1}$ ( $\vec{m} = \frac{\tilde{W} \cdot I_{n}^{R} \cdot {\vec{v}}^{'}}{n}$ ) into its equivalent polynomials (treating a matrix as a collection of vectors) and then perform batched slot-wise $(+,⋅)$ operations between $M_{c}$ and the polynomial version of $\tilde{W} \cdot I_{n}^{R}$ . We express this polynomial-based computation as follows:

$M_{s} = σ_{σ^{- 1}}^{- 1} \circ M_{c} mod q_{L}$ # $σ_{σ^{- 1}}^{- 1}$ is the polynomial-encoded version of the $σ^{- 1}$ transformation

Then, the resulting polynomial $M_{s}$ ’s corresponding input vector slots (i.e., the decoded version of $M_{s}$ ) will store ${\vec{v}}_{s} = (Δ m_{0} + e_{0} + q_{0} k_{0}, Δ m_{1} + e_{1} + q_{0} k_{1}, \dots, Δ m_{n - 1} + e_{n - 1} + q_{0} k_{n - 1})$ . In other words, the above computation effectively moves the coefficients of $M_{c}$ to the input vector slots of a new plaintext polynomial.

However, remember that in CKKS, an input vector can store only up to $\frac{n}{2}$ slots, whereas we need to store a total of $n$ coefficients of $M_{c}$ in the input vector slots. Therefore, we technically need to create 2 pieces of $M_{s}$ as $M_{s 1}$ and $M_{s 2}$ , where the input vector of $M_{s 1}$ stores $(Δ m_{0} + e_{0} + q_{0} k_{0}, Δ m_{1} + e_{1} + q_{0} k_{1}, \dots, Δ m_{\frac{n}{2} - 1} + e_{\frac{n}{2} - 1} + q_{0} k_{\frac{n}{2} - 1})$ , and the input vector of $M_{s 2}$ stores $(Δ m_{\frac{n}{2}} + e_{\frac{n}{2}} + q_{0} k_{\frac{n}{2}}, \dots, Δ m_{n - 1} + e_{n - 1} + q_{0} k_{n - 1})$ .

EvalExp: Our next step is to update each element $Δ m_{i} + e_{i} + q_{0} k_{i}$ of ${\vec{v}}_{s}$ to $Δ m_{i} + e_{i}$ by evaluating it with the sine function $f (x)$ . Since the output of the CoeffToSlot step is the polynomial $M_{s}$ (technically $M_{s 1}$ and $M_{s 2}$ ), we need to apply the evaluation in an encoded form. First, we approximate $f (x)$ as a polynomial comprising only $(+,⋅)$ operations by using the Taylor series and Euler’s formula (explained in §D-3.13.4). Then, we encode (i.e., $σ^{- 1}$ ) the approximated formula into a polynomial form, and we denote it as $σ_{f}^{- 1}$ . Finally, we apply the $σ_{f}^{- 1}$ transformation to $M_{s}$ as follows:

$σ_{f}^{- 1} \circ M_{s} mod q_{L}$ # Applying the approximated sine function to each slot of ${\vec{v}}_{s}$ , which stores $Δ m_{i} + e_{i} + q_{0} k_{i}$

$= M_{t} = σ^{- 1} ({\vec{v}}_{t}) = σ^{- 1} ({(Δ m_{i} + e_{i})}_{i = 0}^{n - 1}) mod q_{L}$

After evaluating the approximated sine function, notice that each $q_{0} k_{i}$ term gets eliminated from the slots of ${\vec{v}}_{s}$ (i.e., reduction modulo $q_{0}$ ) and the resulting vector ${\vec{v}}_{t}$ stores only the $Δ m_{i} + e_{i}$ terms.

SlotToCoeff: We now have a polynomial $M_{t}$ whose corresponding input vector ${\vec{v}}_{t}$ stores the garbage-removed coefficients (i.e., $Δ m_{i} + e_{i}$ ) of our initial plaintext polynomial. Our next step is to put these coefficients stored in ${\vec{v}}_{t}$ back into the polynomial. This is the exact reverse operation of CoeffToSlot:

$σ_{σ}^{- 1} \circ M_{t} = M_{b}$ # $σ_{σ}^{- 1}$ is a polynomial-encoded form of the batch-decoding formula ${\vec{v}}^{'} = {\tilde{W}}^{*} \cdot \vec{m}$ (§D-3.1)

The result is polynomial $M_{b}$ whose coefficients are garbage-eliminated (i.e., $q_{0} k_{i}$ -free) versions of $M_{c}$ . Finally, we re-encrypt $M_{b}$ as ${𝖱𝖫𝖶𝖤}_{S, σ} (M_{b})$ as the final modulus-bootstrapped ciphertext.

Bootstrapping Without a Secret Key: So far, we have assumed that we have access to the secret key $S$ . With decryption and re-encryption available, the bootstrapping steps described above are mathematically equivalent to computing the following:

1. INPUT: $𝖼𝗍 = (A, B) mod q_{0}$ # where $𝖼𝗍 = (A, B) = {𝖱𝖫𝖶𝖤}_{S, σ} (Δ M)$
2. ModRaise: $𝖼𝗍 = (A, B) mod q_{L}$
3. Decryption: ${𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B)) mod q_{L}$
4. CoeffToSlot: $σ_{σ^{- 1}}^{- 1} \circ {𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B)) mod q_{L}$
5. EvalExp: $σ_{f}^{- 1} \circ (σ_{σ^{- 1}}^{- 1} \circ {𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B))) mod q_{L}$
6. SlotToCoeff: $σ_{σ}^{- 1} \circ (σ_{f}^{- 1} \circ (σ_{σ^{- 1}}^{- 1} \circ {𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B)))) mod q_{L}$
7. Re-encryption: ${𝖱𝖫𝖶𝖤}_{S, σ} (σ_{σ}^{- 1} \circ (σ_{f}^{- 1} \circ (σ_{σ^{- 1}}^{- 1} \circ {𝖱𝖫𝖶𝖤}^{- 1} (𝖼𝗍 = (A, B))))) mod q_{L}$

However, the ultimate goal of CKKS bootstrapping is to reset the modulus of a ciphertext from $q_{0}$ to $q_{L}$ without having access to $S$ .

Meanwhile, one important insight is that CKKS’s ModRaise procedure on the ciphertext $(A, B) mod q_{0}$ from $q_{0} \to q_{L}$ effectively transforms the ciphertext into a new one which is an encryption of $Δ M + q_{0} K$ . Before ModRaise, ciphertext $(A, B) mod q_{0}$ ’s decryption relation is as follows:

$A \cdot S + B = Δ M + E + K q_{0} mod q_{0} = Δ M + E$

After ModRaise to $(A, B) mod q_{L}$ , its decryption relation is as follows:

$A \cdot S + B = Δ M + E + K q_{0} mod q_{L} = Δ M + E + K q_{0}$ # because $Δ M + E + K q_{0} ≪ q_{L}$

Therefore, the mod-raised ciphertext $(A, B) mod q_{L} = {𝖱𝖫𝖶𝖤}_{S, σ} (Δ M + K q_{0})$ with noise $E$ . Thus, CKKS’s homomorphic bootstrapping strategy is to run the subsequent CoeffToSlot, EvalExp, and SlotToCoeff steps homomorphically based on the ciphertext $(A, B) mod q_{L}$ . Running these 3 steps consumes a few multiplicative levels due to the ciphertext-to-ciphertext multiplication operations when homomorphically multiplying the coefficient-to-slot and slot-to-coefficient transformation matrices and homomorphically computing powers of $X$ (i.e., $X^{k}$ ) during sine approximation. Therefore, upon completion of these 3 steps, the ciphertext modulus reduces from $q_{L} \to q_{l}$ (where $l$ is some integer such that $l < L$ ).

Note that the result of homomorphic bootstrapping is equal to the explicit bootstrapping based on decryption & re-encryption (if we ignore the small differences in the final ciphertext modulus and the noise). In the following subsections, we will explain the algebraic details of CoeffToSlot, EvalExp and SlotToCoeff steps.

D-3.13.3 Details: CoeffToSlot

Homomorphically moving the coefficients of $M_{c}$ (i.e., $Δ m_{i} + e_{i} + q_{0} k_{i}$ for $0 \leq i \leq n - 1$ ) to a new ciphertext’s input vector slots is mathematically equivalent to homomorphically computing $σ_{σ^{- 1}}^{- 1} \circ {𝖱𝖫𝖶𝖤}_{S, σ} (𝖼𝗍 = (A, B))$ , which is equivalent to applying the encoding formula to the input vector slot values of ${𝖱𝖫𝖶𝖤}_{S, σ} (𝖼𝗍 = (A, B))$ .

As explained in Summary D-3.9 (in §D-3.9), the encoding formula for converting an input vector into a list of polynomial coefficients is $\vec{m} = \frac{\tilde{W} \cdot I_{n}^{R} \cdot {\vec{v}}^{'}}{n}$ , where $\tilde{W}$ is a basis of the $n$ -dimensional vector space crafted as follows:

$\tilde{W} = [\begin{matrix} 1 & 1 & \dots & 1 & 1 & 1 & \dots & 1 \\ (ω^{J (\frac{n}{2} - 1)}) & (ω^{J (\frac{n}{2} - 2)}) & \dots & (ω^{J (0)}) & (ω^{J_{*} (\frac{n}{2} - 1)}) & (ω^{J_{*} (\frac{n}{2} - 2)}) & \dots & (ω^{J_{*} (0)}) \\ {(ω^{J (\frac{n}{2} - 1)})}^{2} & {(ω^{J (\frac{n}{2} - 2)})}^{2} & \dots & {(ω^{J (0)})}^{2} & {(ω^{J_{*} (\frac{n}{2} - 1)})}^{2} & {(ω^{J_{*} (\frac{n}{2} - 2)})}^{2} & \dots & {(ω^{J_{*} (0)})}^{2} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ {(ω^{J (\frac{n}{2} - 1)})}^{n - 1} & {(ω^{J (\frac{n}{2} - 2)})}^{n - 1} & \dots & {(ω^{J (0)})}^{n - 1} & {(ω^{J_{*} (\frac{n}{2} - 1)})}^{n - 1} & {(ω^{J_{*} (\frac{n}{2} - 2)})}^{n - 1} & \dots & {(ω^{J_{*} (0)})}^{n - 1} \end{matrix}]$

# where the rotation helper function $J (h) = 5^{h} mod 2 n$

Therefore, given the input ciphertext ${𝖼𝗍}_{𝖼} = {𝖱𝖫𝖶𝖤}_{S, σ} (𝖼𝗍 = (A, B)) mod q_{L}$ whose plaintext polynomial $M_{c}$ contains corrupted coefficients, computing $σ_{σ^{- 1}}^{- 1} \circ {𝖼𝗍}_{𝖼}$ is equivalent to computing $\frac{\tilde{W} \cdot I_{n}^{R} \cdot 𝖼𝗍_{𝖼}}{n}$ . However, one problem here is that each CKKS ciphertext encodes only $\frac{n}{2}$ input vector slots, whereas our goal is to move all $n$ (corrupted) coefficients of the plaintext polynomial $M_{c}$ encrypted in $𝖼𝗍_{𝖼}$ . Therefore, we will instead generate 2 ciphertexts, ${𝖼𝗍}_{𝗌1}$ and ${𝖼𝗍}_{𝗌2}$ , such that ${𝖼𝗍}_{𝗌1}$ ’s input vector slots store ${(Δ m_{i} + e_{i} + q_{0} k_{i})}_{0 \leq i < \frac{n}{2}}$ and ${𝖼𝗍}_{𝗌2}$ ’s input vector slots store ${(Δ m_{i} + e_{i} + q_{0} k_{i})}_{\frac{n}{2} \leq i < n}$ .

We split the $n \times n$ matrix $\tilde{W} \cdot I_{n}^{R}$ into four $\frac{n}{2} \times \frac{n}{2}$ matrices as follows:

${[\tilde{W} I_{n}^{R}]}_{11}$ : a matrix comprising the upper left-half section of $\tilde{W} \cdot I_{n}^{R}$
${[\tilde{W} I_{n}^{R}]}_{12}$ : a matrix comprising the upper right-half section of $\tilde{W} \cdot I_{n}^{R}$
${[\tilde{W} I_{n}^{R}]}_{21}$ : a matrix comprising the lower left-half section of $\tilde{W} \cdot I_{n}^{R}$
${[\tilde{W} I_{n}^{R}]}_{22}$ : a matrix comprising the lower right-half section of $\tilde{W} \cdot I_{n}^{R}$

Then, we can compute ${𝖼𝗍}_{𝗌1}$ and ${𝖼𝗍}_{𝗌2}$ as follows:

${𝖼𝗍}_{𝗌1} = \frac{{[\tilde{W} I_{n}^{R}]}_{11} \cdot 𝖼𝗍_{𝖼} + {[\tilde{W} I_{n}^{R}]}_{12} \cdot I_{\frac{n}{2}}^{R} \cdot \bar{𝖼𝗍_{𝖼}}}{n}$

${𝖼𝗍}_{𝗌2} = \frac{{[\tilde{W} I_{n}^{R}]}_{21} \cdot 𝖼𝗍_{𝖼} + {[\tilde{W} I_{n}^{R}]}_{22} \cdot I_{\frac{n}{2}}^{R} \cdot \bar{𝖼𝗍_{𝖼}}}{n}$

Each homomorphic matrix-vector multiplication (e.g., ${[\tilde{W} I_{n}^{R}]}_{21} \cdot 𝖼𝗍_{𝖼}$ ) can be done in an efficient manner that reduces the number of homomorphic rotations (§D-2.10). $\bar{𝖼𝗍_{𝖼}}$ can be computed by applying homomorphic conjugation to $𝖼𝗍_{𝖼}$ (§D-3.11).
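The block structure of CoeffToSlot can be illustrated on cleartext data. The sketch below is an assumption-laden stand-in, not the document's exact matrices: it orders the $n$ roots as $ω^{J (h)}$ followed by their conjugates and uses the generic inverse $U = V^{H} / n$ of the Vandermonde matrix $V$ in place of $n^{- 1} \cdot \tilde{W} \cdot I_{n}^{R}$ , so no reversal matrix $I^{R}$ appears. It shows that the first and second halves of the coefficients are obtained from the slot vector and its conjugate via four $\frac{n}{2} \times \frac{n}{2}$ blocks.

```python
import cmath

n = 8
half = n // 2
omega = cmath.exp(1j * cmath.pi / n)                  # primitive 2n-th root of unity
roots = [omega ** pow(5, h, 2 * n) for h in range(half)]   # omega^{J(h)}, J(h) = 5^h mod 2n
roots += [r.conjugate() for r in roots]               # conjugate roots (our own ordering)

def decode(coeffs):
    """Coefficients -> n/2 slots: evaluate the polynomial at the first n/2 roots."""
    return [sum(c * r ** i for i, c in enumerate(coeffs)) for r in roots[:half]]

# U = V^H / n with V[j][i] = roots[j]^i, hence U[i][j] = conj(roots[j])^i / n.
U = [[roots[j].conjugate() ** i / n for j in range(n)] for i in range(n)]

def block_matvec(rows, cols, vec):
    """Multiply the sub-block U[rows, cols] by vec."""
    return [sum(U[i][j] * x for j, x in zip(cols, vec)) for i in rows]

coeffs = [3.0, -1.0, 4.0, 1.0, -5.0, 9.0, 2.0, -6.0]  # stands for Delta*m_i + e_i + q0*k_i
v = decode(coeffs)                                    # slot view of the polynomial
v_bar = [z.conjugate() for z in v]                    # what homomorphic conjugation would give

top, bottom = range(half), range(half, n)
s1 = [a + b for a, b in zip(block_matvec(top, top, v), block_matvec(top, bottom, v_bar))]
s2 = [a + b for a, b in zip(block_matvec(bottom, top, v), block_matvec(bottom, bottom, v_bar))]
print([round(z.real, 6) for z in s1], [round(z.real, 6) for z in s2])
```

In the homomorphic setting the same four block products would be applied to $𝖼𝗍_{𝖼}$ and its conjugate, which is why two output ciphertexts are needed.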

D-3.13.4 Details: EvalExp

We will use the sine function $f (x) = \frac{q_{0}}{2 π} \cdot \sin (\frac{2 𝜋𝑥}{q_{0}})$ to approximately eliminate $q_{0} k_{i}$ from $Δ m_{i} + e_{i} + q_{0} k_{i}$ by computing $f (Δ m_{i} + e_{i} + q_{0} k_{i}) \approx Δ m_{i} + e_{i}$ . This approximation works if $Δ m_{i} + e_{i}$ is very close to $x = 0$ relative to $q_{0}$ (i.e., $Δ m_{i} + e_{i} ≪ q_{0}$ ). Still, the elimination of $q_{0} k_{i}$ is approximate (i.e., $\approx Δ m_{i} + e_{i}$ ), because $f (x)$ is $y \approx x$ near $x = 0$ , not exactly $y = x$ .

One issue is that we need to evaluate $f (x)$ homomorphically based on ct_s1 and ct_s2 as inputs (i.e., $f (𝖼𝗍_{𝗌1})$ and $f (𝖼𝗍_{𝗌2})$ ), but FHE supports only $(+,⋅)$ operations, whereas the sine graph cannot be formulated by only $(+,⋅)$ . Therefore, we will approximate the sine function $f (x)$ by using the Taylor series (§A-14):

$f (x) = f (a) + \frac{f^{'} (a)}{1!} (x - a) + \frac{f^{″} (a)}{2!} {(x - a)}^{2} + \frac{f^{‴} (a)}{3!} {(x - a)}^{3} + \dots = \sum_{d = 0}^{\infty} \frac{f^{(d)} (a)}{d!} {(x - a)}^{d}$

If we approximate $f (x)$ around $x = 0$ , then the approximated polynomial is as follows:

$f (x) = \frac{q_{0}}{2 π} \cdot \sin (\frac{2 π}{q_{0}} \cdot 0) + \frac{q_{0}}{2 π} \cdot \frac{2 π}{q_{0}} \cdot \frac{\cos (\frac{2 π}{q_{0}} \cdot 0)}{1!} \cdot x + \frac{q_{0}}{2 π} \cdot {(\frac{2 π}{q_{0}})}^{2} \cdot \frac{- \sin (\frac{2 π}{q_{0}} \cdot 0)}{2!} \cdot x^{2} + \dots$

$= \frac{q_{0}}{2 π} \cdot \sum_{j = 0}^{\infty} (\frac{{(- 1)}^{j}}{(2 j + 1)!} \cdot {(\frac{2 𝜋𝑥}{q_{0}})}^{2 j + 1})$

$\approx \frac{q_{0}}{2 π} \cdot \sum_{j = 0}^{h} (\frac{{(- 1)}^{j}}{(2 j + 1)!} \cdot {(\frac{2 𝜋𝑥}{q_{0}})}^{2 j + 1}) = \hat{f} (x)$

, where $\hat{f} (x)$ is a $(2 h + 1)$ -degree polynomial.
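As a numeric sanity check of the truncated series, the sketch below evaluates the degree- $(2 h + 1)$ Taylor polynomial in plain floating point and measures how far it is from the small part of the input. The values $q_{0} = 2^{20}$ , `t = 100.0` (standing for $Δ m_{i} + e_{i}$ ), and the helper names are illustrative assumptions.

```python
import math

def sine_taylor(x, q0, h):
    """Degree-(2h+1) Taylor polynomial of (q0 / 2pi) * sin(2pi * x / q0) around x = 0."""
    theta = 2 * math.pi * x / q0
    total = sum((-1) ** j * theta ** (2 * j + 1) / math.factorial(2 * j + 1)
                for j in range(h + 1))
    return q0 / (2 * math.pi) * total

q0 = 2**20
t = 100.0  # stands for Delta*m_i + e_i

def worst_error(k_max, h):
    """Largest |f_hat(t + q0*k) - t| over all wrap-around counts |k| <= k_max."""
    return max(abs(sine_taylor(t + q0 * k, q0, h) - t) for k in range(-k_max, k_max + 1))

print("no wrap-around, degree 3:  ", worst_error(0, 1))
print("|k| <= 2,       degree 11: ", worst_error(2, 5))
print("|k| <= 2,       degree 61: ", worst_error(2, 30))
```

With no wrap-around a cubic already suffices, while covering even two periods on each side requires a much higher degree; this is the cost that the rest of this subsection works to reduce.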

Remember that in the RLWE cryptosystem, $B + 𝐴𝑆 mod q_{0} = Δ M + E$ , or $B + 𝐴𝑆 = Δ M + E + q_{0} K$ with some polynomial $K$ representing the wrap-around multiples of $q_{0}$ . Since the secret key $S$ is an $(n - 1)$ -degree polynomial whose coefficients are small (i.e., $s_{i} \in \{- 1, 0, 1\}$ ), the coefficients of $K$ have some reasonably small upper bound $𝑘_{𝑚𝑎𝑥}$ , which decreases with the sparsity of $S$ (i.e., the frequency of 0 coefficients in $S$ ). Therefore, the degree of our approximated $\hat{f} (x)$ only needs to be high enough to accurately evaluate $y$ values for $- q_{0} \cdot 𝑘_{𝑚𝑎𝑥} \leq x \leq q_{0} \cdot 𝑘_{𝑚𝑎𝑥}$ . The required minimum degree of our approximated Taylor polynomial $\hat{f} (x)$ increases with $q_{0} 𝑘_{𝑚𝑎𝑥}$ (i.e., the upper bound of $x$ ). One issue is that the computation overhead for homomorphic evaluation of a polynomial grows with the degree of the polynomial, which will slow down bootstrapping. To reduce this computation cost, we will leverage Euler’s formula (§A-11) and its square arithmetic:

$\begin{cases} e^{i \cdot 𝜃} = \cos 𝜃 + i \cdot \sin 𝜃 \\ {(e^{i \cdot 𝜃})}^{2} = e^{i \cdot 2 𝜃} \end{cases}$

We will apply Euler’s formula with $𝜃 = \frac{2 𝜋𝑥}{q_{0}}$ . We will also approximate $e^{𝑖𝜃}$ with the Taylor series, but instead of directly approximating $e^{𝑖𝜃}$ , we will first approximate $e^{\frac{𝑖𝜃}{2^{r}}}$ for some large $2^{r}$ . After that, we will iteratively square $e^{\frac{𝑖𝜃}{2^{r}}}$ a total of $r$ times. Then, we get an approximation of ${(e^{\frac{𝑖𝜃}{2^{r}}})}^{2^{r}} = e^{𝑖𝜃}$ . The reason why we start with the approximation of $e^{\frac{𝑖𝜃}{2^{r}}}$ instead of $e^{𝑖𝜃}$ is that its approximation requires only a low-degree polynomial, as $\frac{𝜃}{2^{r}}$ (i.e., the input to the complex exponential function) is small provided $2^{r}$ is sufficiently large. Specifically, we learned that $x$ ( $= Δ m_{i} + e_{i} + q_{0} k_{i}$ ) is upper-bounded by roughly $q_{0} 𝑘_{𝑚𝑎𝑥}$ , so $𝜃 = \frac{2 𝜋𝑥}{q_{0}}$ is upper-bounded by roughly $2 π 𝑘_{𝑚𝑎𝑥}$ , and thus $\frac{𝜃}{2^{r}}$ is upper-bounded by roughly $\frac{2 π 𝑘_{𝑚𝑎𝑥}}{2^{r}}$ . As the targeted input range of the approximation is small, a low-degree Taylor polynomial suffices.

Using the Taylor series with degree $d_{0}$ around $x = 0$ , we can approximate $e^{\frac{2 𝜋𝑖𝑥}{2^{r} q_{0}}}$ as:

$f_{e} (x) = e^{\frac{2 𝜋𝑖𝑥}{2^{r} q_{0}}} \approx \sum_{d = 0}^{d_{0}} \frac{1}{d!} {(\frac{2 𝜋𝑖𝑥}{2^{r} q_{0}})}^{d} = {\hat{f}}_{e} (x)$

Then, we iteratively square ${\hat{f}}_{e}$ a total of $r$ times to get:

${({\hat{f}}_{e} (x))}^{2^{r}} \approx {(f_{e} (x))}^{2^{r}} = e^{i \frac{2 𝜋𝑥}{q_{0}}} = e^{𝑖𝜃}$
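The benefit of scaling the input down by $2^{r}$ before squaring can be seen in a cleartext sketch. The values below ( $r = 10$ , $d_{0} = 7$ , and an input with $\frac{x}{q_{0}} = 12.3$ ) are illustrative assumptions rather than recommended parameters, and the computation is ordinary complex floating-point arithmetic, not a homomorphic evaluation.

```python
import cmath
import math

def exp_taylor(z, d0):
    """Degree-d0 Taylor polynomial of e^z around z = 0."""
    return sum(z ** d / math.factorial(d) for d in range(d0 + 1))

def exp_by_squaring(theta, r, d0):
    """Approximate e^{i*theta} as (Taylor(e^{i*theta / 2^r}))^(2^r) using r squarings."""
    y = exp_taylor(1j * theta / 2 ** r, d0)
    for _ in range(r):
        y = y * y
    return y

theta = 2 * math.pi * 12.3          # theta = 2*pi*x/q0 with x/q0 = 12.3 (assumed)
exact = cmath.exp(1j * theta)
scaled = exp_by_squaring(theta, r=10, d0=7)
direct = exp_taylor(1j * theta, 7)  # same degree, but without scaling the input down

print("error with scaling + squaring:", abs(scaled - exact))
print("error of direct degree-7 Taylor:", abs(direct - exact))
```

The same degree-7 polynomial is useless on the full range of $𝜃$ but accurate on $\frac{𝜃}{2^{r}}$ ; the price is $r$ additional squarings, each of which consumes a multiplicative level when done homomorphically.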

Then, based on Euler’s formula $e^{i \cdot 𝜃} = \cos 𝜃 + i \cdot \sin 𝜃$ , we can derive the following relations:

$\bar{e^{i \cdot 𝜃}} = \cos 𝜃 + \bar{i \cdot \sin 𝜃}$

$e^{- i \cdot 𝜃} = \cos 𝜃 - i \cdot \sin 𝜃$

$e^{i \cdot 𝜃} - e^{- i \cdot 𝜃} = (\cos 𝜃 + i \cdot \sin 𝜃) - (\cos 𝜃 - i \cdot \sin 𝜃) = 2 i \sin 𝜃$

$\sin 𝜃 = \frac{- i}{2} \cdot (e^{i \cdot 𝜃} - e^{- i \cdot 𝜃})$

$\frac{q_{0}}{2 π} \cdot \sin 𝜃 = \frac{q_{0}}{2 π} \cdot \frac{- i}{2} \cdot (e^{i \cdot 𝜃} - e^{- i \cdot 𝜃})$

Substituting $𝜃 = \frac{2 𝜋𝑥}{q_{0}}$ , we finally get:

$\frac{q_{0}}{2 π} \cdot \sin (\frac{2 𝜋𝑥}{q_{0}}) = \frac{q_{0}}{2 π} \cdot \frac{- i}{2} \cdot (e^{i \cdot \frac{2 𝜋𝑥}{q_{0}}} - e^{- i \cdot \frac{2 𝜋𝑥}{q_{0}}})$

Using the final relation above, the EvalExp step homomorphically evaluates the approximation of $\frac{q_{0}}{2 π} \cdot \sin (\frac{2 𝜋𝑥}{q_{0}})$ where $x = Δ m_{i} + e_{i} + q_{0} k_{i}$ as follows:

1. Homomorphically compute ${({\hat{f}}_{e} (x))}^{2^{r}}$ , which approximates $e^{i \cdot \frac{2 𝜋𝑥}{q_{0}}}$ .
2. Homomorphically compute $\bar{{({\hat{f}}_{e} (x))}^{2^{r}}}$ , which approximates $e^{- i \cdot \frac{2 𝜋𝑥}{q_{0}}}$ , by applying homomorphic conjugation (§D-3.11) to the result of step 1.
3. Homomorphically subtract the result of step 2 from the result of step 1, which approximates $e^{i \cdot \frac{2 𝜋𝑥}{q_{0}}} - e^{- i \cdot \frac{2 𝜋𝑥}{q_{0}}}$ , and then multiply the result by $\frac{q_{0}}{2 π} \cdot \frac{- i}{2}$ encoded as a CKKS plaintext.
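The three steps above can be mimicked on cleartext values to confirm that they remove the multiples of $q_{0}$ . In this sketch, `cmath.exp` is used as an exact stand-in for the approximated exponential of step 1, and $q_{0} = 2^{20}$ and `t = 100.0` (standing for $Δ m_{i} + e_{i}$ ) are assumed toy values.

```python
import cmath
import math

q0 = 2**20

def eval_mod_q0(x):
    """Cleartext analogue of EvalExp for one slot value x = Delta*m + e + q0*k."""
    e_pos = cmath.exp(2j * math.pi * x / q0)      # step 1 (exact exponential as a stand-in)
    e_neg = e_pos.conjugate()                     # step 2: conjugation gives e^{-i*theta}
    sine = (-1j / 2) * (e_pos - e_neg)            # step 3: (-i/2) * (e^{i*theta} - e^{-i*theta})
    return (q0 / (2 * math.pi) * sine).real       # scale back by q0 / (2*pi)

t = 100.0                                         # stands for Delta*m_i + e_i
outputs = [eval_mod_q0(t + q0 * k) for k in (-2, 0, 3)]
print(outputs)
```

All three inputs differ only by a multiple of $q_{0}$ and map to nearly the same output, close to `t`; the small remaining gap is the sine-versus-line error discussed in this subsection.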

The result of EvalExp is two ciphertexts whose input vector slots store the bootstrapped coefficients of $M_{c}$ , which have been reduced modulo $q_{0}$ from $Δ m_{i} + e_{i} + q_{0} k_{i}$ to $Δ m_{i} + e_{i} + e_{𝑏𝑖}$ . Note that $e_{𝑏𝑖}$ is a bootstrapping error introduced by the following three factors: (1) the intrinsic homomorphic $(+,⋅)$ computation noises of the CoeffToSlot, EvalExp, and SlotToCoeff steps; (2) the EvalExp step’s Taylor polynomial approximation error of the exponential function $e^{𝑖𝜃}$ ; (3) the EvalExp step’s sine graph error, since the graph is not exactly $y = x$ around $x = 0$ , but only $y \approx x$ .

Note that since the output of the CoeffToSlot step was split into 2 ciphertexts (ct_s1 and ct_s2), the output of the EvalExp step is also in 2 ciphertexts: (ct_b1 and ct_b2). The input vector slots of ct_b1 store ${(Δ m_{i} + e_{i} + e_{𝑏𝑖})}_{i = 0}^{\frac{n}{2} - 1}$ , whereas the input vector slots of ct_b2 store ${(Δ m_{i} + e_{i} + e_{𝑏𝑖})}_{i = \frac{n}{2}}^{n - 1}$ .

D-3.13.5 Details: SlotToCoeff

This step is the exact inverse of the CoeffToSlot step: it moves the bootstrapped (i.e., reduced modulo $q_{0}$ ) coefficients of $M_{v}$ stored in the input vector slots back to the final plaintext polynomial $M_{f}$ . Remember that the decoding formula from a polynomial to an input vector (§D-3.1) is ${\vec{v}}^{'} = {\tilde{W}}^{*} \cdot \vec{m}$ , where:

${\tilde{W}}^{*} = [\begin{matrix} 1 & (ω^{J (0)}) & {(ω^{J (0)})}^{2} & \dots & {(ω^{J (0)})}^{n - 1} \\ 1 & (ω^{J (1)}) & {(ω^{J (1)})}^{2} & \dots & {(ω^{J (1)})}^{n - 1} \\ 1 & (ω^{J (2)}) & {(ω^{J (2)})}^{2} & \dots & {(ω^{J (2)})}^{n - 1} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & (ω^{J (\frac{n}{2} - 1)}) & {(ω^{J (\frac{n}{2} - 1)})}^{2} & \dots & {(ω^{J (\frac{n}{2} - 1)})}^{n - 1} \\ 1 & (ω^{J_{*} (\frac{n}{2} - 1)}) & {(ω^{J_{*} (\frac{n}{2} - 1)})}^{2} & \dots & {(ω^{J_{*} (\frac{n}{2} - 1)})}^{n - 1} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & (ω^{J_{*} (1)}) & {(ω^{J_{*} (1)})}^{2} & \dots & {(ω^{J_{*} (1)})}^{n - 1} \\ 1 & (ω^{J_{*} (0)}) & {(ω^{J_{*} (0)})}^{2} & \dots & {(ω^{J_{*} (0)})}^{n - 1} \end{matrix}]$

We denote ${[{\tilde{W}}^{*}]}_{11}$ , ${[{\tilde{W}}^{*}]}_{12}$ , ${[{\tilde{W}}^{*}]}_{21}$ , and ${[{\tilde{W}}^{*}]}_{22}$ as the $\frac{n}{2} \times \frac{n}{2}$ matrices corresponding to the upper-left, upper-right, lower-left, and lower-right sections of ${\tilde{W}}^{*}$ . Then, homomorphically applying the decoding formula results in the final bootstrapped ciphertext ct_final modulo $q_{l}$ whose plaintext polynomial is garbage-eliminated from $Δ M + E + q_{0} K mod q_{L}$ to $Δ M + E + E_{b} mod q_{l}$ (where $E_{b}$ is the bootstrapping error polynomial). Note that the ciphertext modulus changed from $q_{L} \to q_{l}$ because we consumed some multiplicative levels for computing ciphertext-to-ciphertext multiplications during polynomial evaluation (i.e., $X^{k}$ ). We can derive ct_final by homomorphically applying the decoding transformation ${\tilde{W}}^{*}$ to the results of the EvalExp step (ct_b1 and ct_b2) as follows:

$𝖼𝗍_{𝖿𝗂𝗇𝖺𝗅} = {𝖱𝖫𝖶𝖤}_{S, σ} (Δ M + E + E_{b}) = {[{\tilde{W}}^{*}]}_{11} \cdot 𝖼𝗍_{𝖻1} + {[{\tilde{W}}^{*}]}_{12} \cdot 𝖼𝗍_{𝖻2}$

Note that we do not use ${[{\tilde{W}}^{*}]}_{21}$ and ${[{\tilde{W}}^{*}]}_{22}$ , because we only need to derive the first $\frac{n}{2}$ input vector slots, which correspond to the final $(n - 1)$ -degree polynomial with the $n$ coefficients ${(Δ m_{i} + e_{i} + e_{𝑏𝑖})}_{i = 0}^{n - 1}$ . Once we generate a new ciphertext ct_final with these $\frac{n}{2}$ input vector slots, its latter $\frac{n}{2}$ conjugate slots are automatically filled with the conjugates of the first $\frac{n}{2}$ slot values.

D-3.13.6 Reducing the Bootstrapping Overhead by Sparsely Packing Ciphertext

In many cases, an application of CKKS may use only a small number of input vector slots (e.g., $\frac{n^{'}}{2}$ ) out of the $\frac{n}{2}$ slots. Suppose that $n^{'}$ is a number that divides $n$ . Then, we can do a series of homomorphic rotations and additions to make the input vector slots store $\frac{n}{n^{'}}$ repetitions of the $\frac{n^{'}}{2}$ -slot values. Specifically, we can do this in a total of $\log_{2} \frac{n}{n^{'}}$ rounds of rotations and additions: initially, we zero-mask the slots from the $\frac{n^{'}}{2}$ -th to the $(\frac{n}{2} - 1)$ -th and save the result as ct, and then in each $i$ -th round (for $i = 0, 1, \dots$ ) we compute $𝖼𝗍 = 𝖼𝗍 + 𝖱𝗈𝗍𝖺𝗍𝖾 (𝖼𝗍, - \frac{n^{'}}{2} \cdot 2^{i})$ , which doubles the number of repetitions.
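The rotate-and-add replication can be simulated on a plain list. This is a sketch under stated assumptions: the data occupies the first $\frac{n^{'}}{2}$ slots, the remaining slots are already zero-masked, $\frac{n}{n^{'}}$ is a power of two, and each round rotates by the number of slots filled so far (so the filled region doubles per round). The `rotate` helper is a hypothetical cleartext stand-in for homomorphic rotation.

```python
def rotate(slots, r):
    """Cyclic rotation: positive r shifts entries left, negative r shifts them right."""
    r %= len(slots)
    return slots[r:] + slots[:r]

def replicate(values, total_slots):
    """Fill total_slots with repetitions of values using doubling rotate-and-add rounds."""
    used = len(values)
    ct = values + [0] * (total_slots - used)       # zero-masked tail
    rounds = 0
    while used < total_slots:
        ct = [a + b for a, b in zip(ct, rotate(ct, -used))]  # copy filled region to the right
        used *= 2
        rounds += 1
    return ct, rounds

n, n_prime = 32, 4
ct, rounds = replicate([7, 9], n // 2)             # n'/2 = 2 values into n/2 = 16 slots
print(ct, rounds)
```

Under these assumptions the slot vector ends up holding $\frac{n}{n^{'}} = 8$ copies after 3 rounds, i.e., the round count grows logarithmically in the number of repetitions.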

Then, we apply the optimization of sparsely packing ciphertext in Summary D-3.12 (§D-3.12): if an $\frac{n}{2}$ -dimensional input vector is structured as $\frac{n}{n^{'}}$ consecutive repetitions of the first $\frac{n^{'}}{2}$ slot values, then its encoded polynomial $M (X) \in ℤ [X] ∕ (X^{n} + 1)$ has the structure such that all its coefficients whose degree term is not a multiple of $\frac{n}{n^{'}}$ are zero as follows:

$M (X) = c_{0} + c_{\frac{n}{n^{'}}} X^{\frac{n}{n^{'}}} + c_{\frac{2 n}{n^{'}}} X^{\frac{2 n}{n^{'}}} + \dots + c_{n - \frac{n}{n^{'}}} X^{n - \frac{n}{n^{'}}}$ .
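The sparsity pattern of $M (X)$ can be checked numerically on a small instance. The sketch below assumes that slot $h$ corresponds to the root $ω^{J (h)}$ with $J (h) = 5^{h} mod 2 n$ , uses the generic inverse of the evaluation map instead of the document's exact matrix layout, and omits the scaling by $Δ$ and rounding; $n = 16$ and $n^{'} = 4$ are toy values.

```python
import cmath

n, n_prime = 16, 4
half = n // 2
omega = cmath.exp(1j * cmath.pi / n)                       # primitive 2n-th root of unity
roots = [omega ** pow(5, h, 2 * n) for h in range(half)]   # slot h <-> omega^{5^h mod 2n}

def encode(slots):
    """Slots -> real coefficients (inverse of evaluating at the roots and their conjugates)."""
    coeffs = []
    for i in range(n):
        acc = sum(v * r.conjugate() ** i for v, r in zip(slots, roots))
        coeffs.append(2 * acc.real / n)    # the conjugate half adds the complex conjugate of acc
    return coeffs

block = [1.5 - 2.0j, 0.25 + 4.0j]          # n'/2 = 2 slot values
slots = block * (n // n_prime)             # repeated n/n' = 4 times to fill n/2 = 8 slots
coeffs = encode(slots)
step = n // n_prime
print([round(c, 6) for c in coeffs])
```

Only the coefficients at degrees $0, 4, 8, 12$ (multiples of $\frac{n}{n^{'}}$ ) can be non-zero here, matching the structure of $M (X)$ shown above.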

Remember that in the CoeffToSlot step (§D-3.13.3), we use the formula $\vec{m} = \frac{\tilde{W} \cdot I_{n}^{R} \cdot {\vec{v}}^{'}}{n}$ to move the $q_{0} K$ -contaminated polynomial’s coefficients to the input vector slots. But by the principle of sparsely packed ciphertext, we know that every entry of $\vec{m}$ whose index is not a multiple of $\frac{n}{n^{'}}$ stores a zero coefficient. This means that we will get the same computation result even if we only compute the $\vec{m} = \frac{\tilde{W} \cdot I_{n}^{R} \cdot {\vec{v}}^{'}}{n}$ formula with the rows of $\tilde{W}$ whose row index is a multiple of $\frac{n}{n^{'}}$ . Mathematically, we can update the encoding formula to ${\vec{m}}_{s} = \frac{E \cdot I_{n^{'}}^{R} \cdot {\vec{v}}^{'}}{n^{'}}$ where the $n \times \frac{n}{n^{'}}$ matrix $E$ is an elimination of all those columns from $\tilde{W}$ whose column index is not a multiple of $\frac{n}{n^{'}}$ :

$E = [\begin{matrix} 1 & 1 & \dots & 1 & 1 & \dots & 1 & 1 \\ (ξ^{J (\frac{0 \cdot n}{n^{'}} - n^{'})}) & (ξ^{J (\frac{1 \cdot n}{n^{'}})}) & \dots & (ξ^{J (n - \frac{n}{n^{'}})}) & (ξ^{J_{*} (n - \frac{n}{n^{'}})}) & \dots & (ξ^{J_{*} (\frac{1 \cdot n}{n^{'}})}) & (ξ^{J_{*} (\frac{0 \cdot n}{n^{'}} - n^{'})}) \\ {(ξ^{J (\frac{0 \cdot n}{n^{'}} - n^{'})})}^{2} & {(ξ^{J (\frac{1 \cdot n}{n^{'}} - n^{'})})}^{2} & \dots & {(ξ^{J (n - \frac{n}{n^{'}})})}^{2} & {(ξ^{J_{*} (n - \frac{n}{n^{'}})})}^{2} & \dots & {(ξ^{J_{*} (\frac{1 \cdot n}{n^{'}} - n^{'})})}^{2} & {(ξ^{J_{*} (\frac{0 \cdot n}{n^{'}} - n^{'})})}^{2} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ {(ξ^{J (\frac{0 \cdot n}{n^{'}} - n^{'})})}^{n - 1} & {(ξ^{J (\frac{1 \cdot n}{n^{'}} - n^{'})})}^{n - 1} & \dots & {(ξ^{J (n - \frac{n}{n^{'}})})}^{n - 1} & {(ξ^{J_{*} (n - \frac{n}{n^{'}})})}^{n - 1} & ⋮ & {(ξ^{J_{*} (\frac{1 \cdot n}{n^{'}} - n^{'})})}^{n - 1} & {(ξ^{J_{*} (\frac{0 \cdot n}{n^{'}} - n^{'})})}^{n - 1} \end{matrix}]$

Remember that in the original CoeffToSlot step (§D-3.13.3), we had to split ct_s into ct_s1 and ct_s2 because in CKKS each input vector can store a maximum of $\frac{n}{2}$ slots but we need to move a total of $n$ coefficient values to the input vector slots for bootstrapping. On the other hand, the computation result of the above updated encoding formula (using a sparsely packed ciphertext) is ${\vec{m}}_{s}$ , having only $\frac{n}{n^{'}}$ coefficient slots instead of $n$ coefficient slots, and each slot index $i$ in ${\vec{m}}_{s}$ corresponds to the encoded polynomial’s coefficient with degree term $i \cdot \frac{n}{n^{'}}$ (we do not compute any other coefficient terms, because we know that they are 0 anyway, so there is no need to bootstrap them). And notice that $\frac{n}{n^{'}} \leq \frac{n}{2}$ , because $n^{'}$ divides $n$ . Therefore, without computing two ciphertexts ${𝖼𝗍}_{𝗌1} = {[\tilde{W} I_{n}^{R}]}_{11} \cdot 𝖼𝗍_{𝖼} + {[\tilde{W} I_{n}^{R}]}_{12} \cdot I_{\frac{n}{2}}^{R} \cdot \bar{𝖼𝗍_{𝖼}}$ and ${𝖼𝗍}_{𝗌2} = {[\tilde{W} I_{n}^{R}]}_{21} \cdot 𝖼𝗍_{𝖼} + {[\tilde{W} I_{n}^{R}]}_{22} \cdot I_{\frac{n}{2}}^{R} \cdot \bar{𝖼𝗍_{𝖼}}$ separately, we can directly compute a single ciphertext $𝖼𝗍_{𝗌} = \frac{E \cdot I_{n^{'}}^{R} \cdot 𝖼𝗍_{𝖼}}{n^{'}}$ , because all coefficients for bootstrapping fit in $\frac{n}{2}$ slots. Therefore, the number of homomorphic computations and the memory requirement for the CoeffToSlot step can be reduced by half. The same is true for the EvalExp step (§D-3.13.4).

Similarly, for the SlotToCoeff step (§D-3.13.5), we update the decoding formula ${\vec{v}}_{^{'}} = {\tilde{W}}^{*} \cdot \vec{m}$ to ${\vec{v}}_{^{'}} = E^{T} \cdot {\vec{m}}_{c}$ . This again halves the number of homomorphic computations and the memory requirement of the SlotToCoeff step. Notice that $E^{T}$ is a matrix in which every column whose index is not a multiple of $\frac{n}{n^{'}}$ is zero. Zeroing these columns does not change the computation result, because every slot of ${\vec{m}}_{c}$ whose index is not a multiple of $\frac{n}{n^{'}}$ is zero, so the products involving the corresponding columns of $E^{T}$ would be zero anyway.
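The column-zeroing argument above is plain linear algebra and can be checked numerically. The following is a minimal sketch under stated assumptions: the matrix `dense` is a random stand-in (not the actual $E^{T}$), and the names `stride`, `m_c` and `zero_unused_columns` are hypothetical, with `stride` playing the role of $\frac{n}{n^{'}}$.

```python
import numpy as np

def zero_unused_columns(matrix, stride):
    """Return a copy of `matrix` whose columns at indices that are not
    multiples of `stride` are set to zero."""
    out = matrix.copy()
    keep = np.arange(matrix.shape[1]) % stride == 0
    out[:, ~keep] = 0
    return out

rng = np.random.default_rng(0)
n, stride = 16, 4  # toy sizes (assumption); stride stands in for n / n'
dense = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))

# Sparse vector: only the slots at multiples of `stride` are non-zero.
m_c = np.zeros(n, dtype=complex)
m_c[::stride] = rng.standard_normal(n // stride)

sparse = zero_unused_columns(dense, stride)

# The zeroed columns only ever multiply zero slots, so the product is unchanged.
same_result = bool(np.allclose(dense @ m_c, sparse @ m_c))
nonzero_columns = int(np.count_nonzero(np.any(sparse != 0, axis=0)))
```

Here `same_result` confirms that both matrices give the same product, while `nonzero_columns` counts how many columns actually need to be processed.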

D-3.13.7 Summary

We summarize the CKKS bootstrapping procedure as follows.

$⟨$ Summary D-3.13.7 $⟩$ CKKS Bootstrapping

1.

INPUT: $𝖼𝗍 = (A, B) mod q_{0}$ # where $𝖼𝗍 = (A, B) = {𝖱𝖫𝖶𝖤}_{S, σ} (Δ M)$ , which satisfies the decryption relation: $A \cdot S + B = Δ M + E + K q_{0}$

2.

ModRaise: View the polynomials $A$ and $B$ as plaintext polynomials whose coefficients are each in $ℤ_{q_{L}}$ (i.e., $(A, B) mod q_{L}$ ). This change of viewpoint automatically turns the ciphertext into ${𝖱𝖫𝖶𝖤}_{S, σ} (Δ M + K q_{0})$ . The ModRaise step does not require any actual computation.

3.

CoeffToSlot:

Move the coefficients of the encrypted plaintext $Δ M + E + q_{0} K$ to the input vector slots by homomorphically multiplying it by $n^{- 1} \cdot \tilde{W} \cdot I_{n}^{R}$ as follows:

${𝖱𝖫𝖶𝖤}_{S, σ} (Z_{1}) = n^{- 1} \cdot \tilde{W} \cdot I_{n}^{R} \cdot {𝖱𝖫𝖶𝖤}_{S, σ} (Δ M + E + q_{0} K) mod q_{L}$

4.

EvalExp:

Remove the wrap-around garbage value $q_{0} K$ in $Δ M + E + q_{0} K$ by homomorphically evaluating the polynomial $σ_{f}$ which approximates a sine function with period $q_{0}$ as follows:

${𝖱𝖫𝖶𝖤}_{S, σ} (Z_{2}) = σ_{f} \circ {𝖱𝖫𝖶𝖤}_{S, σ} (Z_{1}) mod q_{l}$

This step is equivalent to homomorphically reducing the input value modulo $q_{0}$ . It lowers the ciphertext modulus from $q_{L}$ to $q_{l}$ (where $l < L$ ), because homomorphically evaluating the polynomial approximation of the sine function consumes multiplicative levels.

5.

SlotToCoeff:

Move the modulo- $q_{0}$ -reduced plaintext value $Δ M + E$ stored in the input vector slots back to the plaintext coefficient positions by homomorphically multiplying it by the decoding matrix ${\tilde{W}}^{*}$ as follows:

${𝖱𝖫𝖶𝖤}_{S, σ} (Δ M + E) = {\tilde{W}}^{*} \cdot {𝖱𝖫𝖶𝖤}_{S, σ} (Z_{2}) mod q_{l}$

Limitation: The noise slowly grows over each bootstrapping due to the bootstrapping error and will eventually overflow the message and the ciphertext modulus.
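The ModRaise and EvalExp steps summarized above can be illustrated in the clear. The following is a minimal sketch, not an actual CKKS implementation: nothing is evaluated homomorphically, the parameters `n`, `q0` and `delta` are toy values chosen for readability (far too small to be secure), the helper `negacyclic_mul` is a hypothetical name, and the scaled sine $\frac{q_{0}}{2π} \sin (\frac{2π x}{q_{0}})$ is assumed as the concrete form of the sine function with period $q_{0}$ .

```python
import math
import numpy as np

def negacyclic_mul(a, b):
    """Multiply two integer polynomials modulo X^n + 1 (coefficients are not reduced)."""
    n = len(a)
    full = np.convolve(a, b)      # plain product, degree up to 2n - 2
    out = full[:n].copy()
    out[: n - 1] -= full[n:]      # X^n = -1 folds the upper half back with a sign flip
    return out

rng = np.random.default_rng(1)
n, q0, delta = 8, 2**20, 2**8     # toy parameters (assumption)
S = rng.integers(-1, 2, size=n)   # ternary secret key
M = rng.integers(-8, 9, size=n)   # small message polynomial
E = rng.integers(-3, 4, size=n)   # small noise polynomial

# Encrypt: pick A uniformly mod q0 and set B so that A*S + B = delta*M + E (mod q0).
A = rng.integers(0, q0, size=n)
B = (delta * M + E - negacyclic_mul(A, S)) % q0

# ModRaise: keep the same (A, B) but evaluate A*S + B WITHOUT reducing modulo q0.
raised = negacyclic_mul(A, S) + B
K = (raised - (delta * M + E)) // q0   # wrap-around multiples of q0
is_multiple_of_q0 = bool(np.all((raised - (delta * M + E)) % q0 == 0))

# EvalExp (in the clear): a sine with period q0 removes the q0*K term.
recovered = q0 / (2 * math.pi) * np.sin(2 * math.pi * raised / q0)
max_error = float(np.max(np.abs(recovered - (delta * M + E))))
```

In this sketch `raised` equals $Δ M + E + q_{0} K$ coefficient by coefficient, and `max_error` measures how far the sine output is from $Δ M + E$ ; it stays small only because $Δ M + E$ is much smaller than $q_{0}$ .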

Comparison between BFV and CKKS Bootstrapping: CKKS’s bootstrapping does not reduce the magnitude of the old noise $E$ but keeps it the same as before, because the sine approximation function converts $Δ M + E + K q_{0}$ into $Δ M + E$ . However, as the ciphertext modulus is raised from $q_{0}$ to $q_{L}$ , the noise-to-ciphertext-modulus ratio decreases, since $\frac{E}{q_{L}} ≪ \frac{E}{q_{0}}$ . On the other hand, the bootstrapping procedure introduces a new bootstrapping noise, which can be viewed as a fixed amount per bootstrapping. This fixed amount of new noise accumulates with each bootstrapping. Therefore, after a very large number of bootstrappings, the noise will eventually overflow the message and even the ciphertext modulus.

BFV’s bootstrapping, in contrast, reduces the noise but does not change the ciphertext modulus. There is no need to reset the ciphertext modulus, because BFV does not have a leveled ciphertext modulus chain, and BFV’s ciphertext-to-ciphertext multiplication does not consume the ciphertext modulus. Furthermore, since BFV’s bootstrapping directly removes the noise, the noise is guaranteed to stay under a certain threshold no matter how many times bootstrapping is performed.

Another important difference is that CKKS’s bootstrapping does not require homomorphic decryption, primarily because it maintains the plaintext’s scaling factor to be the same across the entire bootstrapping procedure. On the other hand, BFV’s bootstrapping needs to change the plaintext’s scaling factor to run the digit extraction algorithm. Therefore, homomorphic decryption is required to change the plaintext scaling factor ( $p^{𝜀}$ ) while preserving the same ciphertext modulus ( $q$ ).

D-3.13.8 Reducing the Bootstrapping Noise

As explained in §D-3.13.4, the bootstrapping procedure generates three types of noises:

Type-1 Noise: the intrinsic homomorphic $(+,⋅)$ computation noises of the CoeffToSlot, EvalExp, and SlotToCoeff steps
Type-2 Noise: the EvalExp step’s approximation error of the exponential function $e^{𝑖𝜃}$
Type-3 Noise: the EvalExp step’s sine graph error (i.e., not exactly $y = x$ around $x = 0$ , but only $y \approx x$ )

The Type-1 noise is inevitable by the design of FHE. Whether the Type-2 noise can be avoided depends on the tradeoff chosen between bootstrapping accuracy and efficiency. Unlike these two types of noises, the Type-3 noise can be effectively reduced by newer bootstrapping techniques.
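To give a feeling for the size of the Type-3 noise, the following minimal sketch measures the gap between $y = x$ and a sine with period $q_{0}$ . It assumes the scaled form $\frac{q_{0}}{2π} \sin (\frac{2π x}{q_{0}})$ , whose leading Taylor error term is $\frac{{(2π)}^{2} x^{3}}{6 q_{0}^{2}}$ ; the modulus value and the function names are hypothetical.

```python
import math

def scaled_sine(x, q0):
    """Sine with period q0 and slope 1 at the origin, so it is close to x for |x| << q0."""
    return q0 / (2 * math.pi) * math.sin(2 * math.pi * x / q0)

def type3_error(x, q0):
    """Gap between the ideal output x and the scaled sine."""
    return abs(x - scaled_sine(x, q0))

q0 = 2.0**20  # hypothetical modulus
inputs = (2.0**8, 2.0**9, 2.0**10)
errors = {x: type3_error(x, q0) for x in inputs}

# Leading Taylor term of the gap: (2*pi)^2 * x^3 / (6 * q0^2)
predicted = {x: (2 * math.pi) ** 2 * x**3 / (6 * q0**2) for x in inputs}

# Doubling the input multiplies the error by about 2^3 = 8 (cubic growth).
growth = errors[2.0**9] / errors[2.0**8]
```

Because the error grows cubically with the input, shrinking the value fed to the sine function shrinks the Type-3 noise very quickly.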

Figure 17: Arc-sine graph for smaller approximation error (Source)

$𝐚𝐫𝐜𝐬𝐢𝐧 (𝐬𝐢𝐧 (𝐱))$ Approximation (EUROCRYPT 2021 [19]): Using the $\arcsin (\sin (x))$ function instead of the $\sin x$ function can reduce the Type-3 noise, because its line is not curved but straight, as shown in Figure 17 (comprising a series of $y = x$ and $y = - x$ segments). This technique also uses the Remez algorithm that evenly distributes the approximation error over a specified region. However, one downside of this technique is that it consumes 3 multiplicative levels.
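The straight-line shape of $\arcsin (\sin (x))$ can be checked numerically. The following is a minimal cleartext sketch: it uses NumPy's exact `arcsin` and `sin` rather than the polynomial approximations that a homomorphic evaluation would use, works with period $2π$ instead of $q_{0}$ , and the function name `triangle` is hypothetical.

```python
import numpy as np

def triangle(x):
    """arcsin(sin(x)): a triangle wave built from straight segments of slope +1 and -1."""
    return np.arcsin(np.sin(x))

# On [-pi/2, pi/2] the function equals y = x up to floating-point rounding ...
x_mid = np.linspace(-1.5, 1.5, 301)
gap_arcsin = float(np.max(np.abs(triangle(x_mid) - x_mid)))

# ... whereas a plain sine drifts away from y = x as |x| grows.
gap_sine = float(np.max(np.abs(np.sin(x_mid) - x_mid)))

# Shifting the input by whole periods returns the same value,
# so the wrap-around term is still removed.
k = np.arange(-3, 4)
wrapped = triangle(0.7 + 2 * np.pi * k)
```

In this sketch `gap_arcsin` is at the level of rounding error, while `gap_sine` is clearly visible at the edges of the interval.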

Meta-BTS (CCS 2022 [20]): This is thus far the most computationally efficient and accurate bootstrapping technique, whose procedure is as follows:

1. Perform the regular sine-based bootstrapping on the input ciphertext.
2. Rescale step 1’s bootstrapped ciphertext to modulus $q_{0}$ .
3. Subtract step 2’s ciphertext from the initial un-bootstrapped ciphertext (where both ciphertexts are modulo $q_{0}$ ), which yields a modulo $q_{0}$ ciphertext storing the bootstrapping error (i.e., the part of the plaintext that step 1 failed to reproduce).
4. Bootstrap the output ciphertext of step 3 (storing the bootstrapping error) to modulus $q_{l}$ .
5. Add step 4’s ciphertext to step 1’s ciphertext (where both ciphertexts are modulo $q_{l}$ ), which gives a new modulo $q_{l}$ ciphertext with a reduced bootstrapping error. Adding is the matching operation here, because step 3 computes the initial value minus the bootstrapped value.
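The effect of this procedure on the Type-3 noise can be simulated in the clear. The following is a minimal sketch under stated assumptions: `toy_bootstrap` is a hypothetical stand-in that models only the sine error of a bootstrapping (no encryption, no Type-1 or Type-2 noise, no rescaling), and `Q0` is a toy modulus. The residual is taken as the initial value minus the bootstrapped value, so the bootstrapped residual is added back to the first result; with the opposite sign convention for the residual it would be subtracted instead.

```python
import math

Q0 = 2.0**20  # hypothetical base modulus

def toy_bootstrap(x):
    """Cleartext stand-in for sine-based bootstrapping: models only the Type-3 error."""
    return Q0 / (2 * math.pi) * math.sin(2 * math.pi * x / Q0)

def meta_bts(x, rounds=1):
    """Refine toy_bootstrap(x) by bootstrapping the remaining residual `rounds` times."""
    result = toy_bootstrap(x)               # step 1: regular bootstrapping
    for _ in range(rounds):
        residual = x - result               # steps 2-3: what the result still misses
        result += toy_bootstrap(residual)   # steps 4-5: bootstrap the residual, fold it in
    return result

x = 2.0**12  # plays the role of the plaintext value Delta*M + E
plain_error = abs(toy_bootstrap(x) - x)
refined_error = abs(meta_bts(x, rounds=1) - x)
```

Since the residual is far smaller than `x`, the sine error made when bootstrapping it is smaller still, which is why `refined_error` ends up several orders of magnitude below `plain_error` in this model.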

Limitation in Noise Handling: The above bootstrapping techniques can reduce the Type-3 noise, because the bootstrapping error is much smaller than the plaintext message, and the smaller the input $x$ to the approximating sine function, the closer its output is to $y = x$ . When this algorithm is run multiple times, the Type-3 noise becomes exponentially smaller, because the target plaintext of each round (i.e., the bootstrapping error extracted in step 3 above) is much smaller than that of the previous round. Meanwhile, the Type-1 and Type-2 noises do not decrease over multiple bootstrapping rounds and stay at roughly the same level, because each round generates new Type-1 and Type-2 noises.
