is either very large (larger than the square root of the overflow threshold) or very small (smaller than the square root of the underflow threshold). Is it true that both the real and imaginary parts of the complex product are always computed to high relative accuracy?
Proof.
For a complex calculation, we only need to calculate its Re and Im accurately. Let . In order to prove that there exists a such that , we only need to prove that
1) For addition , our algorithm is
Now let's analyse its error.
For the relative accuracy, let be the relative accuracy,
Re and Im have high relative accuracy.
2) For subtraction , our algorithm is
Now let's analyse its error.
For the relative accuracy, let be the relative accuracy,
Re and Im have high relative accuracy.
3) For multiplication , our algorithm is
Now let's analyse its error.
For relative accuracy,
When have different signs, while and have the same signs, the relative accuracy of Re might be low; a similar result holds for Im.
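The cancellation just described can be demonstrated numerically. The following is an illustrative sketch (the inputs a, b, c, d are chosen for the demonstration and are not from the original text): when ac and bd nearly cancel, the computed real part ac - bd retains few correct digits relative to the exact value.

```python
from fractions import Fraction

# Multiply z1 = a + bi by z2 = c + di; the real part is a*c - b*d.
# Choosing a = c = 1 + 2^-27 and b = d = 1 makes a*c and b*d nearly
# cancel, so the rounding error of a*c dominates the tiny result.
a = c = 1.0 + 2.0**-27
b = d = 1.0

re_float = a * c - b * d                       # computed real part
re_exact = Fraction(a) * Fraction(c) - Fraction(b) * Fraction(d)
rel_err = abs((Fraction(re_float) - re_exact) / re_exact)
# rel_err is about 2^-28, vastly larger than machine epsilon 2^-53.
```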
4) For division , our algorithm is
Now let's analyse its error; we only analyse the case ,
For relative accuracy,
Since we never multiply two large numbers, the problem of overflow will not happen. is easy to see from the analysis above.
This is tested with the following Python code.
>>> def complex_divide(a1, a2, b1, b2):
...     if abs(b1) < abs(b2):
...         return [(b1/b2*a1+a2)/(b1/b2*b1+b2), (b1/b2*a2-a1)/(b1/b2*b1+b2)]
...     else:
...         return [(a1+b2/b1*a2)/(b1+b2/b1*b2), (a2-b2/b1*a1)/(b1+b2/b1*b2)]
...
>>> complex_divide(2**666, 2**666, 2**666, 2**666)
[1.0, 0.0]
>>> complex_divide(2**-666, 2**-666, 2**-666, 2**-666)
[1.0, 0.0]
>>> complex_divide(2**-666, 2**666, 2**-666, 2**666)
[1.0, -0.0]
>>> complex_divide(2**666, 2**-666, 2**666, 2**-666)
[1.0, 0.0]

Question 1.13
Question. Prove Lemma 1.3. Let (or ) and be an inner product. Then there is an -by- s.p.d. matrix such that . Conversely, if is s.p.d., then is an inner product.
Proof.
We only prove the case in ; the case in is a special case of the former.
The matrix form of the inner product: let be a basis of ; then we can rewrite the inner product as
We shall show that A is s.p.d.,
which means . , , , iff , iff .
If A is s.p.d., we shall show that satisfies the four properties of an inner product.
1) .
2).
3) .
4) Since A is h.p.d., , iff .
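As a quick numerical companion to the proof (a sketch with a randomly generated s.p.d. matrix, not part of the original argument), one can spot-check that x^T A y satisfies the inner-product axioms when A is s.p.d.:

```python
import numpy as np

rng = np.random.default_rng(0)
B = rng.standard_normal((4, 4))
A = B.T @ B + 4 * np.eye(4)        # s.p.d. by construction

def inner(x, y):
    """Candidate inner product <x, y> = x^T A y."""
    return x @ A @ y

x, y, z = rng.standard_normal((3, 4))
sym_ok = np.isclose(inner(x, y), inner(y, x))                            # symmetry
lin_ok = np.isclose(inner(2 * x + z, y), 2 * inner(x, y) + inner(z, y))  # linearity
pos_ok = inner(x, x) > 0                                                 # positive definiteness
```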
Question 1.14
Question. Prove Lemma 1.5: ,
Proof.
Question 1.15
Question. Prove Lemma 1.6:
An operator norm is a matrix norm.
Proof.
1)
If , then , . If , then , which means .
2)
3)
Question 1.16
Question. Prove all parts except 7 of Lemma 1.7. Hint for part 8: use the fact that if and are both -by- , then and have the same eigenvalues. Hint for part 9: use the fact that a matrix is normal if and only if it has a complete set of orthonormal eigenvectors.
Part 1. Prove that is true for a vector norm and its corresponding operator norm, or for the vector two-norm and the matrix Frobenius norm.
Proof. For the operator norm, this follows immediately from
if, for any nonzero vector , we take .
For the matrix Frobenius norm, if we rewrite , its row vector representation, we have
The last step is to sum the squares of all the inequalities above, i.e.
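A numerical spot-check of part 1 (a sketch, not a proof): for random A and x, ||Ax||_2 is bounded both by ||A||_2 ||x||_2 and by ||A||_F ||x||_2.

```python
import numpy as np

rng = np.random.default_rng(1)
ok = True
for _ in range(100):
    A = rng.standard_normal((5, 5))
    x = rng.standard_normal(5)
    lhs = np.linalg.norm(A @ x)
    # operator (spectral) norm bound and Frobenius norm bound
    ok = ok and lhs <= np.linalg.norm(A, 2) * np.linalg.norm(x) + 1e-12
    ok = ok and lhs <= np.linalg.norm(A, 'fro') * np.linalg.norm(x) + 1e-12
```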
Part 2. Prove that for any operator norm or the Frobenius norm. In other words, any operator norm (or the Frobenius norm) is mutually consistent with itself.
Proof. For operator norm,
For the Frobenius norm, this follows by rewriting , its column vector representation, and following what we did in part 1.
Part 3. Prove that the max norm and the Frobenius norm are not operator norms.
Proof. For the max norm, we show that it violates part 2 of Lemma 1.7 with the counterexample
It can easily be shown that the Frobenius norm is not an operator norm, since . (Note: here we only consider the case that ; if you know the proof of the more general case, which is more extensive, you can contact us.)
Part 4. if are orthogonal or unitary, for the Frobenius norm and for the operator norm induced by . This is really just the Pythagorean theorem.
Proof. We only prove the orthogonal case, since the unitary case is the same. For the operator norm induced by
since when is orthogonal.
For the Frobenius norm, this is the Pythagorean theorem. If is its row vector representation, and is the column vector representation of the orthogonal matrix, then for any , take its orthogonal decomposition, where is the ( )-th element of . Square both sides of the equality and sum from 1 to ; we get on the left and on the right. The same argument works for multiplication from the left.
Part 5. Prove that , the maximum absolute row sum.
Proof. This can be obtained by taking , where each element's sign can be plus or minus. The maximum is then attained by one of these vectors; the infinity norm is the maximum of the absolute row sums.
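The extremal plus/minus vector in the proof can be exhibited numerically (an illustrative sketch): taking x to be the signs of the maximizing row, ||Ax||_inf attains the maximum absolute row sum.

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((6, 6))
row_sums = np.abs(A).sum(axis=1)
i = int(np.argmax(row_sums))
x = np.sign(A[i])                     # the +/-1 vector from the proof
attained = np.abs(A @ x)[i]           # row i of Ax hits the row sum exactly
match = (np.isclose(attained, row_sums.max())
         and np.isclose(np.linalg.norm(A, np.inf), row_sums.max()))
```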
Part 6. Prove that , the maximum absolute column sum.
Proof. This is trivial from the viewpoint of functional analysis: , which is defined on , is the adjoint operator of , and .
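The adjoint relation used in the proof, ||A||_1 = ||A^T||_inf, can be checked numerically (a sketch):

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((5, 7))
col_sums = np.abs(A).sum(axis=0)      # one-norm is the max absolute column sum
agree = (np.isclose(np.linalg.norm(A, 1), col_sums.max())
         and np.isclose(np.linalg.norm(A, 1), np.linalg.norm(A.T, np.inf)))
```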
Part 8. Prove that .
Proof. This is the same as the proof of part 6.
Part 9. Prove that if is normal, i.e. .
Proof. Recall that being normal is equivalent to the existence of a unitary matrix such that is diagonal. We have
This is true for of course.
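For a concrete check of part 9 (a sketch using a symmetric, hence normal, matrix):

```python
import numpy as np

rng = np.random.default_rng(4)
M = rng.standard_normal((5, 5))
A = M + M.T                           # symmetric, therefore normal
eigs = np.linalg.eigvalsh(A)          # real eigenvalues of a symmetric matrix
close = np.isclose(np.linalg.norm(A, 2), np.abs(eigs).max())
```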
Part 10. If is -by- , then .
Proof. There exists ,
where the last inequality is a corollary of the inequality of means. There exists ,
Part 11. If is -by- , then .
Proof. This follows directly from parts 6, 8, and 10.
Part 12. If is -by- , then .
Proof. This follows directly from parts 10 and 11.
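The norm-equivalence bounds in parts 10 through 12 can be spot-checked for random rectangular matrices. This is a hedged sketch verifying two standard inequalities, ||A||_2 <= sqrt(||A||_1 ||A||_inf) and ||A||_2 <= ||A||_F <= sqrt(min(n, m)) ||A||_2:

```python
import numpy as np

rng = np.random.default_rng(5)
holds = True
for _ in range(50):
    A = rng.standard_normal((4, 6))
    n1 = np.linalg.norm(A, 1)
    n2 = np.linalg.norm(A, 2)
    ninf = np.linalg.norm(A, np.inf)
    nf = np.linalg.norm(A, 'fro')
    holds = holds and n2 <= np.sqrt(n1 * ninf) + 1e-12
    holds = holds and n2 <= nf + 1e-12
    holds = holds and nf <= np.sqrt(min(A.shape)) * n2 + 1e-12
```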
Part 13. If is -by- , then .
Proof. For the first part, it suffices to find one: define as the -th column vector of ,
The second part is true, since
Question 1.17
Question. We mentioned that on a Cray machine the expression caused an error, because round-off caused to exceed 1. Show that this is impossible using IEEE arithmetic, barring overflow or underflow. Hint: You will need to use more than the simple model with small. Think about evaluating , and show that, barring overflow or underflow, exactly; in numerical experiments done by A. Liu, this failed about 5% of the time on a Cray YMP. You might try some numerical experiments and explain them. Extra credit: Prove the same result using correctly rounded decimal arithmetic. (The proof is different.)
Proof.
Now consider the case where we want to calculate . In IEEE arithmetic the exponent part and the fraction part are separate, so we are actually dealing with , where and is a rational number with at most 52 digits in its binary representation (defined as by convention).
For any , there exists an even (positive) number , s.t. . Here is the machine accuracy. Then, first we have the conclusion that
As and will always contribute no less than when rounding to any precision.
Define . Obviously,
We want to prove that . Since is non-decreasing, we only need to prove the case for and .
...(Wait for completion. If you can do it, please contact us)
When is calculated exactly as it should be, this assures that .
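A small numerical experiment in the spirit of the question (a sketch; the sample values are arbitrary): with IEEE double precision, the computed x/sqrt(x*x + y*y) never exceeds 1, so acos never receives an out-of-range argument.

```python
import math

vals = [1.0, 1.5, 1e3, 1e-3, 7.1, 1.0 / 3.0, math.pi]
worst = 0.0
for x in vals:
    for y in vals + [0.0]:
        r = x / math.sqrt(x * x + y * y)
        worst = max(worst, r)
        math.acos(r)   # would raise ValueError if r > 1; never happens here
```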
Question 1.18
Question. Suppose that and are normalized IEEE double precision floating point numbers, and consider the following algorithm, running with IEEE arithmetic:
if , swap and
Prove the following facts:
Barring overflow or underflow, the only round-off error committed in running the algorithm is computing . In other words, both subtractions and are computed exactly.
exactly. This means that is actually the round-off error committed when rounding the exact value of to get .
Thus, this program in effect simulates quadruple precision arithmetic, representing the true sum as the higher-order bits ( ) and the lower-order bits ( ).
Using this and similar tricks in a systematic way, it is possible to efficiently simulate all four basic floating-point operations in arbitrary precision arithmetic, using only the underlying floating point instructions and no "bit-fiddling". 128-bit arithmetic is implemented this way on the IBM RS6000 and Cray (but much less efficiently on the Cray, which does not have IEEE arithmetic).
Proof. We shall prove the following lemma: if floating point numbers and satisfy , then , i.e. is an exact floating-point number.
Proof of lemma. Recall that we are in the binary system, and that means shifting one digit of to the left. If has the same exponent as , the proof is trivial. When has the same exponent as , will have exponent less than or equal to that of , since the highest digits of and cancel. Notice that , and this concludes the lemma.
Assume , , in their floating point representation. For the case , (and similarly for ), notice that Hence, is computed exactly by the lemma. And for , one of the following holds:
When ( holds automatically), or when and both hold, the result is trivial.
When but , rounding up occurs; is exact, and lies in .
When , might round up or round down. Rounding up means , while rounding down means has the same exponent as . Both cases give exact computation.
For the case , (or similarly for ),...(If you can do this, please contact us)
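The algorithm analyzed here is the classic Fast2Sum error-free transformation. As a sketch (exactness verified with exact rational arithmetic), after ensuring |a| >= |b|, s = fl(a + b) and e = fl(b - fl(s - a)) satisfy a + b = s + e exactly:

```python
from fractions import Fraction

def fast_two_sum(a, b):
    """Return (s, e) with s = fl(a + b) and e the exact rounding error."""
    if abs(b) > abs(a):
        a, b = b, a          # ensure |a| >= |b|, as in the algorithm
    s = a + b
    e = b - (s - a)          # both subtractions are exact, as proved above
    return s, e

exact = True
for a, b in [(1.0, 2.0**-60), (1e16, 3.14159), (-1.0, 1e-20), (0.1, 0.2)]:
    s, e = fast_two_sum(a, b)
    exact = exact and (Fraction(a) + Fraction(b) == Fraction(s) + Fraction(e))
```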
Question 1.19
Question. This question illustrates the challenges in engineering highly reliable numerical software. Your job is to write a program to compute the two-norm given . The most obvious (and inadequate) algorithm is
for to end for
This algorithm is inadequate because it does not have the following desirable properties:
It must compute the answer accurately (i.e., nearly all the computed digits must be correct) unless is (nearly) outside the range of normalized floating-point numbers.
It must be nearly as fast as the obvious program above in most cases.
It must work on any "reasonable" machine, possibly including ones not running IEEE arithmetic. This means it may not cause an error condition unless is (nearly) larger than the largest floating-point number.
To illustrate the difficulties, note that the obvious algorithm fails when and is larger than the square root of the largest floating-point number (in which case overflows, and the program returns in IEEE arithmetic and halts in most non-IEEE arithmetics) or when and is smaller than the square root of the smallest normalized floating-point number (in which case underflows, possibly to zero, and the algorithm may return zero). Scaling the by dividing them all by does not have property 2) because division is usually many times more expensive than either multiplication or addition. Multiplying by risks overflow in computing , even when .
This routine is important enough that it has been standardized as a Basic Linear Algebra Subroutine, or BLAS, which should be available on all machines.
Answer.
To avoid overflow, one can separate out the exponent part of a floating-point number and compute the scaled squares. One implementation is as follows.
import numpy as np

def frexp(x):
    """Fake frexp to separate a double precision floating point
    number's exponent and tail."""
    b = '{:0>64s}'.format(bin(np.float64(x).view(np.uint64))[2:])
    exp = int(b[1:12], base=2) - 1023
    if exp > -1023:
        return exp, int(b[12:], base=2) * (2**-52) + 1
    else:
        extra = b[1:].find('1')
        return exp - extra + 11, int(b[extra:], base=2) * (2**(extra-62))

def norm2(x):
    if len(x) == 0:
        return 0
    if len(x) == 1:
        return abs(x[0])
    s = 0
    size = -1076
    for i in range(len(x)):
        x_exp, x_frac = frexp(x[i])  # get the exponent part and the fraction part
        if size < x_exp:
            s *= 2 ** (size - x_exp)
            size = x_exp
        else:
            x_frac *= 2 ** (x_exp - size)
        s += x_frac ** 2
    return s**0.5 * 2**size

This code is tested for
>>> norm2([2**-1024]) == 2**-1024
True
>>> norm2([2**-1054, 2**-1055]) == 2**-1055 * 5**0.5
True
>>> norm2([0.+2**1020]) == 2**1020
True
>>> norm2([0.+2**1020, 0.+2**1020]) == 2**1020 * 2**0.5
True
>>> norm2([2**-1054, 2**1020]) == 2**1020
True
>>> norm2([2**1020, 2**-1054]) == 2**1020
True

Though for convenience of coding we did not use the built-in C function frexp and implemented it in a slow way, the performance can still be evaluated. Assuming the frexp operation can be performed properly (at machine level, separating the exponent part and the tail part) and counted as three operations, this algorithm costs at most operations in total (including addition/subtraction, multiplication, and bit moves), about 4 times longer than , the provided inadequate algorithm.
Further, the code is implemented in C. (Please see the attachment on gitee.)
https://gitee.com/j7168908jx/shuzhidaishu/blob/master/code/c1-19/code.c
Test results are given in Table 1. norm2simple is the baseline algorithm given in the question; norm2lapack is the one from LAPACK in Fortran code; norm2 and norm22 are the algorithms designed in this answer, where norm22 uses more efficient but harder-to-read operations and might not work with denormalized floating-point numbers. The vector is generated from the standard normal distribution.
While completing the C code, we reached several conclusions. The most important for this problem is that sometimes division is not so expensive compared with other techniques that may add more complexity to the code and to the running time; limited CPU structure (registers, cache, variable storage) is to blame.
Table 1. Test result for question 1.19

Iterations, length( )    norm2simple    norm2lapack    norm22    norm2
                         11.25s         13.45s         20.49s    22.38s

Question 1.20
Question. We will use a Matlab program to illustrate how sensitive the roots of a polynomial can be to small perturbations in the coefficients. Polyplot takes an input polynomial specified by its roots r and then adds random perturbations to the polynomial coefficients, computes the perturbed roots, and plots them. The inputs are
r: vector of roots of the polynomial,
e: maximum relative perturbation to make to each coefficient of the polynomial,
m: number of random polynomials to generate, whose roots are plotted.
The first part of your assignment is to run this program for the following inputs. In all cases choose m high enough that you get a fairly dense plot but don't have to wait too long. m = a few hundred or perhaps 1000 is enough. You may want to change the axes of the plot if the graph is too small or too large.
Also try your own example with complex conjugate roots. Which rootsare most sensitive?
r=(1:10);e=1e-3,1e-4,1e-5,1e-6,1e-7,1e-8,
r=(1:20);e=1e-9,1e-11,1e-13,1e-15,
r=[2,4,8,16,...,1024];e=1e-1,1e-2,1e-3,1e-4.
The second part of your assignment is to modify the program to compute the condition number c(i) for each root. In other words, a relative perturbation of e in each coefficient should change root r(i) by at most about e*c(i). Modify the program to plot circles centered at r(i) with radii e*c(i), and confirm that these circles enclose the perturbed roots (at least when e is small enough that the linearization used to derive the condition number is accurate). You should turn in a few plots with circles and perturbed roots, and some explanation of what you observe.
In the last part, notice that your formula for c(i) "blows up" if . This condition means that r(i) is a multiple root of . We can still expect some accuracy in the computed value of a multiple root, however, and in this part of the question, we will ask how sensitive a multiple root can be: First, write , where and is the multiplicity of the root r(i). Then compute the roots nearest r(i) of the slightly perturbed polynomial , and show that they differ from r(i) by . So if , for instance, the root r(i) is perturbed by , which is much larger than if . Higher values of yield even larger perturbations. If is around machine epsilon and represents rounding errors in computing the root, this means an -tuple root can lose all but -th of its significant digits.
Answer.
1. Here we let m = 1000.
1) The case r=1:10, let e=1e-3,1e-4,1e-5,1e-6,1e-7,1e-8. See the figures below.
Figure. When r=1:10
2) The case r=1:20, let e=1e-9,1e-11,1e-13,1e-15. See the figures below.
Figure. When r=1:20
3) The case r=[2,4,8,...,1024], let e=1e-1,1e-2,1e-3,1e-4. See the figures below.
Figure. When r=[2,4,8,...,1024]
2. Let's start with the theoretical analysis. Let , where are the roots. Let
where denotes the perturbation of the coefficient. Without loss of generality, we focus on . Here we assume that the perturbation is small enough that the perturbation of the root will be small enough for our analysis. Then
So
For our numerical experiment, we assume that is much smaller than the minimum distance between the roots. We show case 1) with e = 1e-7; see the figure below.
Figure. r=1:10, e=1e-7
We can see that the circles in the middle are much larger than the circles on the edge, but the circle of root 5 or 6 is not the largest. It is easy to see that increases monotonically with , but the derivative of at is much smaller in the middle than (see the figure below); from the estimation of , we know that the largest circle will appear in the second half of the roots.
Figure. derivative
3. The same trick as in 2: notice that the k-th derivative of p(x) will be 0 if , then
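Part 3 can also be illustrated numerically. In this sketch (p(x) = (x-1)^3 and eps = 1e-9 are chosen purely for illustration), perturbing the constant coefficient of an m-fold root by eps moves the roots by about eps**(1/m):

```python
import numpy as np

eps = 1e-9
coeffs = np.poly([1.0, 1.0, 1.0])    # coefficients of (x - 1)^3
coeffs[-1] += eps                    # perturb the constant term
roots = np.roots(coeffs)             # roots of (x - 1)^3 + eps
shift = np.max(np.abs(roots - 1.0))  # distance of roots from the triple root
predicted = eps ** (1.0 / 3.0)       # eps^(1/m) with m = 3, i.e. about 1e-3
ratio = shift / predicted
```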
See the code appendix c1-20
https://gitee.com/j7168908jx/shuzhidaishu/tree/master/code/c1-20
Question 1.21
Question. Apply Algorithm 1.1, Bisection, to find the roots of , where is evaluated using Horner's rule. Use the Matlab implementation, or else write your own. Confirm that changing the input interval slightly changes the computed root drastically. Modify the algorithm to use the error bound discussed in the text to stop bisecting when the round-off error in the computed value of gets so large that its sign cannot be determined.
Answer.
The bisection is implemented as
import numpy as np
from matplotlib import pyplot as plt

def bisect(coeff, a, b, tol):
    low = a
    high = b
    poly = np.poly1d(coeff, r=True)
    p_low = poly(low)
    p_high = poly(high)
    if abs(p_low) < tol:
        return low, low
    elif abs(p_high) < tol:
        return high, high
    while high - low > 2 * tol:
        mid = (low + high) / 2
        p_mid = poly(mid)
        if p_mid * p_low < 0:
            high = mid
            p_high = p_mid
        elif p_mid * p_high < 0:
            low = mid
            p_low = p_mid
        else:
            low = high = mid
            p_low = p_high = p_mid
    return low, high

result = [bisect([2] * 9, 1, h, 1e-5) for h in np.linspace(3, 3.5, 100)]
plt.plot(np.linspace(3, 3.5, 100), result)
plt.xlabel("search interval: [1, x]")
plt.ylabel('result')
plt.title("different bisection result when initial value varies")
plt.show()

and the great difference in the results can be seen in the figure below.
Figure. Result for question 1.21
The modified bisection search code is listed below.
def bisect(roots, a, b, tol):
    low = a
    high = b
    poly = np.poly1d(roots, r=True)
    coeff = poly.coef
    poly2 = np.poly1d(np.abs(coeff))
    p_low = poly(low)
    p_high = poly(high)
    if abs(p_low) < tol:
        return low, low
    elif abs(p_high) < tol:
        return high, high

    def stop(mi, pmi):
        # error bound on the computed p(mi); the sign of p(mi) is
        # undeterminable once |p(mi)| falls below this bound
        err = 2 * len(roots) * poly2(np.abs(mi)) * 2**-52
        if pmi > 0:
            return pmi - err <= 0
        else:
            return pmi + err >= 0

    while True:
        mid = (low + high) / 2
        p_mid = poly(mid)
        if p_mid * p_low < 0:
            high = mid
            p_high = p_mid
        elif p_mid * p_high < 0:
            low = mid
            p_low = p_mid
        else:
            low = high = mid
            p_low = p_high = p_mid
        if high - low < 2 * tol:
            break
        if stop(mid, p_mid):
            low = high = mid
            p_low = p_high = p_mid
            break
    return low, high

Respectively, the outcome of the code, when given different initial values, is shown in the figure below.
Figure. Result for question 1.21
If you have any questions, feel free to leave a comment below!