In this exercise, we look at how software techniques can extract instruction-level parallelism (ILP) in a common vector loop. The following loop is the so-called DAXPY loop (double-precision aX plus Y) and is the central operation in Gaussian elimination. The code below implements the DAXPY operation, Y = aX + Y, for a vector of length 100. Initially, F0 holds the constant a, R1 is set to the base address of array X, and R2 is set to the base address of array Y:

        DADDIU  R4, R1, #800    ; R4 = upper bound for X
  foo:  L.D     F2, 0(R1)       ; load X(i) to F2
        MUL.D   F4, F2, F0      ; (F4) = a*X(i)
        L.D     F6, 0(R2)       ; (F6) = Y(i)
        ADD.D   F6, F4, F6      ; (F6) = a*X(i) + Y(i)
        S.D     F6, 0(R2)       ; store F6 to Y(i)
        DADDIU  R1, R1, #8      ; increase X index
        DADDIU  R2, R2, #8      ; increase Y index
        DSLTU   R3, R1, R4      ; test: continue loop?
        BNEZ    R3, foo         ; loop if needed
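To make the loop's behavior concrete, here is a minimal Python sketch of the same computation, followed by a version with the body unrolled four times. Loop unrolling is one common software technique for exposing ILP; the question text above does not name a specific technique, so the choice of unrolling and the factor of 4 are assumptions made for illustration. The function names `daxpy` and `daxpy_unrolled4` are hypothetical.

```python
def daxpy(a, x, y):
    """Reference DAXPY: y[i] = a * x[i] + y[i], one element per iteration."""
    for i in range(len(x)):
        y[i] = a * x[i] + y[i]
    return y


def daxpy_unrolled4(a, x, y):
    """Same computation with the loop body unrolled four times.

    Assumption: len(x) is a multiple of 4 (true for the length-100
    vectors in the exercise), so no cleanup loop is needed.
    """
    n = len(x)
    assert n % 4 == 0, "this sketch only handles lengths divisible by 4"
    for i in range(0, n, 4):
        # The four multiplies do not depend on each other, so a scheduler
        # could overlap them instead of waiting on one multiply-add chain.
        t0 = a * x[i]
        t1 = a * x[i + 1]
        t2 = a * x[i + 2]
        t3 = a * x[i + 3]
        y[i] = t0 + y[i]
        y[i + 1] = t1 + y[i + 1]
        y[i + 2] = t2 + y[i + 2]
        y[i + 3] = t3 + y[i + 3]
    return y


# 100 elements of 8 bytes each matches the #800 upper bound in the assembly.
x = [float(i) for i in range(100)]
y_ref = daxpy(2.0, x, [1.0] * 100)
y_unr = daxpy_unrolled4(2.0, x, [1.0] * 100)
print(y_ref == y_unr)  # prints True
```

Unrolling also removes three of every four index updates, compares, and branches, which is where much of the saving would come from in the assembly version.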
