
Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Dec 19, 2015

Page 1: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Programming with CUDA, WS 08/09

Lecture 12, Tue, 02 Dec, 2008

Page 2: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Previously

Optimization example: parallel reduction

Page 3: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Today

– Graded/ungraded course?
– Revisiting shared memory bank conflicts
– Final projects

Page 4: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Graded/ungraded

All settled?

Page 5: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Shared Memory

Devices of compute capability 1.x have 16 banks
– 16 KB shared memory in 16 banks, 1 KB each
– Successive 32-bit words are stored in successive banks
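The bank layout described on this slide can be sketched as a small host-side model in plain C. It is only an illustration, not code from the lecture: the names `bank_of` and `conflict_degree` are hypothetical, and the model assumes that the 16 threads of a half-warp issue their shared memory requests together, with thread t reading the 32-bit word at index t * stride. The broadcast case, where all threads read the same word, is not modeled.

```c
/* Figures from the slide: compute capability 1.x. */
#define NUM_BANKS  16  /* shared memory banks */
#define WORD_BYTES  4  /* successive 32-bit words go to successive banks */
#define HALF_WARP  16  /* assumption: threads whose requests are served together */

/* Bank that holds the 32-bit word at a given byte offset into shared memory. */
unsigned bank_of(unsigned byte_offset)
{
    return (byte_offset / WORD_BYTES) % NUM_BANKS;
}

/*
 * Largest number of threads in a half-warp that land in the same bank
 * when thread t reads word t * stride_words (stride_words >= 1).
 * 1 means conflict-free; n means an n-way bank conflict.
 * Hypothetical helper for illustration; stride 0 (broadcast) is not modeled.
 */
unsigned conflict_degree(unsigned stride_words)
{
    unsigned hits[NUM_BANKS] = {0};
    unsigned worst = 0;
    unsigned t;

    for (t = 0; t < HALF_WARP; ++t) {
        unsigned b = bank_of(t * stride_words * WORD_BYTES);
        hits[b] += 1;
        if (hits[b] > worst) {
            worst = hits[b];
        }
    }
    return worst;
}
```

Under these assumptions, `conflict_degree(1)` and `conflict_degree(3)` both give 1 (no conflict), `conflict_degree(2)` gives 2, and `conflict_degree(16)` gives 16, since every thread then hits bank 0. Odd strides stay conflict-free in this model because they share no common factor with the 16 banks.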

Page 6: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.
Page 7: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.
Page 8: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.
Page 9: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.
Page 10: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.
Page 11: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.
Page 12: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.
Page 13: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Final Projects

Time-line
– Thu, 20 Nov: Float write-ups on ideas of Jens & Waqar
– Tue, 25 Nov: Suggest groups and topics
– Thu, 27 Nov: Groups and topics assigned
– Tue, 2 Dec (today): Last chance to change groups/topics; groups and topics finalized

Page 14: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Final Projects

There will be no lectures in the second half
– Meetings with groups
– Schedule will be put online

Page 15: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Final Projects

General tips
– Optimize your code
– Document your code
– Make your code platform independent
– Don't mix C and C++

Page 16: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

Final Exam

When?

Page 17: Programming with CUDA WS 08/09 Lecture 12 Tue, 02 Dec, 2008.

On to exercises!