Modern FPGA placement techniques with hardware acceleration

dc.contributor.advisorPan, David Z.
dc.contributor.advisorIyer, Mahesh A.
dc.contributor.committeeMemberTouba, Nur A.
dc.contributor.committeeMemberGerstlauer, Andreas
dc.contributor.committeeMemberChiou, Derek
dc.contributor.committeeMemberRossbach, Christopher
dc.creatorDhar, Shounak
dc.date.accessioned2021-04-23T23:43:24Z
dc.date.available2021-04-23T23:43:24Z
dc.date.created2019-08
dc.date.issued2019-09-13
dc.date.submittedAugust 2019
dc.date.updated2021-04-23T23:43:25Z
dc.description.abstractIn deep sub-micron technology nodes, Application-Specific Integrated Circuits (ASICs) are becoming expensive to design and manufacture. For this reason, Field Programmable Gate Arrays (FPGAs), which are general purpose and flexible programmable hardware, are gaining more design wins in low volume and fast evolving applications. Modern FPGAs are becoming popular in high performance data analytics, search engines, autonomous cars, communication and networking applications. FPGAs are also accompanied with a complete Computer-Aided Design (CAD) toolchain, that is used to optimally map and fit the design applications or workloads onto the underlying target FPGA device. These design applications mapped onto the FPGA demand high maximum achievable clock frequency (Fmax) and low power consumption while maintaining a low compilation time, which is a major hindrance in widespread adoption of FPGAs. The focus of this Ph.D. dissertation is the placement problem for FPGAs, which takes a major portion of the FPGA CAD tool runtime. A new algorithm for spreading cells during FPGA global placement is proposed, which achieves better wirelength and routing congestion and takes less runtime than the algorithm used in the state-of-the-art academic FPGA placer. We also propose FPGA acceleration of various subsystems of an analytic global placement algorithm, including wirelength gradient computation and spreading, which achieves significant speedup over the multi-threaded CPU version. A new detailed placement algorithm is proposed, which offers better tradeoff between quality and runtime compared to existing methods. This algorithm is also accelerated on a GPU and an FPGA, achieving significant speedup over multi-threaded CPU implementation. Another detailed placement algorithm is also proposed which physically re-aligns timing critical paths and improves Fmax with minimal runtime overhead. Both of these algorithms for detailed placement have shown good results on industrial benchmarks and have been integrated into an industrial FPGA CAD tool flow
dc.description.departmentElectrical and Computer Engineering
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2152/85439
dc.identifier.urihttp://dx.doi.org/10.26153/tsw/12403
dc.language.isoen
dc.subjectFPGA
dc.subjectGPU
dc.subjectPlacement
dc.subjectHardware acceleration
dc.subjectGlobal placement
dc.subjectDetailed placement
dc.subjectEDA
dc.subjectCAD
dc.titleModern FPGA placement techniques with hardware acceleration
dc.typeThesis
dc.type.materialtext
local.embargo.lift2021-08-01
local.embargo.terms2021-08-01
thesis.degree.departmentElectrical and Computer Engineering
thesis.degree.disciplineElectrical and Computer Engineering
thesis.degree.grantorThe University of Texas at Austin
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy

Access full-text files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DHAR-DISSERTATION-2019.pdf
Size:
5.16 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.45 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.84 KB
Format:
Plain Text
Description: