IEEE Transactions on VLSI 2025 Research Papers

IEEE Transactions on VLSI 2025 Research Papers

IEEE Transactions on VLSI 2025 Research Papers, We are offering iee projects 2024-2025 in latest technology like Java, dot net, android, embedded, matlab, vlsi, hadoop, power elctronics, power system, mechanical, civil projects. IEEE Master is a unit of LeMeniz Technologies. We guide all final year M.E/M.Tech, B.E/B.Tech, MPhil, MCA, BCA, M.Sc, B.Sc, and Diploma students for their Academic Projects to get best results.

Why Us?

IEEE Project Technology

LOW POWER

S.No Code Title Year
1 VLSI_2025_01_LP
46-nA High-PSR CMOS Buffered Voltage Reference With 1.2–5 V and -40°C to 125°C Operating Range
A nanopower, buffered CMOS voltage reference designed to operate across the entire industrial temperature range (from −40 ◦C to 125 ◦C) and with an input voltage range from 1.2 to 5 V (automotive applications) is presented. The solution provides 390 mV at 20 ◦C and is implemented in a standard BCD technology featuring 160-nm CMOS devices. It is characterized by an average temperature coefficient of 200 ppm/◦C, a line sensitivity (LS) of 0.138%/V, and a power supply rejection of −83 dB at 100 Hz. In addition, the circuit occupies a die area of 0.146 mm2 (with the reference circuit alone covering 0.043 mm2 ) and maintains a highly stable current consumption of around 45 nA across various process and input voltage conditions (25 nA for the reference circuit alone) while providing a maximum output current of 630 µA with a load regulation of 0.016 mV/µA..
2026
2 VLSI_2025_11_LP
Exploiting Body-Driven Feedbacks in Physical Unclonable Functions for Ultra Low Voltage, Ultra Low Power Applications: A 0.3 V Weak-PUF
This paper introduces an innovative approach to designing a mismatched current mirror with a fully unbalanced output, significantly reducing the minimum supply voltage requirements for Regulated Cascode Current Mirror (RCCM) Physical Unclonable Functions (PUFs). Leveraging body-driven feedback mechanisms, the proposed circuit reliably operates with supply voltages as low as 0.3V, maintaining stable power consumption through a reference bias current. The resulting PUF achieves remarkable energy efficiency, consuming only 0.3 fJ per bit, without compromising statistical performance. It exhibits a response bias of 49.42%, a reliability of 99.483%, and a uniqueness of 50.176%. Validation of this novel approach is conducted through simulations and measurements on a 130nm CMOS test-chip, considering a nominal supply voltage of 0.3V, ±10% supply voltage variations, and a temperature range from 0 ◦C to 75◦C. Rigorous experimental verification on 20 chip samples, along with detailed explanations of design methodologies, underscores the robustness and practicality of the proposed Body-PUF design. Comparative analyses against state-of-the-art literature reveal that the Body-PUF outperforms previous PUF designs in Figures of Merit (FOM), making it promising for realworld authentication scenarios. Its outstanding trade-off between performance and practicality positions it as a compelling solution for secure applications, including Internet of Things (IoT) devices and other security-critical systems.
2026
3 VLSI_2025_17_LP
Cryogenic Front-End ASICs for Low-Noise Readout of Charge Signals
This paper presents design details and measurement results of LArASIC, a front-end application specific integrated circuit (ASIC) designed for low-noise readout of charge signals generated in neutrino study experiments within liquid argon time projection chambers. LArASIC comprises of 16-channels of programmable charge amplification and pulse shaping stages that provide a voltage readout proportional to the input charge and was optimized for operation at liquid argon temperature, i.e., 89 K. The chip was fabricated in a 180 nm CMOS process. Measurements at liquid nitrogen temperature, i.e., 77 K, indicate that the channel outputs have high linearity (INL < 0.1%) within the operating range, an equivalent noise charge of 534 electrons for a peaking time of 1µs and a detector capacitance of 150 pF, and a worst-case inter-channel cross-talk of 0.35%. The paper also presents design choices made in the process of migrating LArASIC to CHARMS, an ASIC to be fabricated in a 65 nm process that includes all features provided by LArASIC, along with additional digital programmability for improved robustness and flexibility. CHARMS is intended for use in future high-energy physics experiments that require high-resolution charge or light readout with shorter pulse peaking times.
2026
4 VLSI_2025_35_LP
Low-Power and High-Speed SRAM Cells With Double-Node Upset Self-Recovery for Reliable Applications
Transistor sizing and spacing are constantly decreasing due to the continuous advancement of CMOS technology. The charge of the sensitive nodes in the static random access memory (SRAM) cell gradually decreases, making the SRAM cell more and more sensitive to soft errors, such as single-node upsets (SNUs) and double-node upsets (DNUs). Therefore, two types of radiation-hardened SRAM cells are proposed in this article. First, a low-power DNU self-recovery S6P8N cell is proposed. This cell can realize SNU self-recovery from all sensitive nodes as well as realize partial DNU self-recovery and has low-power consumption overhead. Second, we propose a high-speed DNU self-recovery S8P6N cell, which has a soft-error tolerance equivalent to the S6P8N. Furthermore, it reduces the read access time (RAT) and write access time (WAT). Simulation results show that the proposed cells are self-recovery for all SNUs and most of DNUs. Compared with RHBD12, QC8M2T, QC8CE12T, RHM10T, SEA14T, RHM-12T, S8P4R, S8P8M, RH-14T, HRLP16T, GC16T, and HRH-12T, the average power consumption of S6P8N is reduced by 48.78%, and the average power consumption of S8P6N is reduced by 26.34%. At the same time, WAT and RAT are reduced by 9.07% and 36.84%, respectively.
2026
5 VLSI_2025_37_LP
A Single-Ended High-Voltage-Compliant 11-bit Current-Steering Digital-to-Analog Converter for Adaptive Noise Cancellation in Power Over Data Line Networks
Automotive Ethernet is considered to be the backbone of future in-vehicle data communication. One main feature is its ability to simultaneously transmit data and energy via power over data lines (PoDL). This article proposes the design of a single-ended high-voltage (HV)-compliant 11-bit current-steering digital-to-analog converter (DAC). The converter is tailored for the utilization as digitally controlled current source in adaptive noise-cancellation filter for PoDL networks. Designed in an HV-compliant 180-nm bipolar complementary metal-oxide-semiconductor (BiCMOS) semiconductor technology, the DAC features a monolithically combined topology of two identical 10-bit low-voltage (LV) current-steering DACs supplied at 1.8 V and two complementary HV-compliant output current stages. Main design features of the segmented LV DAC are the utilization of single-ended current cells with an optimized switching logic, proposed to enhance the cells inherent performance and energy efficiency. Furthermore, a newly derived 0.4° asymmetric output walk switching scheme is investigated. At a maximum output voltage swing of 6 V, the proposed DAC provides a bidirectional output current with the amplitude of up to 50 mA. The proposed DAC exhibits the highest voltage compliance combined with large output current capability reported to date for HV-DACs. It also features excellent dynamic performance with a spurious free dynamic range (SFDR) of 57.8 dB as well as measured four sample rates enabling up to 1.6 MS/s and a maximum integral nonlinearity (INL) error of 1.61 LSB and a differential nonlinearity (DNL) error of 1.05 LSB.
2026
6 VLSI_2025_45_LP
Leveraging Highly Approximated Multipliers in DNN Inference
In this work, we present our control variate approximation technique that enables the exploitation of highly approximate multipliers in Deep Neural Network (DNN) accelerators. Our approach does not require retraining and significantly decreases the induced error due to approximate multiplications, improving power efficiency and accuracy. As a result, control variate approximation enables satisfying tight error resilience requirements while preserving the power savings. Our experimental evaluation, across six different DNNs and several approximate multiplier designs, demonstrates the versatility of control variate technique and shows that compared to the accurate designs, it achieves the same performance, 45% power reduction, and less than 1% average accuracy loss. Compared to the corresponding approximate designs without using our technique, the error-correction of the control variate method improves the accuracy by 1.9x on average.
2026
7 VLSI_2025_47_LP
Design of a Hardware-Efficient Approximate 4-2 Compressor for Multiplications in Image Processing
This letter presents a novel hardware-efficient approximate 4-2 compressor design that significantly enhances accuracy through a systematic analysis of input patterns obtained from practical applications. We incorporate a majority operation and a compound gate in the compressor design to effectively boost hardware efficiency in multiplication. Our design approach results in substantial error reductions, with normalized mean error distance (NMED) and mean relative error distance (MRED) decreasing by up to 74.84% and 82.04%, respectively, compared to existing approximate multipliers discussed in this letter. When implemented in a 32-nm CMOS technology, the approximate multiplier adopting the proposed 4-2 compressor achieves excellent hardware efficiency, reducing area, power, and energy consumption by up to 8.95%, 13.02%, and 13.02%, respectively, compared to other alternatives. Moreover, our design delivers enhanced performance in image processing, achieving up to a 4.84× increase in peak signal-to-noise ratio (PSNR) compared to other designs, all while optimizing hardware efficiency.
2026
8 VLSI_2025_54_LP
A Laddered-Inverter Nonoverlapping Clock Generator
This article presents a new and novel nonoverlapping clock (NOC) generator based on a laddered inverter (LI) circuit. Unlike conventional approaches, the proposed NOC combines the clock generation and pulsewidth-modulation (PWM) circuit into one integrated architecture, offering lower power consumption, smaller area, and a more robust solution. Furthermore, the proposed NOC offers an inherent guarantee of the nonoverlap (dead time) between the output signals thanks to the guaranteed monotonicity of the LI circuit, thus offering a layout-agnostic design. The proposed NOC can also offer dead-time reconfigurability with the help of multiplexers, allowing both calibration and fine-tuning of the dead times to meet specific requirements. We provide a comprehensive assessment of the proposed NOC through simulation and measurement results in 65-nm CMOS. Measured results show that the proposed NOC consumes 1 µW at 5 MHz with a 1-V supply, achieving more than 10× lower power consumption and 25% smaller area compared to the conventional clock circuit. The proposed NOC demonstrates the capability to generate waveforms with frequencies up to 3.5 GHz at a 1.2-V supply, as validated through simulations.
2026
9 VLSI_2025_55_LP
An Area-Efficient VLSI Architecture for High-Throughput Computation of the 2-D DWT
AIn this article, an area-efficient VLSI architecture scheme for high-throughput computation of the 2-D discrete wavelet transform (DWT) is proposed, effectively applied in the context of aircraft cargo hold scenes. The proposed architecture aims to reduce computation and storage resources while maintaining the DWT-IDWT reconstructed image quality for the 9/7 discrete wavelet. The hardware implementation formulae based on the flipping architecture have been modified to reduce RAM storage bit width. By transforming the coefficients of the formula into hardware-friendly values, the required multiplication operations are split into two stages of addition. On this basis, a pipelined architecture is constructed to set the critical path delay (CPD) of the architecture to be close to the delay of a single adder, Tₐ, thereby achieving a high throughput. Compared to existing architectures in the research field, the proposed single-level 2-D DWT architecture achieves resource savings on the field-programmable gate array (FPGA) platform while ensuring good image reconstruction quality. The advantages of the multilevel 2-D DWT are evaluated, and the results show similar results on the application-specific integrated circuit (ASIC) platform. The proposed architecture reduces average computation time by at least 35.54% while achieving delay of one-level decomposition, decreases the area-delay product (ADP) by at least 25.41%, and saves a significant amount of energy per image (EPI). Furthermore, the proposed folded architecture achieves close to 100% hardware utilization efficiency (HUE) in multilevel 2-D DWT computations.
2026
10 VLSI_2025_57_LP
A Two-Stage CMOS Amplifier With High Degree of Stability for All Capacitive Loads
This article presents the conception, design, and realization of a fully differential two-stage CMOS amplifier that is unconditionally stable for any value of the capacitive load. This is simply achieved by sending a scaled replica of the output stage current to the amplifier virtual ground in order to create a left half-plane (LHP) zero in the loop gain that either cancels or tracks the output pole in all process, voltage, and temperature (PVT) conditions. Consequently, from a stability point of view, the amplifier behavior resembles that of a single-pole OTA. Starting from an existing two-stage gain-programmable amplifier, designed in a 0.18-µm bipolar-CMOS-DMOS (BCD) process that was able to drive only 10 pF without encountering into stability issues, a simple circuit has been added to extend the stability to any capacitive load value. An interesting and unusual method, based on the frequency behavior of the unloaded closed-loop amplifier output impedance, has been introduced to further verify the unconditional stability of this solution. Measurements show a high degree of stability in any load conditions. In the used 0.18-µm BCD technology, silicon area and current consumption of the extra circuit are only 0.0004 mm² and 2 µA, respectively, with a 5-V power supply.
2026
11 VLSI_2025_58_LP
Revisiting Multiple ECC on High-Density NAND Flash Memory
Three-dimensional NAND flash memory using the advanced multibit-per-cell technique is widely adopted due to its high density. However, it faces the problem of deteriorating read performance and energy consumption due to decreased reliability. Low-density parity-check code (LDPC) is typically adopted as an error correction code (ECC) to encode data and provide fault tolerance. To reduce the cost, LDPC with a high code rate is always adopted. However, LDPC will lead to read retry operations when the accessed data are not successfully decoded, and such retry-induced performance degradation is serious, especially for modern high-density flash memory. In this work, a reliability-aware differential ECC (READECC) approach is proposed to reduce redundancy protection and storage cost of LDPC with a low cost. The basic idea is to adopt LDPC with a suitable code rate considering both data access characteristics and flash reliability characteristics. First, hot reads are strengthened based on the frequency of being accessed. Second, based on the reliability variation characteristics, the life of NAND flash is divided into three reliability periods. As the reliability period shifts, the code rate of the LDPC adjusts adaptively to minimize redundancy overhead. Furthermore, the READECC scheme provides further flexibility to support LDPC with strong error correction capability only when paired with a low redundancy rate. With careful design and evaluation on 3-D triple-level cell NAND flash memory, READECC achieves encouraging optimizations with a negligible cost.
2026
12 VLSI_2025_60_LP
A Nanopower EEG Low-Pass Filter Using Current-Sharing Vertical Differential Pairs
A follower-based gₘ – C low-pass filter that employs CMOS vertical source-couple-pair (VSCP) transconductors is proposed for practical use in EEG acquisition systems. The VSCP transconductor operates as a gₘ cell with current sharing and linearity enhancement features. It is featured in the first- and second-order gₘ – C sections cascaded to form a third-order low-pass filter targeting a 150-Hz bandwidth. To mitigate the effects of biasing current source mismatch, dynamic element matching (DEM) is optionally applied to the relevant biasing current source pairs, resulting in second harmonic distortion (HD2) and noise suppression. Implemented in a 0.18-µm process, the proposed filter consumes 16.3-nW power from a 1.2-V supply. Thanks to the DEM and VSCPS, the linear bandwidth is 150-Hz with linear input range (measured at 1% total harmonic distortion (THD)), whereas the dynamic range for less than 4 µV₍rms₎ noise is obtained to a filter dynamic range (DR) of 65.15 dB. Overall performance comparisons with other recent biomedical filters indicate that the figure of merit (FoM) of this proposed filter is comparable while the linear input range is larger.
2026
13 VLSI_2025_76_LP
High Efficiency Multiply-Accumulator Using Ternary Logic and Ternary Approximate Algorithm
A multiply-accumulator, often abbreviated as a MAC unit, is central to a multitude of computational tasks, particularly those tasks (such as neural networks) involving array-based mathematical computations. The quest for novel methods to efficiently store and process data in a MAC has become imperative. Recently, ternary logic has attracted significant attention due to its higher information density than conventional binary systems. However, though numerous studies have showcased ternary arithmetic techniques, advancements in ternary-based vector processing have been notably scarce. To bridge this gap, this work undertakes comprehensive study into the optimization of ternary MAC units. Firstly, we propose various ternary approximate algorithms, which allow 30% less power consumption and 21% compact area in comparison with the accurate design. Secondly, we design sophisticated ternary circuits and obtain 74%–86% lower power-delay-product (PDP) than previous works. Furthermore, we evaluate proposed ternary MAC unit using both carbon-nanotube field-effect transistor (CNFET) and silicon-based 180 nm CMOS processes. The simulation results show the ternary circuit is better than binary circuit in terms of both area (~45% less) and power (~30% less), highlighting its strong potential for practical applications.
2026
14 VLSI_2025_77_LP
Low-Power Subthreshold Voltage References for High-Temperature Applications
Advanced Internet-of-Things (IoT) devices are increasingly used in high-temperature environments, such as automotive, aerospace, defense, and industrial applications. High-temperature voltage references are crucial for these systems. This paper presents two topologies designed for high-temperature operation. The first topology minimizes leakage currents by using a replica branch to decouple leakage from the core, reducing its impact on the reference voltage. The second topology optimizes the operating temperature by employing one-stage amplifiers as buffers to handle junction leakage while maintaining body voltage. Both designs are fabricated in 180 nm CMOS process. The first design supports operation up to 140 °C with an average temperature coefficient (TC) of 70 ppm/°C, while the second design operates up to 180 °C with a TC of 64 ppm/°C. The designs also exhibit line sensitivities of 0.46 %/V and 0.31 %/V, PSRRs of –37.8 dB and –38.8 dB at 100 Hz, and power consumption of 111 pW and 136.8 pW at room temperature, respectively.
2026
15 VLSI_2025_84_LP
Low-Power High Precision Floating-Point Divider With Bidimensional Linear Approximation
In this paper we propose a novel approximate floating-point divider based on bidimensional linear approximation. In our approach, the mantissa quotient is seen as a function of the two input mantissas of the divider. The domain of this two-variable function is partitioned into nx × ny subregions, named tiles, where nx, ny are chosen as powers of two. In each tile the quotient is approximated with a linear combination of the input mantissas. To achieve fine accuracy, an optimization problem is formulated within each tile to determine the optimal coefficients for the linear combination, which minimize the Mean Relative Error Distance (MRED) of the divider. Furthermore, to make hardware implementation more effective, the minimization problem is appropriately modified to search for optimal quantized coefficients. The hardware structure of the divider only requires a small look-up table to store the linear approximation coefficients, and a carry save adder tree. The proposed architecture is highly tunable at design-time over a wide range of accuracy, depending on the number of tiles chosen for the approximation. The obtained results demonstrate error performance and hardware features superior to the state-of-the-art. The proposed dividers define the Pareto front, considering the trade-off between power-delay-product vs. MRED and area-delay-product vs. MRED, for MRED in the range of 4 × 10−3 − 2 × 10−2 . Application results for JPEG compression and tone mapping further highlight the strength of our proposal, which exhibits Structural Similarity Index (SSIM) very close to 1 in all cases and Peak Signal-to-Noise Ratio (PSNR) up to 45 dB.
2026
16 VLSI_2025_85_LP
Fanout-Based Reliability Model for SER Estimation in Combinational Circuits
Soft errors in Integrated Circuits (ICs) have always been a major concern, particularly as CMOS technology nodes continue to shrink, resulting in higher frequency, lower power, and smaller areas, exacerbating radiation-induced soft errors. Therefore, Single Event Transient (SET) has become a crucial consideration in designing modern radiation-tolerant circuits, as it has the potential to cause failures in circuit outputs. This paper employs the concept of signal probability for transient fault propagation in circuits. Considering the issue of transient fault-masking, an error propagation model is presented for each fault-masking case. Furthermore, approaches are proposed for both probabilistic and time-based scenarios to address the impact of re-convergent paths on transient faults. Since considering re-convergent paths increases computational complexity, three computational approximations are proposed in this paper as a compromise to reduce the size of simulation runs as much as possible. We compared the simulation results with Monte-Carlo methods and HSPICE-based simulations to validate the proposed method. According to the comparison results on ISCAS-85 benchmarks, the proposed approach for estimating the SER exhibits an average accuracy of 95% while consuming less than 5% compared to traditional fault injection.
2026

AREA EFFICIENT/ TIMING & DELAY REDUCTION

S.No Code Title Year
1 VLSI_2025_05_AE
Manipulated Lookup Table Method for Efficient High-Performance Modular Multiplier
Introduces a LUT-based optimization technique for modular multiplication. Improves throughput and reduces hardware complexity for cryptographic systems.
2025
2 VLSI_2025_06_AE
Low-Power High Precision Floating-Point Divider With Bidimensional Linear Approximation
Proposes a floating-point divider using bidimensional linear approximation. Reduces power consumption while maintaining high numerical accuracy.
2025
3 VLSI_2025_08_AE
Leveraging Highly Approximated Multipliers in DNN Inference
Uses approximate multipliers to accelerate deep neural network inference. Trades minor accuracy loss for major energy efficiency gains.
2025
4 VLSI_2025_09_AE
Hardware Implementation of Improved Banker’s Fixed-Point Rounding Algorithm
Implements an optimized Banker’s rounding technique in hardware. Enhances numerical stability in fixed-point arithmetic systems.
2025
5 VLSI_2025_10_AE
Fail-Safe Logic Design Strategies Within Modern FPGA Architectures
Introduces fault-tolerant logic design techniques for FPGA systems. Improves reliability under transient and permanent faults.
2025
6 VLSI_2025_12_AE
Enhancing Delay-Driven LUT Mapping With Boolean Decomposition
Optimizes LUT mapping using Boolean decomposition techniques. Reduces delay in FPGA logic synthesis.
2025
7 VLSI_2025_14_AE
Embedded Hardware-Efficient FPGA Architecture for SVM Learning and Inference
Designs an FPGA-based SVM accelerator for machine learning tasks. Optimized for low power and fast inference.
2025
8 VLSI_2025_16_AE
Design and Analysis of Energy Efficient Approximate Multipliers for Image Processing and Deep Neural Network
Presents approximate multiplier architectures for DNN and image processing. Reduces energy consumption with acceptable error bounds.
2025
9 VLSI_2025_18_AE
C4TERO: Configurable Cascaded Carry Chains for High Reliability TERO PUFs on FPGAs
Introduces a secure hardware PUF design using configurable carry chains. Enhances FPGA-based security primitives.
2025
10 VLSI_2025_19_AE
AXI Hardware Accelerator for McEliece on FPGA Embedded Systems
Implements McEliece cryptosystem accelerator using AXI interface. Optimized for post-quantum cryptography hardware.
2025
11 VLSI_2025_23_AE
A High-Resolution Calibration Method for Time-to-Digital Converter of Lidar
Proposes calibration techniques for TDCs used in LiDAR systems. Improves distance measurement accuracy.
2025
12 VLSI_2025_24_AE
Energy-Efficient Syndrome Calculation Architecture for BCH Decoders
Optimizes BCH decoding syndrome computation for lower energy consumption. Suitable for communication systems.
2025
13 VLSI_2025_25_AE
Low-Complexity Implementation of Real-Time Reconfigurable Low-Pass Equalizers
Designs reconfigurable equalizer architecture for real-time filtering. Reduces hardware cost and improves flexibility.
2025
14 VLSI_2025_26_AE
MSDF-Based MAC for Energy-Efficient Neural Networks
Introduces MSDF-based MAC unit for neural networks. Improves energy efficiency in AI hardware accelerators.
2025
15 VLSI_2025_27_AE
A Study of Signed-Digit Hybrid Stochastic Number for Arithmetic Computing
Explores hybrid stochastic number systems for arithmetic computation. Enhances fault tolerance and energy efficiency.
2025
16 VLSI_2025_28_AE
Manipulated Lookup Table Method for Efficient High-Performance Modular Multiplier
Improves LUT-based modular multipliers for cryptographic acceleration. Reduces latency and hardware overhead.
2025
17 VLSI_2025_29_AE
SPEED: A Scalable RISC-V Vector Processor Enabling Efficient Multiprecision DNN Inference
Designs scalable RISC-V vector processor for DNN inference. Optimized for multi-precision computation workloads.
2025
18 VLSI_2025_30_AE
FPGA-Based Low-Bit and Lightweight Fast Light Field Depth Estimation
Implements depth estimation for light field imaging on FPGA. Optimized for low-bit and real-time processing.
2025
19 VLSI_2025_31_AE
A Comparative Analysis of Low Temperature and Room Temperature Circuit Operation
Analyzes circuit performance under different temperature conditions. Provides insights for cryogenic and standard operation designs.
2025
20 VLSI_2025_39_AE
A 4.86-pJ/b Energy-Efficient Fully Parallel Stochastic LDPC Decoder With Two-Stage Shared Memory
Designs ultra-low-energy LDPC decoder using stochastic methods. Optimized for high-throughput communication systems.
2025
21 VLSI_2025_40_AE
A Pay-Per-ISE RISC-V Processor With Hardware-Assisted Orthogonal Obfuscation
Implements secure RISC-V processor with hardware obfuscation techniques. Enhances IP protection in FPGA systems.
2025
22 VLSI_2025_48_AE
An Efficient Test Architecture Using Hybrid Built-In Self-Test for Processing-in-Memory
Proposes hybrid BIST architecture for in-memory computing systems. Improves fault detection efficiency.
2025
23 VLSI_2025_53_AE
Strassen Multisystolic Array Hardware Architectures
Implements Strassen algorithm using systolic arrays. Enhances matrix multiplication performance.
2025
24 VLSI_2025_61_AE
High-Speed Compute-Efficient Bandit Learning for Many Arms
Designs hardware-efficient accelerator for multi-armed bandit learning. Optimized for decision-making AI systems.
2025
25 VLSI_2025_62_AE
Design of Low-Cost and High-Accurate 8-bit Logarithmic Floating-Point Arithmetic Circuits
Proposes logarithmic floating-point arithmetic circuits. Balances accuracy and hardware efficiency.
2025
26 VLSI_2025_63_AE
An Area-Energy-Efficient 64–2048 Point FFT With Approximate Plane-Fitting Complex Multipliers
Implements FFT architecture using approximate multipliers. Reduces area and energy consumption significantly.
2025
27 VLSI_2025_69_AE
A Reconfigurable Floating-Point Division and Square Root Architecture for High-Precision Softmax
Designs reconfigurable divider and square root unit for Softmax computation. Optimized for AI workloads.
2025
28 VLSI_2025_81_AE
In-Memory Implementation of an Approximate Adder With Reduced Latency and Error
Implements approximate adder in memory architecture. Reduces latency and improves energy efficiency.
2025
29 VLSI_2025_82_AE
RIVL: A Low-Cost SoC Agile Development Platform for Multiple RISC-V Processors Design and Verification
Provides SoC platform for rapid RISC-V design and verification. Supports multi-core development and testing.
2025

HIGH SPEED DATA TRANSMISSION

S.No Code Title Year
1VLSI_2025_02_HS
Scalable 5G NR Rate-Matcher and Rate-Dematcher for Efficient Use in FPGA Accelerators
Implements scalable rate matching and dematching architecture for 5G NR systems. Optimized for FPGA-based baseband acceleration with reduced latency.
2025
2VLSI_2025_04_HS
Method to Determine Quantization-Related Parameters of the Digital-to-Time Converter in a Fractional-N Frequency Synthesizer
Presents a calibration method for DTC quantization parameters in fractional-N PLL systems. Improves frequency stability and phase accuracy.
2025
3VLSI_2025_07_HS
Longest-First Search Using Bloom Filter: Algorithm and FPGA Implementation
Proposes Bloom filter-based longest-first search algorithm with FPGA implementation. Reduces memory overhead and improves search speed.
2025
4VLSI_2025_13_HS
Enhanced Fetal Arrhythmia Classification by Non-Invasive ECG Using Cross Domain Feature and Spatial Differences Windows Information
Improves fetal ECG classification using cross-domain feature extraction. Enhances detection accuracy for arrhythmia analysis.
2025
5VLSI_2025_15_HS
Design and HDL Implementation of Pulse-Arrival-Time Estimation Using XGBoost Regression With Tree-Recycling Architecture
Implements XGBoost-based pulse arrival time estimation in hardware. Optimized with tree-recycling for area and power reduction.
2025
6VLSI_2025_20_HS
Analysis and Mitigation of Excess Phase Noise and Spurs in Digital-to-Time-Converter-Enhanced Fractional-N Frequency Synthesizers
Analyzes phase noise in DTC-enhanced synthesizers and proposes mitigation techniques for cleaner frequency generation.
2025
7VLSI_2025_21_HS
Advanced Quantization Schemes to Increase Accuracy, Reduce Area, and Lower Power Consumption in FFT Architectures
Introduces optimized quantization methods for FFT architectures. Balances precision, area, and power efficiency.
2025
8VLSI_2025_22_HS
A Hybrid Speech Enhancement Technique Based on Discrete Wavelet Transform and Spectral Subtraction
Combines DWT and spectral subtraction for improved speech enhancement. Reduces noise while preserving speech clarity.
2025
9VLSI_2025_33_HS
ReAdapt-II: Energy-Quality Optimizations for VLSI Adaptive Filters Through Automatic Reconfiguration and Built-In Iterative Dividers
Proposes adaptive filter optimization with dynamic reconfiguration. Enhances energy efficiency and signal quality trade-offs.
2025
10VLSI_2025_37_HS
FANNS: An FPGA-Based Approximate Nearest-Neighbor Search Accelerator
Accelerates nearest-neighbor search using FPGA-based approximate computing. Improves search speed for large datasets.
2025
11VLSI_2025_41_HS
Adjacent Channel Weight Dependable RLS Adaptive Filter With VMD Based Artifact Removal Mechanism in Fetal ECG Separation
Enhances fetal ECG separation using adaptive RLS filtering and VMD-based artifact removal. Improves biomedical signal clarity.
2025
12VLSI_2025_42_HS
An Energy-Efficient Configurable 1-D CNN-Based Multi-Lead ECG Classification Coprocessor for Wearable Cardiac Monitoring Devices
Implements CNN-based ECG classifier for wearable devices. Focuses on low power and real-time cardiac monitoring.
2025
13VLSI_2025_43_HS
A Hybrid Speech Enhancement Technique Based on Discrete Wavelet Transform and Spectral Subtraction
Second optimized implementation of hybrid speech enhancement. Improves noise suppression efficiency.
2025
14VLSI_2025_44_HS
An Area-Efficient VLSI Architecture for High-Throughput Computation of the 2-D DWT
Designs efficient hardware architecture for 2-D DWT computation. Optimized for image processing applications.
2025
15VLSI_2025_46_HS
AdAM: Adaptive Approximate Multiplier for Fault Tolerance in DNN Accelerators
Introduces adaptive approximate multipliers for neural networks. Improves fault tolerance in DNN accelerators.
2025
16VLSI_2025_48_HS
Fault Bounding On-Die BCH Codes for Improving Reliability of System ECC
Enhances ECC reliability using on-die BCH coding. Improves fault tolerance in memory systems.
2025
17VLSI_2025_50_HS
A 7.4–9.2-GHz Fractional-N Differential Sampling PLL Based on Phase-Domain and Voltage-Domain Hybrid Calibration
Presents PLL architecture with hybrid calibration techniques. Improves frequency stability and phase noise reduction.
2025
18VLSI_2025_51_HS
Area-Efficient Pipeline Architecture for Serial Real-Valued Fast Fourier Transform
Implements pipelined FFT architecture optimized for area efficiency and real-time processing.
2025
19VLSI_2025_52_HS
ATEP: An Asynchronous Timing Error Prediction Circuit With Adaptive Voltage and Frequency Scaling
Predicts timing errors in asynchronous circuits. Enables adaptive voltage-frequency scaling for energy saving.
2025
20VLSI_2025_55_HS
An Area-Efficient VLSI Architecture for High-Throughput Computation of the 2-D DWT
Second optimized 2-D DWT architecture focusing on throughput and reduced area usage.
2025
21VLSI_2025_59_HS
A 25-GHz PLL Achieving 8-ns Phase-Shifting Time With Double-Path Modulation Scheme
High-frequency PLL design with fast phase shifting. Suitable for high-speed communication systems.
2025
22VLSI_2025_64_HS
A Sub-0.9-ps Static Phase Offset 500 MHz Delay-Locked Loop With a Large Gain Phase Detector
Designs ultra-precise DLL with sub-picosecond phase offset. Enhances clock synchronization accuracy.
2025
23VLSI_2025_66_HS
Toward High-Performance Network Coding: FPGA Acceleration With Bounded-Value Generators
Accelerates network coding using FPGA-based bounded-value generation. Improves data transmission efficiency.
2025
24VLSI_2025_67_HS
An Implementation Method for 100% ASK Modulation Applied to NFC Tags
Implements ASK modulation technique for NFC communication. Optimized for low-power tag systems.
2025
25VLSI_2025_68_HS
A Low-Latency Feed-Forward Architecture for Image Filtering via Row-by-Row Processing
Proposes feed-forward filtering architecture for image processing. Reduces latency in row-wise computation.
2025
26VLSI_2025_72_HS
Optimized Algorithms for FPGA Implementation of PDCCH Chain for 5G-NR Base Station
Optimizes PDCCH chain for 5G base stations. Improves FPGA throughput and decoding efficiency.
2025
27VLSI_2025_73_HS
Design and Implementation of a Low-Complexity Continuously Variable Digital Filter Using a Novel Farrow-Equivalent-Newton Structure-Based Fractional Delay Filter
Implements variable digital filter using Farrow-based structure. Enhances flexibility and reduces computation cost.
2025
28VLSI_2025_83_HS
VLSI Architecture Design for Compact Shortcut Denoising Autoencoder Neural Network of ECG Signal
Designs compact autoencoder architecture for ECG signal denoising. Optimized for wearable health devices.
2025

VLSI Design of Image, Video and Audio Processing

S.No Code Title Year
1 VLSI_2025_03_IM
NN2FPGA: Optimizing CNN Inference on FPGAs With Binary Integer Programming
Proposes a binary integer programming approach to optimize CNN mapping on FPGA hardware. Improves resource allocation, reduces latency, and enhances inference efficiency.
2025
2 VLSI_2025_32_IM
Detect and Replace: Efficient Soft Error Protection of FPGA-Based CNN Accelerators
Introduces a detect-and-replace mechanism for mitigating soft errors in FPGA-based CNN accelerators. Enhances reliability with minimal hardware overhead.
2025
3 VLSI_2025_34_IM
Research on Hardware Acceleration of Traffic Sign Recognition Based on Spiking Neural Network and FPGA Platform
Develops FPGA-based acceleration for spiking neural networks used in traffic sign recognition. Achieves low-power and real-time classification performance.
2025
4 VLSI_2025_36_IM
An Interpolation-Free Fractional Motion Estimation Algorithm and Hardware Implementation for VVC
Proposes an interpolation-free motion estimation method for VVC video coding. Reduces computational complexity while maintaining encoding quality.
2025
5 VLSI_2025_65_IM
Design of a Low-Power Analog Integrated Deep Convolutional Neural Network
Presents an analog CNN architecture designed for ultra-low-power deep learning inference. Suitable for edge AI applications.
2025
6 VLSI_2025_70_IM
Hardware-Efficient Architecture for Multiple Quantized Gaussian Noise Generation
Introduces an efficient hardware design for generating quantized Gaussian noise. Optimized for stochastic computing and ML accelerators.
2025
7 VLSI_2025_74_IM
MBS: A High-Precision Approximation Method for Softmax and Efficient Hardware Implementation
Proposes an approximation method for Softmax computation in hardware. Reduces complexity while maintaining high numerical precision.
2025
8 VLSI_2025_75_IM
EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language Models
Designs a hybrid CPU-FPGA accelerator for efficient LLM inference at the edge. Reduces latency and energy consumption significantly.
2025
9 VLSI_2025_78_IM
A High-Throughput FPGA Accelerator for Lightweight CNNs With Balanced Dataflow
Proposes a balanced dataflow architecture for lightweight CNN acceleration on FPGA. Improves throughput and reduces bottlenecks.
2025
10 VLSI_2025_79_IM
A Novel Transform Accelerator With Fast Kernel Selection and Efficient Transform Circuit
Introduces a transform accelerator with dynamic kernel selection. Enhances processing speed for signal and image transformations.
2025
11 VLSI_2025_80_IM
Rational-Exponent Filters with Applications to Generalized Exponent Filters
Proposes rational-exponent filter structures for flexible signal processing. Enables generalized filtering with reduced computational cost.
2025

+91 9566475911 | +91 9962588976 | projects@lemenizinfotech.com