IEEE Transactions on VLSI 2025 Research Papers

IEEE Transactions on VLSI 2025 Research Papers, We are offering iee projects 2024-2025 in latest technology like Java, dot net, android, embedded, matlab, vlsi, hadoop, power elctronics, power system, mechanical, civil projects. IEEE Master is a unit of LeMeniz Technologies. We guide all final year M.E/M.Tech, B.E/B.Tech, MPhil, MCA, BCA, M.Sc, B.Sc, and Diploma students for their Academic Projects to get best results.

Why Us?

24 X 7 Online Supports
No False Commitment
Affordable Price
On Time Project Delivery
Project Execution on student laptop
Project Completion Certificate
Output Video

IEEE Project Technology

Java
Dotnet
Android
Hadoop
NS2
Matlab
Embedded
VLSI

LOW POWER

S.No	Code	Title	Year
1	VLSI_2025_01_LP	46-nA High-PSR CMOS Buffered Voltage Reference With 1.2–5 V and -40°C to 125°C Operating Range A nanopower, buffered CMOS voltage reference designed to operate across the entire industrial temperature range (from −40 ◦C to 125 ◦C) and with an input voltage range from 1.2 to 5 V (automotive applications) is presented. The solution provides 390 mV at 20 ◦C and is implemented in a standard BCD technology featuring 160-nm CMOS devices. It is characterized by an average temperature coefficient of 200 ppm/◦C, a line sensitivity (LS) of 0.138%/V, and a power supply rejection of −83 dB at 100 Hz. In addition, the circuit occupies a die area of 0.146 mm2 (with the reference circuit alone covering 0.043 mm2 ) and maintains a highly stable current consumption of around 45 nA across various process and input voltage conditions (25 nA for the reference circuit alone) while providing a maximum output current of 630 µA with a load regulation of 0.016 mV/µA..	2026
2	VLSI_2025_11_LP	Exploiting Body-Driven Feedbacks in Physical Unclonable Functions for Ultra Low Voltage, Ultra Low Power Applications: A 0.3 V Weak-PUF This paper introduces an innovative approach to designing a mismatched current mirror with a fully unbalanced output, significantly reducing the minimum supply voltage requirements for Regulated Cascode Current Mirror (RCCM) Physical Unclonable Functions (PUFs). Leveraging body-driven feedback mechanisms, the proposed circuit reliably operates with supply voltages as low as 0.3V, maintaining stable power consumption through a reference bias current. The resulting PUF achieves remarkable energy efficiency, consuming only 0.3 fJ per bit, without compromising statistical performance. It exhibits a response bias of 49.42%, a reliability of 99.483%, and a uniqueness of 50.176%. Validation of this novel approach is conducted through simulations and measurements on a 130nm CMOS test-chip, considering a nominal supply voltage of 0.3V, ±10% supply voltage variations, and a temperature range from 0 ◦C to 75◦C. Rigorous experimental verification on 20 chip samples, along with detailed explanations of design methodologies, underscores the robustness and practicality of the proposed Body-PUF design. Comparative analyses against state-of-the-art literature reveal that the Body-PUF outperforms previous PUF designs in Figures of Merit (FOM), making it promising for realworld authentication scenarios. Its outstanding trade-off between performance and practicality positions it as a compelling solution for secure applications, including Internet of Things (IoT) devices and other security-critical systems.	2026
3	VLSI_2025_17_LP	Cryogenic Front-End ASICs for Low-Noise Readout of Charge Signals This paper presents design details and measurement results of LArASIC, a front-end application specific integrated circuit (ASIC) designed for low-noise readout of charge signals generated in neutrino study experiments within liquid argon time projection chambers. LArASIC comprises of 16-channels of programmable charge amplification and pulse shaping stages that provide a voltage readout proportional to the input charge and was optimized for operation at liquid argon temperature, i.e., 89 K. The chip was fabricated in a 180 nm CMOS process. Measurements at liquid nitrogen temperature, i.e., 77 K, indicate that the channel outputs have high linearity (INL < 0.1%) within the operating range, an equivalent noise charge of 534 electrons for a peaking time of 1µs and a detector capacitance of 150 pF, and a worst-case inter-channel cross-talk of 0.35%. The paper also presents design choices made in the process of migrating LArASIC to CHARMS, an ASIC to be fabricated in a 65 nm process that includes all features provided by LArASIC, along with additional digital programmability for improved robustness and flexibility. CHARMS is intended for use in future high-energy physics experiments that require high-resolution charge or light readout with shorter pulse peaking times.	2026
4	VLSI_2025_35_LP	Low-Power and High-Speed SRAM Cells With Double-Node Upset Self-Recovery for Reliable Applications Transistor sizing and spacing are constantly decreasing due to the continuous advancement of CMOS technology. The charge of the sensitive nodes in the static random access memory (SRAM) cell gradually decreases, making the SRAM cell more and more sensitive to soft errors, such as single-node upsets (SNUs) and double-node upsets (DNUs). Therefore, two types of radiation-hardened SRAM cells are proposed in this article. First, a low-power DNU self-recovery S6P8N cell is proposed. This cell can realize SNU self-recovery from all sensitive nodes as well as realize partial DNU self-recovery and has low-power consumption overhead. Second, we propose a high-speed DNU self-recovery S8P6N cell, which has a soft-error tolerance equivalent to the S6P8N. Furthermore, it reduces the read access time (RAT) and write access time (WAT). Simulation results show that the proposed cells are self-recovery for all SNUs and most of DNUs. Compared with RHBD12, QC8M2T, QC8CE12T, RHM10T, SEA14T, RHM-12T, S8P4R, S8P8M, RH-14T, HRLP16T, GC16T, and HRH-12T, the average power consumption of S6P8N is reduced by 48.78%, and the average power consumption of S8P6N is reduced by 26.34%. At the same time, WAT and RAT are reduced by 9.07% and 36.84%, respectively.	2026
5	VLSI_2025_37_LP	A Single-Ended High-Voltage-Compliant 11-bit Current-Steering Digital-to-Analog Converter for Adaptive Noise Cancellation in Power Over Data Line Networks Automotive Ethernet is considered to be the backbone of future in-vehicle data communication. One main feature is its ability to simultaneously transmit data and energy via power over data lines (PoDL). This article proposes the design of a single-ended high-voltage (HV)-compliant 11-bit current-steering digital-to-analog converter (DAC). The converter is tailored for the utilization as digitally controlled current source in adaptive noise-cancellation filter for PoDL networks. Designed in an HV-compliant 180-nm bipolar complementary metal-oxide-semiconductor (BiCMOS) semiconductor technology, the DAC features a monolithically combined topology of two identical 10-bit low-voltage (LV) current-steering DACs supplied at 1.8 V and two complementary HV-compliant output current stages. Main design features of the segmented LV DAC are the utilization of single-ended current cells with an optimized switching logic, proposed to enhance the cells inherent performance and energy efficiency. Furthermore, a newly derived 0.4° asymmetric output walk switching scheme is investigated. At a maximum output voltage swing of 6 V, the proposed DAC provides a bidirectional output current with the amplitude of up to 50 mA. The proposed DAC exhibits the highest voltage compliance combined with large output current capability reported to date for HV-DACs. It also features excellent dynamic performance with a spurious free dynamic range (SFDR) of 57.8 dB as well as measured four sample rates enabling up to 1.6 MS/s and a maximum integral nonlinearity (INL) error of 1.61 LSB and a differential nonlinearity (DNL) error of 1.05 LSB.	2026
6	VLSI_2025_45_LP	Leveraging Highly Approximated Multipliers in DNN Inference In this work, we present our control variate approximation technique that enables the exploitation of highly approximate multipliers in Deep Neural Network (DNN) accelerators. Our approach does not require retraining and significantly decreases the induced error due to approximate multiplications, improving power efficiency and accuracy. As a result, control variate approximation enables satisfying tight error resilience requirements while preserving the power savings. Our experimental evaluation, across six different DNNs and several approximate multiplier designs, demonstrates the versatility of control variate technique and shows that compared to the accurate designs, it achieves the same performance, 45% power reduction, and less than 1% average accuracy loss. Compared to the corresponding approximate designs without using our technique, the error-correction of the control variate method improves the accuracy by 1.9x on average.	2026
7	VLSI_2025_47_LP	Design of a Hardware-Efficient Approximate 4-2 Compressor for Multiplications in Image Processing This letter presents a novel hardware-efficient approximate 4-2 compressor design that significantly enhances accuracy through a systematic analysis of input patterns obtained from practical applications. We incorporate a majority operation and a compound gate in the compressor design to effectively boost hardware efficiency in multiplication. Our design approach results in substantial error reductions, with normalized mean error distance (NMED) and mean relative error distance (MRED) decreasing by up to 74.84% and 82.04%, respectively, compared to existing approximate multipliers discussed in this letter. When implemented in a 32-nm CMOS technology, the approximate multiplier adopting the proposed 4-2 compressor achieves excellent hardware efficiency, reducing area, power, and energy consumption by up to 8.95%, 13.02%, and 13.02%, respectively, compared to other alternatives. Moreover, our design delivers enhanced performance in image processing, achieving up to a 4.84× increase in peak signal-to-noise ratio (PSNR) compared to other designs, all while optimizing hardware efficiency.	2026
8	VLSI_2025_54_LP	A Laddered-Inverter Nonoverlapping Clock Generator This article presents a new and novel nonoverlapping clock (NOC) generator based on a laddered inverter (LI) circuit. Unlike conventional approaches, the proposed NOC combines the clock generation and pulsewidth-modulation (PWM) circuit into one integrated architecture, offering lower power consumption, smaller area, and a more robust solution. Furthermore, the proposed NOC offers an inherent guarantee of the nonoverlap (dead time) between the output signals thanks to the guaranteed monotonicity of the LI circuit, thus offering a layout-agnostic design. The proposed NOC can also offer dead-time reconfigurability with the help of multiplexers, allowing both calibration and fine-tuning of the dead times to meet specific requirements. We provide a comprehensive assessment of the proposed NOC through simulation and measurement results in 65-nm CMOS. Measured results show that the proposed NOC consumes 1 µW at 5 MHz with a 1-V supply, achieving more than 10× lower power consumption and 25% smaller area compared to the conventional clock circuit. The proposed NOC demonstrates the capability to generate waveforms with frequencies up to 3.5 GHz at a 1.2-V supply, as validated through simulations.	2026
9	VLSI_2025_55_LP	An Area-Efficient VLSI Architecture for High-Throughput Computation of the 2-D DWT AIn this article, an area-efficient VLSI architecture scheme for high-throughput computation of the 2-D discrete wavelet transform (DWT) is proposed, effectively applied in the context of aircraft cargo hold scenes. The proposed architecture aims to reduce computation and storage resources while maintaining the DWT-IDWT reconstructed image quality for the 9/7 discrete wavelet. The hardware implementation formulae based on the flipping architecture have been modified to reduce RAM storage bit width. By transforming the coefficients of the formula into hardware-friendly values, the required multiplication operations are split into two stages of addition. On this basis, a pipelined architecture is constructed to set the critical path delay (CPD) of the architecture to be close to the delay of a single adder, Tₐ, thereby achieving a high throughput. Compared to existing architectures in the research field, the proposed single-level 2-D DWT architecture achieves resource savings on the field-programmable gate array (FPGA) platform while ensuring good image reconstruction quality. The advantages of the multilevel 2-D DWT are evaluated, and the results show similar results on the application-specific integrated circuit (ASIC) platform. The proposed architecture reduces average computation time by at least 35.54% while achieving delay of one-level decomposition, decreases the area-delay product (ADP) by at least 25.41%, and saves a significant amount of energy per image (EPI). Furthermore, the proposed folded architecture achieves close to 100% hardware utilization efficiency (HUE) in multilevel 2-D DWT computations.	2026
10	VLSI_2025_57_LP	A Two-Stage CMOS Amplifier With High Degree of Stability for All Capacitive Loads This article presents the conception, design, and realization of a fully differential two-stage CMOS amplifier that is unconditionally stable for any value of the capacitive load. This is simply achieved by sending a scaled replica of the output stage current to the amplifier virtual ground in order to create a left half-plane (LHP) zero in the loop gain that either cancels or tracks the output pole in all process, voltage, and temperature (PVT) conditions. Consequently, from a stability point of view, the amplifier behavior resembles that of a single-pole OTA. Starting from an existing two-stage gain-programmable amplifier, designed in a 0.18-µm bipolar-CMOS-DMOS (BCD) process that was able to drive only 10 pF without encountering into stability issues, a simple circuit has been added to extend the stability to any capacitive load value. An interesting and unusual method, based on the frequency behavior of the unloaded closed-loop amplifier output impedance, has been introduced to further verify the unconditional stability of this solution. Measurements show a high degree of stability in any load conditions. In the used 0.18-µm BCD technology, silicon area and current consumption of the extra circuit are only 0.0004 mm² and 2 µA, respectively, with a 5-V power supply.	2026
11	VLSI_2025_58_LP	Revisiting Multiple ECC on High-Density NAND Flash Memory Three-dimensional NAND flash memory using the advanced multibit-per-cell technique is widely adopted due to its high density. However, it faces the problem of deteriorating read performance and energy consumption due to decreased reliability. Low-density parity-check code (LDPC) is typically adopted as an error correction code (ECC) to encode data and provide fault tolerance. To reduce the cost, LDPC with a high code rate is always adopted. However, LDPC will lead to read retry operations when the accessed data are not successfully decoded, and such retry-induced performance degradation is serious, especially for modern high-density flash memory. In this work, a reliability-aware differential ECC (READECC) approach is proposed to reduce redundancy protection and storage cost of LDPC with a low cost. The basic idea is to adopt LDPC with a suitable code rate considering both data access characteristics and flash reliability characteristics. First, hot reads are strengthened based on the frequency of being accessed. Second, based on the reliability variation characteristics, the life of NAND flash is divided into three reliability periods. As the reliability period shifts, the code rate of the LDPC adjusts adaptively to minimize redundancy overhead. Furthermore, the READECC scheme provides further flexibility to support LDPC with strong error correction capability only when paired with a low redundancy rate. With careful design and evaluation on 3-D triple-level cell NAND flash memory, READECC achieves encouraging optimizations with a negligible cost.	2026
12	VLSI_2025_60_LP	A Nanopower EEG Low-Pass Filter Using Current-Sharing Vertical Differential Pairs A follower-based gₘ – C low-pass filter that employs CMOS vertical source-couple-pair (VSCP) transconductors is proposed for practical use in EEG acquisition systems. The VSCP transconductor operates as a gₘ cell with current sharing and linearity enhancement features. It is featured in the first- and second-order gₘ – C sections cascaded to form a third-order low-pass filter targeting a 150-Hz bandwidth. To mitigate the effects of biasing current source mismatch, dynamic element matching (DEM) is optionally applied to the relevant biasing current source pairs, resulting in second harmonic distortion (HD2) and noise suppression. Implemented in a 0.18-µm process, the proposed filter consumes 16.3-nW power from a 1.2-V supply. Thanks to the DEM and VSCPS, the linear bandwidth is 150-Hz with linear input range (measured at 1% total harmonic distortion (THD)), whereas the dynamic range for less than 4 µV₍rms₎ noise is obtained to a filter dynamic range (DR) of 65.15 dB. Overall performance comparisons with other recent biomedical filters indicate that the figure of merit (FoM) of this proposed filter is comparable while the linear input range is larger.	2026
13	VLSI_2025_76_LP	High Efficiency Multiply-Accumulator Using Ternary Logic and Ternary Approximate Algorithm A multiply-accumulator, often abbreviated as a MAC unit, is central to a multitude of computational tasks, particularly those tasks (such as neural networks) involving array-based mathematical computations. The quest for novel methods to efficiently store and process data in a MAC has become imperative. Recently, ternary logic has attracted significant attention due to its higher information density than conventional binary systems. However, though numerous studies have showcased ternary arithmetic techniques, advancements in ternary-based vector processing have been notably scarce. To bridge this gap, this work undertakes comprehensive study into the optimization of ternary MAC units. Firstly, we propose various ternary approximate algorithms, which allow 30% less power consumption and 21% compact area in comparison with the accurate design. Secondly, we design sophisticated ternary circuits and obtain 74%–86% lower power-delay-product (PDP) than previous works. Furthermore, we evaluate proposed ternary MAC unit using both carbon-nanotube field-effect transistor (CNFET) and silicon-based 180 nm CMOS processes. The simulation results show the ternary circuit is better than binary circuit in terms of both area (~45% less) and power (~30% less), highlighting its strong potential for practical applications.	2026
14	VLSI_2025_77_LP	Low-Power Subthreshold Voltage References for High-Temperature Applications Advanced Internet-of-Things (IoT) devices are increasingly used in high-temperature environments, such as automotive, aerospace, defense, and industrial applications. High-temperature voltage references are crucial for these systems. This paper presents two topologies designed for high-temperature operation. The first topology minimizes leakage currents by using a replica branch to decouple leakage from the core, reducing its impact on the reference voltage. The second topology optimizes the operating temperature by employing one-stage amplifiers as buffers to handle junction leakage while maintaining body voltage. Both designs are fabricated in 180 nm CMOS process. The first design supports operation up to 140 °C with an average temperature coefficient (TC) of 70 ppm/°C, while the second design operates up to 180 °C with a TC of 64 ppm/°C. The designs also exhibit line sensitivities of 0.46 %/V and 0.31 %/V, PSRRs of –37.8 dB and –38.8 dB at 100 Hz, and power consumption of 111 pW and 136.8 pW at room temperature, respectively.	2026
15	VLSI_2025_84_LP	Low-Power High Precision Floating-Point Divider With Bidimensional Linear Approximation In this paper we propose a novel approximate floating-point divider based on bidimensional linear approximation. In our approach, the mantissa quotient is seen as a function of the two input mantissas of the divider. The domain of this two-variable function is partitioned into nx × ny subregions, named tiles, where nx, ny are chosen as powers of two. In each tile the quotient is approximated with a linear combination of the input mantissas. To achieve fine accuracy, an optimization problem is formulated within each tile to determine the optimal coefficients for the linear combination, which minimize the Mean Relative Error Distance (MRED) of the divider. Furthermore, to make hardware implementation more effective, the minimization problem is appropriately modified to search for optimal quantized coefficients. The hardware structure of the divider only requires a small look-up table to store the linear approximation coefficients, and a carry save adder tree. The proposed architecture is highly tunable at design-time over a wide range of accuracy, depending on the number of tiles chosen for the approximation. The obtained results demonstrate error performance and hardware features superior to the state-of-the-art. The proposed dividers define the Pareto front, considering the trade-off between power-delay-product vs. MRED and area-delay-product vs. MRED, for MRED in the range of 4 × 10−3 − 2 × 10−2 . Application results for JPEG compression and tone mapping further highlight the strength of our proposal, which exhibits Structural Similarity Index (SSIM) very close to 1 in all cases and Peak Signal-to-Noise Ratio (PSNR) up to 45 dB.	2026
16	VLSI_2025_85_LP	Fanout-Based Reliability Model for SER Estimation in Combinational Circuits Soft errors in Integrated Circuits (ICs) have always been a major concern, particularly as CMOS technology nodes continue to shrink, resulting in higher frequency, lower power, and smaller areas, exacerbating radiation-induced soft errors. Therefore, Single Event Transient (SET) has become a crucial consideration in designing modern radiation-tolerant circuits, as it has the potential to cause failures in circuit outputs. This paper employs the concept of signal probability for transient fault propagation in circuits. Considering the issue of transient fault-masking, an error propagation model is presented for each fault-masking case. Furthermore, approaches are proposed for both probabilistic and time-based scenarios to address the impact of re-convergent paths on transient faults. Since considering re-convergent paths increases computational complexity, three computational approximations are proposed in this paper as a compromise to reduce the size of simulation runs as much as possible. We compared the simulation results with Monte-Carlo methods and HSPICE-based simulations to validate the proposed method. According to the comparison results on ISCAS-85 benchmarks, the proposed approach for estimating the SER exhibits an average accuracy of 95% while consuming less than 5% compared to traditional fault injection.	2026

AREA EFFICIENT/ TIMING & DELAY REDUCTION

S.No	Code	Title	Year
1	VLSI_2025_05_AE	Manipulated Lookup Table Method for Efficient High-Performance Modular Multiplier Introduces a LUT-based optimization technique for modular multiplication. Improves throughput and reduces hardware complexity for cryptographic systems.	2025
2	VLSI_2025_06_AE	Low-Power High Precision Floating-Point Divider With Bidimensional Linear Approximation Proposes a floating-point divider using bidimensional linear approximation. Reduces power consumption while maintaining high numerical accuracy.	2025
3	VLSI_2025_08_AE	Leveraging Highly Approximated Multipliers in DNN Inference Uses approximate multipliers to accelerate deep neural network inference. Trades minor accuracy loss for major energy efficiency gains.	2025
4	VLSI_2025_09_AE	Hardware Implementation of Improved Banker’s Fixed-Point Rounding Algorithm Implements an optimized Banker’s rounding technique in hardware. Enhances numerical stability in fixed-point arithmetic systems.	2025
5	VLSI_2025_10_AE	Fail-Safe Logic Design Strategies Within Modern FPGA Architectures Introduces fault-tolerant logic design techniques for FPGA systems. Improves reliability under transient and permanent faults.	2025
6	VLSI_2025_12_AE	Enhancing Delay-Driven LUT Mapping With Boolean Decomposition Optimizes LUT mapping using Boolean decomposition techniques. Reduces delay in FPGA logic synthesis.	2025
7	VLSI_2025_14_AE	Embedded Hardware-Efficient FPGA Architecture for SVM Learning and Inference Designs an FPGA-based SVM accelerator for machine learning tasks. Optimized for low power and fast inference.	2025
8	VLSI_2025_16_AE	Design and Analysis of Energy Efficient Approximate Multipliers for Image Processing and Deep Neural Network Presents approximate multiplier architectures for DNN and image processing. Reduces energy consumption with acceptable error bounds.	2025
9	VLSI_2025_18_AE	C4TERO: Configurable Cascaded Carry Chains for High Reliability TERO PUFs on FPGAs Introduces a secure hardware PUF design using configurable carry chains. Enhances FPGA-based security primitives.	2025
10	VLSI_2025_19_AE	AXI Hardware Accelerator for McEliece on FPGA Embedded Systems Implements McEliece cryptosystem accelerator using AXI interface. Optimized for post-quantum cryptography hardware.	2025
11	VLSI_2025_23_AE	A High-Resolution Calibration Method for Time-to-Digital Converter of Lidar Proposes calibration techniques for TDCs used in LiDAR systems. Improves distance measurement accuracy.	2025
12	VLSI_2025_24_AE	Energy-Efficient Syndrome Calculation Architecture for BCH Decoders Optimizes BCH decoding syndrome computation for lower energy consumption. Suitable for communication systems.	2025
13	VLSI_2025_25_AE	Low-Complexity Implementation of Real-Time Reconfigurable Low-Pass Equalizers Designs reconfigurable equalizer architecture for real-time filtering. Reduces hardware cost and improves flexibility.	2025
14	VLSI_2025_26_AE	MSDF-Based MAC for Energy-Efficient Neural Networks Introduces MSDF-based MAC unit for neural networks. Improves energy efficiency in AI hardware accelerators.	2025
15	VLSI_2025_27_AE	A Study of Signed-Digit Hybrid Stochastic Number for Arithmetic Computing Explores hybrid stochastic number systems for arithmetic computation. Enhances fault tolerance and energy efficiency.	2025
16	VLSI_2025_28_AE	Manipulated Lookup Table Method for Efficient High-Performance Modular Multiplier Improves LUT-based modular multipliers for cryptographic acceleration. Reduces latency and hardware overhead.	2025
17	VLSI_2025_29_AE	SPEED: A Scalable RISC-V Vector Processor Enabling Efficient Multiprecision DNN Inference Designs scalable RISC-V vector processor for DNN inference. Optimized for multi-precision computation workloads.	2025
18	VLSI_2025_30_AE	FPGA-Based Low-Bit and Lightweight Fast Light Field Depth Estimation Implements depth estimation for light field imaging on FPGA. Optimized for low-bit and real-time processing.	2025
19	VLSI_2025_31_AE	A Comparative Analysis of Low Temperature and Room Temperature Circuit Operation Analyzes circuit performance under different temperature conditions. Provides insights for cryogenic and standard operation designs.	2025
20	VLSI_2025_39_AE	A 4.86-pJ/b Energy-Efficient Fully Parallel Stochastic LDPC Decoder With Two-Stage Shared Memory Designs ultra-low-energy LDPC decoder using stochastic methods. Optimized for high-throughput communication systems.	2025
21	VLSI_2025_40_AE	A Pay-Per-ISE RISC-V Processor With Hardware-Assisted Orthogonal Obfuscation Implements secure RISC-V processor with hardware obfuscation techniques. Enhances IP protection in FPGA systems.	2025
22	VLSI_2025_48_AE	An Efficient Test Architecture Using Hybrid Built-In Self-Test for Processing-in-Memory Proposes hybrid BIST architecture for in-memory computing systems. Improves fault detection efficiency.	2025
23	VLSI_2025_53_AE	Strassen Multisystolic Array Hardware Architectures Implements Strassen algorithm using systolic arrays. Enhances matrix multiplication performance.	2025
24	VLSI_2025_61_AE	High-Speed Compute-Efficient Bandit Learning for Many Arms Designs hardware-efficient accelerator for multi-armed bandit learning. Optimized for decision-making AI systems.	2025
25	VLSI_2025_62_AE	Design of Low-Cost and High-Accurate 8-bit Logarithmic Floating-Point Arithmetic Circuits Proposes logarithmic floating-point arithmetic circuits. Balances accuracy and hardware efficiency.	2025
26	VLSI_2025_63_AE	An Area-Energy-Efficient 64–2048 Point FFT With Approximate Plane-Fitting Complex Multipliers Implements FFT architecture using approximate multipliers. Reduces area and energy consumption significantly.	2025
27	VLSI_2025_69_AE	A Reconfigurable Floating-Point Division and Square Root Architecture for High-Precision Softmax Designs reconfigurable divider and square root unit for Softmax computation. Optimized for AI workloads.	2025
28	VLSI_2025_81_AE	In-Memory Implementation of an Approximate Adder With Reduced Latency and Error Implements approximate adder in memory architecture. Reduces latency and improves energy efficiency.	2025
29	VLSI_2025_82_AE	RIVL: A Low-Cost SoC Agile Development Platform for Multiple RISC-V Processors Design and Verification Provides SoC platform for rapid RISC-V design and verification. Supports multi-core development and testing.	2025

HIGH SPEED DATA TRANSMISSION

S.No	Code	Title	Year
1	VLSI_2025_02_HS	Scalable 5G NR Rate-Matcher and Rate-Dematcher for Efficient Use in FPGA Accelerators Implements scalable rate matching and dematching architecture for 5G NR systems. Optimized for FPGA-based baseband acceleration with reduced latency.	2025
2	VLSI_2025_04_HS	Method to Determine Quantization-Related Parameters of the Digital-to-Time Converter in a Fractional-N Frequency Synthesizer Presents a calibration method for DTC quantization parameters in fractional-N PLL systems. Improves frequency stability and phase accuracy.	2025
3	VLSI_2025_07_HS	Longest-First Search Using Bloom Filter: Algorithm and FPGA Implementation Proposes Bloom filter-based longest-first search algorithm with FPGA implementation. Reduces memory overhead and improves search speed.	2025
4	VLSI_2025_13_HS	Enhanced Fetal Arrhythmia Classification by Non-Invasive ECG Using Cross Domain Feature and Spatial Differences Windows Information Improves fetal ECG classification using cross-domain feature extraction. Enhances detection accuracy for arrhythmia analysis.	2025
5	VLSI_2025_15_HS	Design and HDL Implementation of Pulse-Arrival-Time Estimation Using XGBoost Regression With Tree-Recycling Architecture Implements XGBoost-based pulse arrival time estimation in hardware. Optimized with tree-recycling for area and power reduction.	2025
6	VLSI_2025_20_HS	Analysis and Mitigation of Excess Phase Noise and Spurs in Digital-to-Time-Converter-Enhanced Fractional-N Frequency Synthesizers Analyzes phase noise in DTC-enhanced synthesizers and proposes mitigation techniques for cleaner frequency generation.	2025
7	VLSI_2025_21_HS	Advanced Quantization Schemes to Increase Accuracy, Reduce Area, and Lower Power Consumption in FFT Architectures Introduces optimized quantization methods for FFT architectures. Balances precision, area, and power efficiency.	2025
8	VLSI_2025_22_HS	A Hybrid Speech Enhancement Technique Based on Discrete Wavelet Transform and Spectral Subtraction Combines DWT and spectral subtraction for improved speech enhancement. Reduces noise while preserving speech clarity.	2025
9	VLSI_2025_33_HS	ReAdapt-II: Energy-Quality Optimizations for VLSI Adaptive Filters Through Automatic Reconfiguration and Built-In Iterative Dividers Proposes adaptive filter optimization with dynamic reconfiguration. Enhances energy efficiency and signal quality trade-offs.	2025
10	VLSI_2025_37_HS	FANNS: An FPGA-Based Approximate Nearest-Neighbor Search Accelerator Accelerates nearest-neighbor search using FPGA-based approximate computing. Improves search speed for large datasets.	2025
11	VLSI_2025_41_HS	Adjacent Channel Weight Dependable RLS Adaptive Filter With VMD Based Artifact Removal Mechanism in Fetal ECG Separation Enhances fetal ECG separation using adaptive RLS filtering and VMD-based artifact removal. Improves biomedical signal clarity.	2025
12	VLSI_2025_42_HS	An Energy-Efficient Configurable 1-D CNN-Based Multi-Lead ECG Classification Coprocessor for Wearable Cardiac Monitoring Devices Implements CNN-based ECG classifier for wearable devices. Focuses on low power and real-time cardiac monitoring.	2025
13	VLSI_2025_43_HS	A Hybrid Speech Enhancement Technique Based on Discrete Wavelet Transform and Spectral Subtraction Second optimized implementation of hybrid speech enhancement. Improves noise suppression efficiency.	2025
14	VLSI_2025_44_HS	An Area-Efficient VLSI Architecture for High-Throughput Computation of the 2-D DWT Designs efficient hardware architecture for 2-D DWT computation. Optimized for image processing applications.	2025
15	VLSI_2025_46_HS	AdAM: Adaptive Approximate Multiplier for Fault Tolerance in DNN Accelerators Introduces adaptive approximate multipliers for neural networks. Improves fault tolerance in DNN accelerators.	2025
16	VLSI_2025_48_HS	Fault Bounding On-Die BCH Codes for Improving Reliability of System ECC Enhances ECC reliability using on-die BCH coding. Improves fault tolerance in memory systems.	2025
17	VLSI_2025_50_HS	A 7.4–9.2-GHz Fractional-N Differential Sampling PLL Based on Phase-Domain and Voltage-Domain Hybrid Calibration Presents PLL architecture with hybrid calibration techniques. Improves frequency stability and phase noise reduction.	2025
18	VLSI_2025_51_HS	Area-Efficient Pipeline Architecture for Serial Real-Valued Fast Fourier Transform Implements pipelined FFT architecture optimized for area efficiency and real-time processing.	2025
19	VLSI_2025_52_HS	ATEP: An Asynchronous Timing Error Prediction Circuit With Adaptive Voltage and Frequency Scaling Predicts timing errors in asynchronous circuits. Enables adaptive voltage-frequency scaling for energy saving.	2025
20	VLSI_2025_55_HS	An Area-Efficient VLSI Architecture for High-Throughput Computation of the 2-D DWT Second optimized 2-D DWT architecture focusing on throughput and reduced area usage.	2025
21	VLSI_2025_59_HS	A 25-GHz PLL Achieving 8-ns Phase-Shifting Time With Double-Path Modulation Scheme High-frequency PLL design with fast phase shifting. Suitable for high-speed communication systems.	2025
22	VLSI_2025_64_HS	A Sub-0.9-ps Static Phase Offset 500 MHz Delay-Locked Loop With a Large Gain Phase Detector Designs ultra-precise DLL with sub-picosecond phase offset. Enhances clock synchronization accuracy.	2025
23	VLSI_2025_66_HS	Toward High-Performance Network Coding: FPGA Acceleration With Bounded-Value Generators Accelerates network coding using FPGA-based bounded-value generation. Improves data transmission efficiency.	2025
24	VLSI_2025_67_HS	An Implementation Method for 100% ASK Modulation Applied to NFC Tags Implements ASK modulation technique for NFC communication. Optimized for low-power tag systems.	2025
25	VLSI_2025_68_HS	A Low-Latency Feed-Forward Architecture for Image Filtering via Row-by-Row Processing Proposes feed-forward filtering architecture for image processing. Reduces latency in row-wise computation.	2025
26	VLSI_2025_72_HS	Optimized Algorithms for FPGA Implementation of PDCCH Chain for 5G-NR Base Station Optimizes PDCCH chain for 5G base stations. Improves FPGA throughput and decoding efficiency.	2025
27	VLSI_2025_73_HS	Design and Implementation of a Low-Complexity Continuously Variable Digital Filter Using a Novel Farrow-Equivalent-Newton Structure-Based Fractional Delay Filter Implements variable digital filter using Farrow-based structure. Enhances flexibility and reduces computation cost.	2025
28	VLSI_2025_83_HS	VLSI Architecture Design for Compact Shortcut Denoising Autoencoder Neural Network of ECG Signal Designs compact autoencoder architecture for ECG signal denoising. Optimized for wearable health devices.	2025

VLSI Design of Image, Video and Audio Processing

S.No	Code	Title	Year
1	VLSI_2025_03_IM	NN2FPGA: Optimizing CNN Inference on FPGAs With Binary Integer Programming Proposes a binary integer programming approach to optimize CNN mapping on FPGA hardware. Improves resource allocation, reduces latency, and enhances inference efficiency.	2025
2	VLSI_2025_32_IM	Detect and Replace: Efficient Soft Error Protection of FPGA-Based CNN Accelerators Introduces a detect-and-replace mechanism for mitigating soft errors in FPGA-based CNN accelerators. Enhances reliability with minimal hardware overhead.	2025
3	VLSI_2025_34_IM	Research on Hardware Acceleration of Traffic Sign Recognition Based on Spiking Neural Network and FPGA Platform Develops FPGA-based acceleration for spiking neural networks used in traffic sign recognition. Achieves low-power and real-time classification performance.	2025
4	VLSI_2025_36_IM	An Interpolation-Free Fractional Motion Estimation Algorithm and Hardware Implementation for VVC Proposes an interpolation-free motion estimation method for VVC video coding. Reduces computational complexity while maintaining encoding quality.	2025
5	VLSI_2025_65_IM	Design of a Low-Power Analog Integrated Deep Convolutional Neural Network Presents an analog CNN architecture designed for ultra-low-power deep learning inference. Suitable for edge AI applications.	2025
6	VLSI_2025_70_IM	Hardware-Efficient Architecture for Multiple Quantized Gaussian Noise Generation Introduces an efficient hardware design for generating quantized Gaussian noise. Optimized for stochastic computing and ML accelerators.	2025
7	VLSI_2025_74_IM	MBS: A High-Precision Approximation Method for Softmax and Efficient Hardware Implementation Proposes an approximation method for Softmax computation in hardware. Reduces complexity while maintaining high numerical precision.	2025
8	VLSI_2025_75_IM	EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language Models Designs a hybrid CPU-FPGA accelerator for efficient LLM inference at the edge. Reduces latency and energy consumption significantly.	2025
9	VLSI_2025_78_IM	A High-Throughput FPGA Accelerator for Lightweight CNNs With Balanced Dataflow Proposes a balanced dataflow architecture for lightweight CNN acceleration on FPGA. Improves throughput and reduces bottlenecks.	2025
10	VLSI_2025_79_IM	A Novel Transform Accelerator With Fast Kernel Selection and Efficient Transform Circuit Introduces a transform accelerator with dynamic kernel selection. Enhances processing speed for signal and image transformations.	2025
11	VLSI_2025_80_IM	Rational-Exponent Filters with Applications to Generalized Exponent Filters Proposes rational-exponent filter structures for flexible signal processing. Enables generalized filtering with reduced computational cost.	2025

+91 9566475911 | +91 9962588976 | projects@lemenizinfotech.com

IEEE Transactions on VLSI 2025 Research Papers

IEEE Transactions on VLSI 2025 Research Papers

Why Us?

IEEE Project Technology

LOW POWER

AREA EFFICIENT/ TIMING & DELAY REDUCTION

HIGH SPEED DATA TRANSMISSION

VLSI Design of Image, Video and Audio Processing

2023 © Copyrights IEEE Master