What is SRAM Soft Error Detection?
A study conducted by IBM over 20 years ago showed that high-energy nuclear particles cause electronic component "soft" errors. There are two sources of nuclear particles: decaying radioactive atoms existing in trace amounts in all materials and extraterrestrial radioactive cosmic rays cascading through the Earth's atmosphere. When these particles hit a silicon nucleus, it explodes into fragments that generate a stream of electrical charges that can upset any circuit. Any circuit is vulnerable to soft fails due to cosmic rays.
Soft Errors vs. Hard Errors
JEDEC defines soft errors as, " An erroneous output signal from a latch or memory cell that can be corrected by performing one or more normal functions of the device containing the latch or memory cell. As commonly used, the term refers to an error caused by radiation or electromagnetic pulses and not to an error associated with a physical defect introduced during the manufacturing process." Soft errors occur randomly and cause no permanent damage to the memory device.
Hard errors on the other hand are those that "keep recurring as a result of hardware or physical defects on the memory or storage device. Hard memory errors are commonly caused by operating a system beyond the memory's speed capacity and subjecting the system to charges of static electricity. Other causes include environmental factors such as temperature, shock/vibration, electrical/voltage stress or physical stress. Mishandling, aging, or manufacturing defects can also affect the reliability of hardware components. Hard errors are usually permanent and require module replacement."
Causes and Impact of Soft Errors
A soft error, also known as a single-event upset (SEU), is caused by ionizing radiation from cosmic rays and alpha particles. Cosmic rays such as neutrons are high-energy particles from space, which enter the Earth's atmosphere and interact with the air, while alpha particles are from traces of contaminants or radioactive materials in memory chip packages. When these highly charged particles penetrate a memory cell, the state of a bit changes (flips). If the charge is big enough, it can cause multiple cells or bits to be upset.
In systems requiring the highest levels of reliability, alpha particles can be reduced by applying shielding materials or using components that are insensitive to radiation. Neutrons however, can't be shielded – they can penetrate even 5 ft. of concrete!
Figure 1. When radioactive particles such as neutrons from cosmic rays penetrate a silicon nucleus from the substrate, the resulting nuclear reaction fragments the silicon nucleus. The fragments (alpha particles) upset the sensitive region, causing the memory cell to "flip" or change value.
Soft errors are typically resolved by a system reboot. However, because they are random and temporary, it is possible that they go undetected, thus accumulate without the proper action being taken. Below are some effects of soft errors:
Functional interruptions
Read/write errors, data corruption
Device hanging or stops working but resumes after a power cycle
Bricking, or the device stops working even after a power cycle
Data written in the wrong location
Data writing takes longer
Figure 2. Soft errors, also called single-event upsets, occur when highly charged particles such as neutrons and radioactive materials like alpha particles from the environment strike sensitive regions of an electronic device and disrupt its normal operation.
SRAM Vulnerability
The study conducted by Intel showed that electronic failure from cosmic rays increased at elevated altitudes. These days, however, even ground-based devices are at great risk, particularly SRAM-based devices that have high sensitivities to radiation effects. Static Random Access Memory (SRAM) is used as memory cache by the processor. Unlike Dynamic Random Access Memory (DRAM), which requires being constantly refreshed (recharged with power) to keep data, SRAM can store data without frequent refreshing. This means that the processor does not have to wait to access data on the SRAM, resulting in speedier processing. SRAM is very fast, and hence also more expensive than DRAM. The following are common reasons why SRAM is susceptible to soft errors.
Lower Supply Voltage. SRAM voltage goes down with every process generation, resulting in lower cell capacitance (the ability of a cell to store electric charge). This makes the memory cell more vulnerable to getting struck by an alpha particle or cosmic ray.
Scaling. The trend to use more SRAM bits to cut latencies makes SRAM arrays the densest memory on a chip and increases exposure to charged particles.
Packaging. Integrated chips are packaged using materials with small amounts of radioactive contaminants. Trace amounts of Uranium and Thorium for example are found in mold compounds and assembly materials. If the ideal material purity is not maintained, alpha particles can cause soft errors.
The ATP e.MMC SRAM Soft Error Detector and Recovery Mechanism
For some personal devices, the impact of soft errors may be insignificant. However, for mission-critical tasks like financial transactions, traffic control, security/surveillance, even minor glitches can have disastrous effects. Unattended, soft errors can lead to function loss, system failure, and other adverse effects.
The ATP e.MMC advanced SRAM Soft Error Detector and Recovery mechanism maximizes data integrity by providing timely error detection, logging, and configurable action to address the error (configuration is predetermined by the customer with ATP and cannot be changed on the field). If, after assessing the risk, the user opts to continue running the device, an error log and a system reboot should be performed to avoid unpredictable events that could damage the system, or worse, cause personal safety risks in critical autonomous applications.
The following figures show how the e.MMC SRAM Soft Error Detector and Recovery mechanism works.
(Note: Steps may vary depending on pre-determined configuration.)
Figure 3. As soon as an error is detected, the event is logged in the flash. The system can be alerted to handle the error and the firmware may be stopped. The ATP SRAM Soft Error Detector and Recovery Mechanism attempts to correct the error by initiating a system reboot.
Figure 4. If the system reboot fails, the error is considered a "hard error" and the e.MMC should be replaced. If the system reboot succeeds, the error is confirmed to be a "soft error." This means that the reboot has resolved the error and operation can continue with the correct data.
Soft errors can corrupt data and cause systems to malfunction or fail. SRAM is particularly vulnerable; hence, it is important to be able to detect errors that may not have been detected by built-in error-correcting codes. Data integrity and high reliability is of great importance to mission-critical applications such as networking, military, health care, financial services, and more. It is therefore important to ensure that soft errors are not left to worsen as they could damage not only very important data but also physical assets.
- |
- +1 赞 0
- 收藏
- 评论 0
本文由涂抹转载自ATP Blogs,原文标题为:What is SRAM Soft Error Detection?,本站所有转载文章系出于传递更多信息之目的,且明确注明来源,不希望被转载的媒体或个人可与我们联系,我们将立即进行删除处理。
相关研发服务和供应服务
相关推荐
Benefits of ATP‘s Dynamic Over-Provisioning for SSDs
The ATP Dynamic OP solution breaks the mold by giving users the freedom and flexibility to configure SSDs according to the actual workloads of specific applications. Through ATP‘s simple software, users can freely and easily set the OP percentage from 7% to 14%, 28%, and 50%.
ATP(华腾国际)DRAM存储模块和NAND闪存产品选型指南(英文)
目录- Company Profile Segment Challenges and Solutions Thermal Solutions Endurance Solutions Security Solutions CFexpress & USB 3.0 Value Line SSDs DDR5 DRAM SOLUTIONS FLASH SOLUTIONS Flash Products Naming Rule Solutions & Technologies Flash Technology Overview table Complete Flash Spec Overview & Product Dimensions
型号- A750PI,E650SC SERIES,S600SC,B800PI,S750 SERIES,S600SI,S600SCA,E750PC SERIES,B600SC,N700PC,S700SC,E650SC,A750 SERIES,E600VC,S800PI,A750PI SERIES,I800PI,A600VC,A650SI,A650SC,N700 SERIES,S650SI,N750,N750PI,A800PI,A700PI,N700SI,N650 SERIES,E600SAA,N700SC,A750,N600SC,A600VC SERIES,E600SA,E650SI,E650SI SERIES,N750 SERIES,E700PIA,TR-03153,N600SI,S650,S650SC,E700PAA,N650SIA,E600SI,B600SC SERIES,S750SC,S600SIA,I700SC,N650SI,N600VI,E600SIA,E750PI,N650SC,N750PI SERIES,N600VC,I600SC,E750PC,S700PI,A650 SERIES,N650,N600 SERIES,N600VC SERIES,S650 SERIES,A650,AES-256,E700PI,A600SI,E750PI SERIES,N700PI,E700PA,S750,S750PI,E700PC,A600SC
ATP(华腾国际)固态硬盘选型指南
描述- Since 1991, we have consistently distinguished ourselves as one of the world’s leading original equipment manufacturers (OEM) of high-performance, high-quality and high-endurance NAND flash products and DRAM modules.
型号- A750PI,S600SC,B800PI,S750 SERIES,S600SI,B600SC,N700PC,A750 SERIES,E650SC,N601,N651SI,A600VI,S800PI,E600VC,I800PI,A600VC,A650SI,A650SC,N651SC,N750,S650SI,A600VI SERIES,N750PI,N651SIE,A800PI,N601 SERIES,A700PI,N651SIA,N650 SERIES,E600SAA,A750,A600VC SERIES,N600SC,E600SA,E650SI,N750 SERIES,E700PIA,N600SI,S650,S650SC,N651SI SERIES,E700PAA,B600SC SERIES,E600SI,N600VI SERIES,I700SC,N600VI,N650SI,E600SIA,N650SC,E750PI,N600VC,I600SC,E750PC,N651,S700PI,A650 SERIES,N650,N751PI,S700PC,N600VC SERIES,N651 SERIES,S650 SERIES,A650,N601SC,S750PC,E700PI,A600SI,N700PI,E700PA,S750,S750PI,E700PC,N651SCE,A600SC
e.MMC MLC Solutions The Global Leader in Specialized Storage and Memory Solutions
型号- E700PI,E600SA,E700PIA,E600SIA,E700PA,E600SI,E600SC,E700PAA,E600SAA,E700PC
ATP Industrial/automotive-grade e.MMC Designed to High-performance Data Transfer, Achieving with up to 128 GB Of Memory
e.MMC is a universal data storage and communication media and is typically used in portable applications. Its small footprint makes it a preferred choice for mobile devices such as smartphones, tablets and laptops. Increasingly, e.MMC is also gaining wide acceptance in embedded systems for industrial and automotive applications, such as infotainment systems, aerospace/avionics, Internet of Things (IoT), industrial automation, and other applications.
Extensive Reliability Tests Prove the Suitability of 3D e.MMC MLC and SLC Mode for Demanding Industrial Automotive Application
Based on the results of the reliability tests, the ATP e.MMC with 3D SLC Mode and MLC NAND flash deliver reliable performance for industrial and automotive applications requiring high density and long endurance.
ATP’s Obsolescence Management Strategy Drives Long-Term Supply Stability and Flexibility
Despite its tiny size, which is smaller than a typical postage stamp, the ATP embedded multimedia card or e.MMC packs many premium features for industrial and automotive applications. The ATP e.MMC provides ultra-high endurance and trustworthy performance even under the most rigid operating scenarios.
Showdown Between Consumer & Industrial SSDs
ATP‘s wide range of industrial SSDs is designed for long, reliable performance under heavy workloads and even in the harshest and most demanding environments. As a true manufacturer, ATP manages the stages of the manufacturing process and conducts thorough testing from wafer/IC level to module and product level to ensure that only products of the highest quality are produced from its facilities.
ATP Electronics to Showcase Latest TSE, PCIe® Gen 4, High-Endurance SSD Solutions for Focused Segments at Embedded World 2023
Also on display at Embedded World 2023 are ATP’s wide range of legacy and latest-generation NAND flash storage solutions in different form factors, endurance ratings, thermal, operating temperature, firmware, and customization options to meet the requirements of industrial and automation systems.
ATP Industrial e.MMC Employs Configurable SRAM Soft Error Detector and Recovery Mechanism, Robust Durability
The ATP industrial e.MMC (Embedded MultiMediaCard) solution is designed to meet the high-performance data transfer and storage requirements of industrial and embedded applications in space-constrained systems requiring rugged reliability and very high endurance.
ATP Electronics Showcases Specialized Storage & Memory Solutions at Electronica 2022
ATP Electronics will be on-site at Electronica 2022. On display at ATP’s booth are customizable thermal, high-endurance, and security solutions as well as the latest showcase of ATP’s industrial memory and storage solutions, such as PCIe Gen. 4 NVMe SSDs, CFexpress cards, and DDR5 memory modules.
ATP NVMe BGA pSLC SSD安全保护及加密功能特点
硬件写保护* 写保护功能将ATP NVMe BGA SSD置于“只读”模式,以防止数据写入设备,并保护重要数据不被意外删除、移动或修改。通过在控制器印电路板(PCB)上的通用输入/输出(GPIO)信号引脚的特定引脚上放置跳线,在存储设备上启用写保护。硬件快速擦除*对于特定应用,主机可以使用GPIO连接器触发“擦除数据”行动。
【经验】DRAM的可靠性受什么因素影响?ATP DRAM为何具备高可靠性?
ATP的DRAM模块经过两个级别的测试,以确保最大的可靠性:1、先进的IC级集成电路测试;2、增强的模块水平测试:老化测试(TDBI)和自动测试设备(ATE)确保模块达到甚至超过合格参数。同时具有工业额定温度,采用密封涂层,使用使用抗硫电阻器(基于项目)和厚度为30µm的金手指镀层。
OMNIVISION Announces New High-Resolution Image Sensors for Industrial and Consumer Security Surveillance Cameras
OMNIVISION announced the addition of two new image sensors to its comprehensive family of solutions for industrial and consumer security surveillance cameras.
电子商城
服务
可定制ATP TE Cooler的冷却功率:40~200W;运行电压:12/24/48V(DC);控温精度:≤±0.1℃; 尺寸:冷面:20*20~500*300;热面:60*60~540*400 (长*宽;单位mm)。
最小起订量: 1 提交需求>
登录 | 立即注册
提交评论