{"id":14357,"date":"2026-01-08T15:14:30","date_gmt":"2026-01-08T07:14:30","guid":{"rendered":"https:\/\/www.oscoo.com\/?p=14357"},"modified":"2026-01-08T15:14:42","modified_gmt":"2026-01-08T07:14:42","slug":"ai-ssd-a-storage-revolution-born-for-ai-workloads","status":"publish","type":"post","link":"https:\/\/www.oscoo.com\/ar\/news\/ai-ssd-a-storage-revolution-born-for-ai-workloads\/","title":{"rendered":"\u0642\u0631\u0635 SSD \u0644\u0644\u0630\u0643\u0627\u0621 \u0627\u0644\u0627\u0635\u0637\u0646\u0627\u0639\u064a: \u062b\u0648\u0631\u0629 \u062a\u062e\u0632\u064a\u0646 \u0648\u0644\u062f\u062a \u0644\u0623\u0639\u0628\u0627\u0621 \u0639\u0645\u0644 \u0627\u0644\u0630\u0643\u0627\u0621 \u0627\u0644\u0627\u0635\u0637\u0646\u0627\u0639\u064a"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"14357\" class=\"elementor elementor-14357\" data-elementor-post-type=\"post\">\n\t\t\t\t<div class=\"elementor-element elementor-element-c578e57 blog-post-container e-flex e-con-boxed e-con e-parent\" data-id=\"c578e57\" data-element_type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-0fe98f1 intro elementor-widget elementor-widget-text-editor\" data-id=\"0fe98f1\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;WDrS-1767690724049&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;pAzD-1767690724048&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;For\u00a0a\u00a0long\u00a0time,\u00a0the\u00a0role\u00a0of\u00a0SSDs\u00a0was\u00a0relatively\u00a0clear.\u00a0Its\u00a0core\u00a0task\u00a0was\u00a0to\u00a0replace\u00a0mechanical\u00a0hard\u00a0drives,\u00a0improving\u00a0the\u00a0overall\u00a0responsiveness\u00a0and\u00a0data\u00a0throughput\u00a0of\u00a0the\u00a0system.\u00a0Whether\u00a0in\u00a0consumer\u00a0computers,\u00a0enterprise\u00a0servers,\u00a0or\u00a0data\u00a0centers,\u00a0the\u00a0technological\u00a0evolution\u00a0of\u00a0SSDs\u00a0basically\u00a0revolved\u00a0around\u00a0several\u00a0fixed\u00a0goals:\u00a0higher\u00a0sequential\u00a0read\/write\u00a0speeds,\u00a0larger\u00a0capacity,\u00a0lower\u00a0cost,\u00a0and\u00a0better\u00a0reliability.\u00a0However,\u00a0with\u00a0the\u00a0rapid\u00a0development\u00a0of\u00a0artificial\u00a0intelligence,\u00a0especially\u00a0large\u00a0models\u00a0and\u00a0deep\u00a0learning,\u00a0the\u00a0workloads\u00a0faced\u00a0by\u00a0storage\u00a0systems\u00a0have\u00a0changed\u00a0significantly.\u00a0In\u00a0AI\u00a0scenarios,\u00a0data\u00a0is\u00a0no\u00a0longer\u00a0simply\u00a0\\&quot;read\u00a0in,\u00a0processed,\u00a0and\u00a0written\u00a0out.\\&quot;\u00a0Instead,\u00a0it\u00a0exhibits\u00a0several\u00a0new\u00a0characteristics:&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>For a long time, the role of <a href=\"\/news\/what-is-an-ssd-the-complete-guide\/\"><span style=\"color: #00ccff;\">SSDs<\/span><\/a> was relatively clear. Its core task was to replace <a href=\"\/news\/ssd-vs-hdd-understanding-the-differences\/\"><span style=\"color: #00ccff;\">mechanical hard drives<\/span><\/a>, improving the overall responsiveness and data throughput of the system. Whether in consumer computers, enterprise servers, or data centers, the technological evolution of SSDs basically revolved around several fixed goals: higher sequential read\/write speeds, larger capacity, lower cost, and better reliability. However, with the rapid development of artificial intelligence, especially large models and deep learning, the workloads faced by storage systems have changed significantly. In AI scenarios, data is no longer simply &#8220;read in, processed, and written out.&#8221; Instead, it exhibits several new characteristics:<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-efe5feb elementor-widget elementor-widget-image\" data-id=\"efe5feb\" data-element_type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"1400\" height=\"578\" src=\"https:\/\/www.oscoo.com\/wp-content\/uploads\/2026\/01\/AI-SSD-article-header-img-1400.webp\" class=\"attachment-full size-full wp-image-14390\" alt=\"\" srcset=\"https:\/\/www.oscoo.com\/wp-content\/uploads\/2026\/01\/AI-SSD-article-header-img-1400.webp 1400w, https:\/\/www.oscoo.com\/wp-content\/uploads\/2026\/01\/AI-SSD-article-header-img-1400-300x124.webp 300w, https:\/\/www.oscoo.com\/wp-content\/uploads\/2026\/01\/AI-SSD-article-header-img-1400-1024x423.webp 1024w, https:\/\/www.oscoo.com\/wp-content\/uploads\/2026\/01\/AI-SSD-article-header-img-1400-768x317.webp 768w, https:\/\/www.oscoo.com\/wp-content\/uploads\/2026\/01\/AI-SSD-article-header-img-1400-18x7.webp 18w, https:\/\/www.oscoo.com\/wp-content\/uploads\/2026\/01\/AI-SSD-article-header-img-1400-500x206.webp 500w, https:\/\/www.oscoo.com\/wp-content\/uploads\/2026\/01\/AI-SSD-article-header-img-1400-800x330.webp 800w\" sizes=\"auto, (max-width: 1400px) 100vw, 1400px\" title=\"\">\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-26a4110 elementor-widget elementor-widget-text-editor\" data-id=\"26a4110\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;LopP-1767690721642&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;3XMn-1767690721643&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;First,\u00a0data\u00a0volume\u00a0has\u00a0exploded.\u00a0Whether\u00a0it's\u00a0the\u00a0massive\u00a0raw\u00a0data\u00a0used\u00a0in\u00a0the\u00a0model\u00a0training\u00a0phase\u00a0or\u00a0the\u00a0model\u00a0parameters\u00a0and\u00a0vector\u00a0databases\u00a0that\u00a0need\u00a0repeated\u00a0access\u00a0during\u00a0the\u00a0inference\u00a0phase,\u00a0the\u00a0data\u00a0size\u00a0far\u00a0exceeds\u00a0that\u00a0of\u00a0traditional\u00a0applications.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><strong>First, data volume has exploded.<\/strong> Whether it&#8217;s the massive raw data used in the model training phase or the model parameters and vector databases that need repeated access during the inference phase, the data size far exceeds that of traditional applications.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-df67385 elementor-widget elementor-widget-text-editor\" data-id=\"df67385\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;LopP-1767690721642&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;3XMn-1767690721643&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;First,\u00a0data\u00a0volume\u00a0has\u00a0exploded.\u00a0Whether\u00a0it's\u00a0the\u00a0massive\u00a0raw\u00a0data\u00a0used\u00a0in\u00a0the\u00a0model\u00a0training\u00a0phase\u00a0or\u00a0the\u00a0model\u00a0parameters\u00a0and\u00a0vector\u00a0databases\u00a0that\u00a0need\u00a0repeated\u00a0access\u00a0during\u00a0the\u00a0inference\u00a0phase,\u00a0the\u00a0data\u00a0size\u00a0far\u00a0exceeds\u00a0that\u00a0of\u00a0traditional\u00a0applications.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;myir-1767690721644&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;E2dF-1767690721645&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Second,\u00a0the\u00a0access\u00a0pattern\u00a0has\u00a0fundamentally\u00a0changed.\u00a0AI\u00a0workloads\u00a0often\u00a0involve\u00a0a\u00a0large\u00a0number\u00a0of\u00a0small,\u00a0random,\u00a0high-concurrency\u00a0data\u00a0block\u00a0accesses,\u00a0rather\u00a0than\u00a0the\u00a0large-block\u00a0sequential\u00a0reads\u00a0and\u00a0writes\u00a0that\u00a0traditional\u00a0storage\u00a0systems\u00a0are\u00a0better\u00a0at\u00a0handling.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><strong>Second, the access pattern has fundamentally changed.<\/strong> AI workloads often involve a large number of small, random, high-concurrency data block accesses, rather than the large-block sequential reads and writes that traditional storage systems are better at handling.<\/div><\/article><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b50d377 elementor-widget elementor-widget-text-editor\" data-id=\"b50d377\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;qbkf-1767690721646&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;x9Kx-1767690721647&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Third,\u00a0storage\u00a0has\u00a0begun\u00a0to\u00a0directly\u00a0impact\u00a0computational\u00a0efficiency.\u00a0In\u00a0AI\u00a0servers,\u00a0the\u00a0computing\u00a0power\u00a0of\u00a0GPUs\u00a0or\u00a0other\u00a0accelerators\u00a0is\u00a0growing\u00a0very\u00a0rapidly.\u00a0If\u00a0the\u00a0storage\u00a0system\u00a0cannot\u00a0keep\u00a0up\u00a0with\u00a0the\u00a0data\u00a0supply\u00a0speed,\u00a0\\&quot;computing\u00a0idling\\&quot;\u00a0occurs,\u00a0actually\u00a0reducing\u00a0the\u00a0overall\u00a0system\u00a0efficiency.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><strong>Third, storage has begun to directly impact computational efficiency.<\/strong> In AI servers, the computing power of GPUs or other accelerators is growing very rapidly. If the storage system cannot keep up with the data supply speed, &#8220;computing idling&#8221; occurs, actually reducing the overall system efficiency.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ace3c98 elementor-widget elementor-widget-text-editor\" data-id=\"ace3c98\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;ls8K-1767690721648&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;gqnX-1767690721649&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;It\u00a0is\u00a0against\u00a0this\u00a0backdrop\u00a0that\u00a0the\u00a0concept\u00a0of\u00a0AI\u00a0SSD\u00a0began\u00a0to\u00a0be\u00a0frequently\u00a0mentioned\u00a0and\u00a0gradually\u00a0moved\u00a0from\u00a0concept\u00a0to\u00a0specific\u00a0products\u00a0and\u00a0technical\u00a0roadmaps.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>It is against this backdrop that the concept of AI SSD began to be frequently mentioned and gradually moved from concept to specific products and technical roadmaps.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f18f57d elementor-widget elementor-widget-heading\" data-id=\"f18f57d\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">What is AI SSD?<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-940af99 elementor-widget elementor-widget-text-editor\" data-id=\"940af99\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;p7t1-1767690757473&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;3uYc-1767690757474&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;To\u00a0many\u00a0newcomers\u00a0to\u00a0this\u00a0concept,\u00a0\\&quot;AI\u00a0SSD\\&quot;\u00a0can\u00a0easily\u00a0be\u00a0misunderstood\u00a0as\u00a0a\u00a0product\u00a0that\u00a0integrates\u00a0AI\u00a0algorithms\u00a0inside\u00a0the\u00a0solid-state\u00a0drive,\u00a0capable\u00a0of\u00a0\\&quot;learning\\&quot;\u00a0or\u00a0\\&quot;optimizing\\&quot;\u00a0on\u00a0its\u00a0own.\u00a0However,\u00a0according\u00a0to\u00a0the\u00a0mainstream\u00a0industry\u00a0definition,\u00a0this\u00a0understanding\u00a0is\u00a0not\u00a0accurate.\u00a0The\u00a0core\u00a0of\u00a0AI\u00a0SSD\u00a0is\u00a0not\u00a0about\u00a0\\&quot;whether\u00a0there\u00a0is\u00a0AI\u00a0inside\u00a0the\u00a0SSD,\\&quot;\u00a0but\u00a0about\u00a0\\&quot;whether\u00a0the\u00a0SSD\u00a0is\u00a0born\u00a0for\u00a0AI\u00a0workloads.\\&quot;\u00a0More\u00a0precisely,\u00a0an\u00a0AI\u00a0SSD\u00a0is\u00a0a\u00a0type\u00a0of\u00a0solid-state\u00a0storage\u00a0device\u00a0deeply\u00a0optimized\u00a0for\u00a0AI\u00a0training,\u00a0inference,\u00a0and\u00a0data\u00a0service\u00a0scenarios.\u00a0This\u00a0optimization\u00a0includes\u00a0both\u00a0hardware-level\u00a0architecture\u00a0design\u00a0and\u00a0changes\u00a0in\u00a0firmware,\u00a0protocols,\u00a0and\u00a0system\u00a0collaboration\u00a0methods.\u00a0In\u00a0terms\u00a0of\u00a0functional\u00a0positioning,\u00a0AI\u00a0SSD\u00a0is\u00a0still\u00a0a\u00a0storage\u00a0device;\u00a0it\u00a0will\u00a0not\u00a0replace\u00a0the\u00a0computational\u00a0role\u00a0of\u00a0GPUs\u00a0or\u00a0CPUs.\u00a0But\u00a0its\u00a0goal\u00a0is\u00a0very\u00a0clear:\u00a0to\u00a0minimize\u00a0the\u00a0performance\u00a0limitations\u00a0imposed\u00a0by\u00a0storage\u00a0on\u00a0AI\u00a0systems.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>To many newcomers to this concept, &#8220;AI SSD&#8221; can easily be misunderstood as a product that integrates AI algorithms inside the solid-state drive, capable of &#8220;learning&#8221; or &#8220;optimizing&#8221; on its own. However, according to the mainstream industry definition, this understanding is not accurate. The core of AI SSD is not about &#8220;whether there is AI inside the SSD,&#8221; but about &#8220;whether the SSD is born for AI workloads.&#8221; More precisely, an AI SSD is a type of solid-state storage device deeply optimized for AI training, inference, and data service scenarios. This optimization includes both hardware-level architecture design and changes in firmware, protocols, and system collaboration methods. In terms of functional positioning, AI SSD is still a storage device; it will not replace the computational role of GPUs or CPUs. But its goal is very clear: to minimize the performance limitations imposed by storage on AI systems.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4f4f865 elementor-widget elementor-widget-heading\" data-id=\"4f4f865\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Practical Bottlenecks of Traditional SSDs in AI Scenarios<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9feaac8 elementor-widget elementor-widget-text-editor\" data-id=\"9feaac8\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;8tPk-1767690793743&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;Jhxw-1767690793744&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;To\u00a0understand\u00a0the\u00a0value\u00a0of\u00a0AI\u00a0SSD,\u00a0one\u00a0must\u00a0first\u00a0be\u00a0clear\u00a0about\u00a0the\u00a0problems\u00a0traditional\u00a0SSDs\u00a0face\u00a0in\u00a0AI\u00a0scenarios.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>To understand the value of AI SSD, one must first be clear about the problems traditional SSDs face in AI scenarios.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-eaa7bf5 elementor-widget elementor-widget-shortcode\" data-id=\"eaa7bf5\" data-element_type=\"widget\" data-widget_type=\"shortcode.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div class=\"elementor-shortcode\"><a href=\"\/oscoo-leading-ssd-manufacturer\/\"><img decoding=\"async\" src=\"\/wp-content\/uploads\/2025\/09\/oscoo-2b-banner-1400x475-1.webp\" style=\"widht:100%;\" alt=\"\" title=\"\"><\/a><\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-796b195 elementor-widget elementor-widget-text-editor\" data-id=\"796b195\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;Fu3n-1767690802457&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;9itg-1767690802462&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;ordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;Y0I8-1767690802456&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Latency\u00a0becomes\u00a0a\u00a0more\u00a0critical\u00a0metric\u00a0than\u00a0bandwidth.&quot;,&quot;marks&quot;:[{&quot;type&quot;:&quot;bold&quot;}]},{&quot;text&quot;:&quot;\u200b\u00a0In\u00a0traditional\u00a0applications,\u00a0sequential\u00a0read\/write\u00a0bandwidth\u00a0is\u00a0often\u00a0an\u00a0important\u00a0indicator\u00a0for\u00a0evaluating\u00a0SSD\u00a0performance,\u00a0such\u00a0as\u00a07GB\/s\u00a0or\u00a014GB\/s.\u00a0But\u00a0in\u00a0AI\u00a0scenarios,\u00a0the\u00a0importance\u00a0of\u00a0latency\u00a0often\u00a0surpasses\u00a0peak\u00a0bandwidth.\u00a0This\u00a0is\u00a0because\u00a0AI\u00a0tasks\u00a0involve\u00a0a\u00a0large\u00a0number\u00a0of\u00a0fine-grained\u00a0data\u00a0requests.\u00a0If\u00a0each\u00a0access\u00a0requires\u00a0waiting\u00a0for\u00a0tens\u00a0of\u00a0microseconds,\u00a0even\u00a0if\u00a0the\u00a0amount\u00a0of\u00a0data\u00a0per\u00a0request\u00a0is\u00a0small,\u00a0the\u00a0cumulative\u00a0effect\u00a0will\u00a0significantly\u00a0slow\u00a0down\u00a0the\u00a0overall\u00a0progress.\u00a0The\u00a0typical\u00a0access\u00a0latency\u00a0for\u00a0many\u00a0enterprise\u00a0SSDs\u00a0is\u00a0between\u00a040-100\u00a0microseconds,\u00a0which\u00a0is\u00a0acceptable\u00a0in\u00a0database\u00a0or\u00a0virtualization\u00a0scenarios,\u00a0but\u00a0appears\u00a0relatively\u00a0high\u00a0in\u00a0large-scale\u00a0AI\u00a0inference\u00a0or\u00a0training.\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;19qF-1767690802459&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;9itg-1767690802462&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;ordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;WfCX-1767690802458&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;IOPS\u00a0is\u00a0no\u00a0longer\u00a0just\u00a0\\&quot;good\u00a0enough.\\&quot;\u200b&quot;,&quot;marks&quot;:[{&quot;type&quot;:&quot;bold&quot;}]},{&quot;text&quot;:&quot;\u00a0IOPS\u00a0(Input\/Output\u00a0Operations\u00a0Per\u00a0Second)\u00a0has\u00a0long\u00a0been\u00a0one\u00a0of\u00a0the\u00a0core\u00a0metrics\u00a0for\u00a0SSDs,\u00a0but\u00a0in\u00a0traditional\u00a0businesses,\u00a0reaching\u00a0hundreds\u00a0of\u00a0thousands\u00a0of\u00a0IOPS\u00a0was\u00a0often\u00a0sufficient.\u00a0AI\u00a0workloads\u00a0are\u00a0completely\u00a0different.\u00a0Scenarios\u00a0like\u00a0vector\u00a0retrieval,\u00a0parameter\u00a0loading,\u00a0and\u00a0model\u00a0sharding\u00a0access\u00a0generate\u00a0an\u00a0extremely\u00a0large\u00a0number\u00a0of\u00a0random\u00a0read\u00a0requests.\u00a0In\u00a0such\u00a0cases,\u00a0the\u00a0IOPS\u00a0of\u00a0traditional\u00a0SSDs\u00a0quickly\u00a0becomes\u00a0a\u00a0system\u00a0bottleneck.\u00a0This\u00a0is\u00a0why\u00a0discussions\u00a0about\u00a0AI\u00a0SSD\u00a0technology\u00a0often\u00a0mention\u00a0targets\u00a0of\u00a0millions\u00a0or\u00a0even\u00a0tens\u00a0of\u00a0millions\u00a0of\u00a0IOPS,\u00a0which\u00a0was\u00a0very\u00a0rare\u00a0in\u00a0the\u00a0past.\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:2}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;T3PF-1767690802461&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;9itg-1767690802462&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;ordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;YL5t-1767690802460&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;The\u00a0CPU\u00a0becomes\u00a0a\u00a0\\&quot;bottleneck\u00a0for\u00a0transfer.\\&quot;&quot;,&quot;marks&quot;:[{&quot;type&quot;:&quot;bold&quot;}]},{&quot;text&quot;:&quot;\u200b\u00a0In\u00a0the\u00a0classic\u00a0server\u00a0architecture,\u00a0the\u00a0data\u00a0path\u00a0between\u00a0the\u00a0SSD\u00a0and\u00a0the\u00a0GPU\u00a0is\u00a0usually:\u00a0SSD\u00a0\u2192\u00a0CPU\u00a0\u2192\u00a0Memory\u00a0\u2192\u00a0GPU.\u00a0This\u00a0model\u00a0worked\u00a0well\u00a0in\u00a0the\u00a0era\u00a0of\u00a0general-purpose\u00a0computing\u00a0but\u00a0exposes\u00a0obvious\u00a0problems\u00a0in\u00a0AI\u00a0servers.\u00a0On\u00a0one\u00a0hand,\u00a0the\u00a0CPU\u00a0needs\u00a0to\u00a0handle\u00a0a\u00a0large\u00a0amount\u00a0of\u00a0data\u00a0movement;\u00a0on\u00a0the\u00a0other\u00a0hand,\u00a0this\u00a0path\u00a0itself\u00a0introduces\u00a0additional\u00a0latency.\u00a0As\u00a0GPU\u00a0computing\u00a0power\u00a0continues\u00a0to\u00a0increase,\u00a0this\u00a0data\u00a0path\u00a0that\u00a0\\&quot;goes\u00a0around\u00a0the\u00a0CPU\u00a0to\u00a0reach\u00a0the\u00a0GPU\\&quot;\u00a0is\u00a0becoming\u00a0a\u00a0constraint\u00a0on\u00a0overall\u00a0system\u00a0efficiency.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><ol><li><strong>Latency becomes a more critical metric than bandwidth.<\/strong>\u200b In traditional applications, <a href=\"\/news\/ssd-read-and-write-speed-explained-in-details\/\"><span style=\"color: #00ccff;\">sequential read\/write<\/span><\/a> bandwidth is often an important indicator for evaluating SSD performance, such as 7GB\/s or 14GB\/s. But in AI scenarios, the importance of latency often surpasses peak bandwidth. This is because AI tasks involve a large number of fine-grained data requests. If each access requires waiting for tens of microseconds, even if the amount of data per request is small, the cumulative effect will significantly slow down the overall progress. The typical access latency for many enterprise SSDs is between 40-100 microseconds, which is acceptable in database or virtualization scenarios, but appears relatively high in large-scale AI inference or training.<\/li><li><strong>IOPS is no longer just &#8220;good enough.&#8221;\u200b<\/strong> IOPS (Input\/Output Operations Per Second) has long been one of the core metrics for SSDs, but in traditional businesses, reaching hundreds of thousands of IOPS was often sufficient. AI workloads are completely different. Scenarios like vector retrieval, parameter loading, and model sharding access generate an extremely large number of random read requests. In such cases, the IOPS of traditional SSDs quickly becomes a system bottleneck. This is why discussions about AI SSD technology often mention targets of millions or even tens of millions of IOPS, which was very rare in the past.<\/li><li><strong>The CPU becomes a &#8220;bottleneck for transfer.&#8221;\u200b<\/strong> In the classic server architecture, the data path between the SSD and the GPU is usually: SSD \u2192 CPU \u2192 Memory \u2192 GPU. This model worked well in the era of general-purpose computing but exposes obvious problems in AI servers. On one hand, the CPU needs to handle a large amount of data movement; on the other hand, this path itself introduces additional latency. As GPU computing power continues to increase, this data path that &#8220;goes around the CPU to reach the GPU&#8221; is becoming a constraint on overall system efficiency.<\/li><\/ol><\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-d0ebd27 elementor-widget elementor-widget-heading\" data-id=\"d0ebd27\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Positioning Differences Between AI SSD and Traditional SSD<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4141528 elementor-widget elementor-widget-text-editor\" data-id=\"4141528\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;uADK-1767690836412&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;RCzA-1767690836413&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;To\u00a0understand\u00a0the\u00a0difference\u00a0between\u00a0the\u00a0two\u00a0more\u00a0intuitively,\u00a0we\u00a0can\u00a0compare\u00a0them\u00a0from\u00a0the\u00a0perspective\u00a0of\u00a0their\u00a0\\&quot;design\u00a0starting\u00a0point.\\&quot;&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>To understand the difference between the two more intuitively, we can compare them from the perspective of their &#8220;design starting point.&#8221;<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f1dda8c elementor-widget elementor-widget-text-editor\" data-id=\"f1dda8c\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<table>\n  <thead>\n    <tr>\n      <th>Comparison Dimension<\/th>\n      <th>Traditional SSD<\/th>\n      <th>AI SSD<\/th>\n    <\/tr>\n  <\/thead>\n  <tbody>\n    <tr>\n      <td>Primary Goal<\/td>\n      <td>Balance between general-purpose performance and cost<\/td>\n      <td>Extreme efficiency for AI workloads<\/td>\n    <\/tr>\n    <tr>\n      <td>Optimization Focus<\/td>\n      <td>Sequential read\/write, capacity, reliability<\/td>\n      <td>Low latency, high IOPS, high concurrency<\/td>\n    <\/tr>\n    <tr>\n      <td>Typical Applications<\/td>\n      <td>PC, Server, Database<\/td>\n      <td>AI Training, Inference, Vector Search<\/td>\n    <\/tr>\n    <tr>\n      <td>Data Access Pattern<\/td>\n      <td>Mixed, primarily sequential<\/td>\n      <td>Small-block, random, high-concurrency<\/td>\n    <\/tr>\n    <tr>\n      <td>System Collaboration<\/td>\n      <td>CPU-centric<\/td>\n      <td>Closer to GPU\/Accelerator<\/td>\n    <\/tr>\n  <\/tbody>\n<\/table>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e072119 key-point elementor-widget elementor-widget-text-editor\" data-id=\"e072119\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;5Icg-1767690843971&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;sNzJ-1767690843970&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;It\u00a0is\u00a0important\u00a0to\u00a0emphasize\u00a0that\u00a0AI\u00a0SSD\u00a0is\u00a0not\u00a0meant\u00a0to\u00a0replace\u00a0traditional\u00a0SSDs.\u00a0In\u00a0the\u00a0vast\u00a0majority\u00a0of\u00a0general-purpose\u00a0scenarios,\u00a0traditional\u00a0SSDs\u00a0remain\u00a0a\u00a0more\u00a0reasonable\u00a0and\u00a0cost-effective\u00a0choice.\u00a0The\u00a0existence\u00a0of\u00a0AI\u00a0SSD\u00a0is\u00a0to\u00a0serve\u00a0systems\u00a0that\u00a0are\u00a0already\u00a0\\&quot;pushed\u00a0to\u00a0the\u00a0limit\\&quot;\u00a0by\u00a0AI\u00a0workloads.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>It is important to emphasize that AI SSD is not meant to replace traditional SSDs. In the vast majority of general-purpose scenarios, traditional SSDs remain a more reasonable and cost-effective choice. The existence of AI SSD is to serve systems that are already &#8220;pushed to the limit&#8221; by AI workloads.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2309975 elementor-widget elementor-widget-heading\" data-id=\"2309975\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">The Core Positioning of AI SSD<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c890bc2 elementor-widget elementor-widget-text-editor\" data-id=\"c890bc2\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;iKG3-1767690861701&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;m2wT-1767690861702&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;From\u00a0a\u00a0system\u00a0perspective,\u00a0the\u00a0essential\u00a0role\u00a0of\u00a0AI\u00a0SSD\u00a0can\u00a0be\u00a0summarized\u00a0in\u00a0one\u00a0sentence:\u00a0&quot;,&quot;marks&quot;:[]},{&quot;text&quot;:&quot;its\u00a0task\u00a0is\u00a0not\u00a0just\u00a0to\u00a0store\u00a0data\u00a0itself,\u00a0but\u00a0to\u00a0provide\u00a0data\u00a0to\u00a0the\u00a0AI\u00a0computing\u00a0unit\u00a0efficiently,\u00a0stably,\u00a0and\u00a0continuously.\u00a0&quot;,&quot;marks&quot;:[{&quot;type&quot;:&quot;bold&quot;}]},{&quot;text&quot;:&quot;To\u00a0achieve\u00a0this\u00a0goal,\u00a0AI\u00a0SSDs\u00a0typically\u00a0focus\u00a0on\u00a0optimization\u00a0in\u00a0the\u00a0following\u00a0directions:&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;2fsy-1767691023381&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;WYgR-1767691042773&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;XOO0-1767691023380&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Extremely\u00a0low\u00a0access\u00a0latency\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:1}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;AA3S-1767691042766&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;WYgR-1767691042773&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;T02o-1767691042765&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Very\u00a0high\u00a0random\u00a0IOPS\u00a0capability\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:2}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;Cq3c-1767691042768&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;WYgR-1767691042773&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;0azo-1767691042767&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Internal\u00a0architecture\u00a0more\u00a0suited\u00a0to\u00a0AI\u00a0data\u00a0access\u00a0patterns\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:3}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;eBYb-1767691042770&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;WYgR-1767691042773&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;cskF-1767691042769&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Tighter\u00a0system-level\u00a0collaboration\u00a0methods&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>From a system perspective, the essential role of AI SSD can be summarized in one sentence: its task is not just to store data itself, but to provide data to the AI computing unit efficiently, stably, and continuously. To achieve this goal, AI SSDs typically focus on optimization in the following directions:<br \/><br \/><\/div><div><ul><li>Extremely low access latency<\/li><li>Very high random IOPS capability<\/li><li>Internal architecture more suited to AI data access patterns<\/li><li>Tighter system-level collaboration methods<\/li><\/ul><\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-70f0d55 elementor-widget elementor-widget-text-editor\" data-id=\"70f0d55\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;TsZ3-1767691042772&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;Zkzw-1767691042771&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;These\u00a0characteristics\u00a0are\u00a0not\u00a0achieved\u00a0by\u00a0simply\u00a0stacking\u00a0parameters,\u00a0but\u00a0often\u00a0require\u00a0rethinking\u00a0the\u00a0SSD's\u00a0controller\u00a0design,\u00a0flash\u00a0memory\u00a0management\u00a0strategies,\u00a0and\u00a0even\u00a0system\u00a0interface\u00a0methods.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>These characteristics are not achieved by simply stacking parameters, but often require rethinking the SSD&#8217;s controller design, flash memory management strategies, and even system interface methods.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fc8516d elementor-widget elementor-widget-heading\" data-id=\"fc8516d\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Key Technical Features and Architectural Approaches of AI SSD<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-74cdf07 elementor-widget elementor-widget-text-editor\" data-id=\"74cdf07\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;Wpn6-1767691051229&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;3W72-1767691051228&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;AI\u00a0SSD\u00a0is\u00a0not\u00a0simply\u00a0about\u00a0taking\u00a0an\u00a0existing\u00a0enterprise\u00a0SSD,\u00a0boosting\u00a0the\u00a0controller\u00a0performance,\u00a0adding\u00a0more\u00a0flash\u00a0memory,\u00a0and\u00a0maximizing\u00a0interface\u00a0speed\u00a0to\u00a0naturally\u00a0suit\u00a0AI\u00a0scenarios.\u00a0The\u00a0real\u00a0difficulty\u00a0lies\u00a0in\u00a0the\u00a0structural\u00a0difference\u00a0between\u00a0the\u00a0access\u00a0patterns\u00a0of\u00a0AI\u00a0workloads\u00a0and\u00a0traditional\u00a0storage\u00a0applications.\u00a0The\u00a0technological\u00a0evolution\u00a0of\u00a0AI\u00a0SSD\u00a0is\u00a0essentially\u00a0a\u00a0redesign\u00a0centered\u00a0around\u00a0data\u00a0access\u00a0patterns.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>AI SSD is not simply about taking an existing enterprise SSD, boosting the controller performance, adding more flash memory, and maximizing interface speed to naturally suit AI scenarios. The real difficulty lies in the structural difference between the access patterns of AI workloads and traditional storage applications. The technological evolution of AI SSD is essentially a redesign centered around data access patterns.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-6465f39 elementor-widget elementor-widget-heading\" data-id=\"6465f39\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">Extremely Low Latency<\/h3>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5e55c80 elementor-widget elementor-widget-text-editor\" data-id=\"5e55c80\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;KH5k-1767691067866&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;NuiR-1767691067867&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;In\u00a0AI\u00a0systems,\u00a0storage\u00a0latency\u00a0often\u00a0directly\u00a0determines\u00a0the\u00a0utilization\u00a0rate\u00a0of\u00a0computing\u00a0resources.\u00a0Taking\u00a0the\u00a0GPU\u00a0as\u00a0an\u00a0example,\u00a0its\u00a0computing\u00a0power\u00a0is\u00a0increasing\u00a0much\u00a0faster\u00a0than\u00a0storage\u00a0systems.\u00a0If\u00a0the\u00a0GPU\u00a0is\u00a0idle\u00a0while\u00a0waiting\u00a0for\u00a0data,\u00a0then\u00a0even\u00a0if\u00a0the\u00a0theoretical\u00a0computing\u00a0power\u00a0of\u00a0the\u00a0GPU\u00a0is\u00a0high,\u00a0the\u00a0actual\u00a0throughput\u00a0will\u00a0drop\u00a0significantly.\u00a0In\u00a0this\u00a0case,\u00a0average\u00a0latency\u00a0is\u00a0not\u00a0sufficient;\u00a0tail\u00a0latency\u00a0is\u00a0even\u00a0more\u00a0critical.\u00a0A\u00a0single\u00a0IO\u00a0operation\u00a0with\u00a0abnormal\u00a0latency\u00a0can\u00a0slow\u00a0down\u00a0the\u00a0execution\u00a0of\u00a0an\u00a0entire\u00a0batch.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>In AI systems, storage latency often directly determines the utilization rate of computing resources. Taking the GPU as an example, its computing power is increasing much faster than storage systems. If the GPU is idle while waiting for data, then even if the theoretical computing power of the GPU is high, the actual throughput will drop significantly. In this case, average latency is not sufficient; tail latency is even more critical. A single IO operation with abnormal latency can slow down the execution of an entire batch.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-a02663f elementor-widget elementor-widget-text-editor\" data-id=\"a02663f\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;cv0Q-1767691075650&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;lByC-1767691075649&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;The\u00a0access\u00a0latency\u00a0of\u00a0traditional\u00a0enterprise\u00a0NVMe\u00a0SSDs\u00a0is\u00a0typically\u00a0at\u00a0the\u00a0level\u00a0of\u00a0tens\u00a0of\u00a0microseconds,\u00a0which\u00a0is\u00a0a\u00a0very\u00a0mature\u00a0and\u00a0stable\u00a0result.\u00a0But\u00a0in\u00a0AI\u00a0scenarios,\u00a0the\u00a0industry\u00a0is\u00a0trying\u00a0to\u00a0further\u00a0compress\u00a0latency\u00a0to\u00a0the\u00a0level\u00a0of\u00a0ten\u00a0microseconds\u00a0or\u00a0even\u00a0close\u00a0to\u00a0single-digit\u00a0microseconds.\u00a0Achieving\u00a0this\u00a0is\u00a0not\u00a0just\u00a0about\u00a0increasing\u00a0interface\u00a0speed;\u00a0it\u00a0requires\u00a0systematic\u00a0optimization\u00a0in\u00a0the\u00a0following\u00a0areas:&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>The access latency of traditional enterprise NVMe SSDs is typically at the level of tens of microseconds, which is a very mature and stable result. But in AI scenarios, the industry is trying to further compress latency to the level of ten microseconds or even close to single-digit microseconds. Achieving this is not just about increasing interface speed; it requires systematic optimization in the following areas:<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c00c7e7 elementor-widget elementor-widget-text-editor\" data-id=\"c00c7e7\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;XiW1-1767691075652&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;b80V-1767691075659&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;gP5O-1767691075651&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Reducing\u00a0interrupts\u00a0and\u00a0context\u00a0switches\u00a0in\u00a0the\u00a0control\u00a0path\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;QH2P-1767691075654&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;b80V-1767691075659&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;Agqf-1767691075653&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Optimizing\u00a0flash\u00a0memory\u00a0access\u00a0scheduling\u00a0strategies\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:2}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;k8Mj-1767691075656&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;b80V-1767691075659&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;SVCe-1767691075655&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Shortening\u00a0the\u00a0internal\u00a0processing\u00a0path\u00a0of\u00a0data\u00a0within\u00a0the\u00a0controller&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><ul><li>Reducing interrupts and context switches in the control path<\/li><li>Optimizing flash memory access scheduling strategies<\/li><li>Shortening the internal processing path of data within the controller<\/li><\/ul><\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0dead2d elementor-widget elementor-widget-text-editor\" data-id=\"0dead2d\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;DQMh-1767691075658&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;Lbb4-1767691075657&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;It\u00a0can\u00a0be\u00a0said\u00a0that\u00a0latency\u00a0optimization\u00a0is\u00a0a\u00a0systematic\u00a0project\u00a0that\u00a0runs\u00a0through\u00a0the\u00a0entire\u00a0design\u00a0process\u00a0of\u00a0AI\u00a0SSD.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>It can be said that latency optimization is a systematic project that runs through the entire design process of AI SSD.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-d2b4503 elementor-widget elementor-widget-heading\" data-id=\"d2b4503\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">Ultra-High IOPS<\/h3>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-35371dd elementor-widget elementor-widget-text-editor\" data-id=\"35371dd\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;fy16-1767691115922&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;EG7L-1767691115921&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;During\u00a0model\u00a0training\u00a0and\u00a0inference,\u00a0data\u00a0access\u00a0often\u00a0exhibits\u00a0\\&quot;fragmented\\&quot;\u00a0characteristics.\u00a0For\u00a0example:&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>During model training and inference, data access often exhibits &#8220;fragmented&#8221; characteristics. For example:<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-16f790f elementor-widget elementor-widget-text-editor\" data-id=\"16f790f\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;Xnhl-1767691115924&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;AZYX-1767691115939&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;kyGj-1767691115923&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Model\u00a0parameters\u00a0are\u00a0split\u00a0into\u00a0numerous\u00a0small\u00a0blocks.\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;ZkSY-1767691115926&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;AZYX-1767691115939&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;IYHX-1767691115925&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Vector\u00a0databases\u00a0require\u00a0frequent\u00a0access\u00a0to\u00a0indexes\u00a0and\u00a0features.\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:2}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;56vg-1767691115928&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;AZYX-1767691115939&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;nUrP-1767691115927&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Multiple\u00a0models\u00a0or\u00a0tasks\u00a0run\u00a0in\u00a0parallel.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><ul><li>Model parameters are split into numerous small blocks.<\/li><li>Vector databases require frequent access to indexes and features.<\/li><li>Multiple models or tasks run in parallel.<\/li><\/ul><\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-d71038e elementor-widget elementor-widget-text-editor\" data-id=\"d71038e\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;E0We-1767691115930&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;2lLt-1767691115929&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;In\u00a0these\u00a0scenarios,\u00a0the\u00a0SSD\u00a0faces\u00a0not\u00a0a\u00a0few\u00a0large,\u00a0continuous\u00a0requests,\u00a0but\u00a0a\u00a0massive\u00a0number\u00a0of\u00a0concurrent\u00a0small\u00a0requests.\u00a0This\u00a0makes\u00a0IOPS\u00a0a\u00a0key\u00a0indicator\u00a0determining\u00a0the\u00a0performance\u00a0ceiling.\u00a0In\u00a0traditional\u00a0enterprise\u00a0SSDs,\u00a0several\u00a0hundred\u00a0thousand\u00a0IOPS\u00a0is\u00a0already\u00a0considered\u00a0high-end.\u00a0In\u00a0the\u00a0planning\u00a0of\u00a0AI\u00a0SSDs,\u00a0common\u00a0targets\u00a0are\u00a0millions,\u00a0several\u00a0million,\u00a0or\u00a0even\u00a0tens\u00a0of\u00a0millions\u00a0of\u00a0IOPS.\u00a0It\u00a0is\u00a0important\u00a0to\u00a0note\u00a0that\u00a0the\u00a0IOPS\u00a0referred\u00a0to\u00a0here\u00a0is\u00a0not\u00a0just\u00a0a\u00a0peak\u00a0value\u00a0under\u00a0laboratory\u00a0conditions,\u00a0but\u00a0a\u00a0sustainable\u00a0capability\u00a0under\u00a0high\u00a0concurrency\u00a0and\u00a0low\u00a0latency\u00a0constraints.\u00a0Improving\u00a0IOPS\u00a0is\u00a0not\u00a0a\u00a0problem\u00a0that\u00a0can\u00a0be\u00a0solved\u00a0simply\u00a0by\u00a0\\&quot;opening\u00a0more\u00a0queues.\\&quot;\u00a0When\u00a0the\u00a0number\u00a0of\u00a0concurrent\u00a0requests\u00a0is\u00a0extremely\u00a0large,\u00a0the\u00a0following\u00a0problems\u00a0quickly\u00a0emerge:&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>In these scenarios, the SSD faces not a few large, continuous requests, but a massive number of concurrent small requests. This makes IOPS a key indicator determining the performance ceiling. In traditional enterprise SSDs, several hundred thousand IOPS is already considered high-end. In the planning of AI SSDs, common targets are millions, several million, or even tens of millions of IOPS. It is important to note that the IOPS referred to here is not just a peak value under laboratory conditions, but a sustainable capability under high concurrency and low latency constraints. Improving IOPS is not a problem that can be solved simply by &#8220;opening more queues.&#8221; When the number of concurrent requests is extremely large, the following problems quickly emerge:<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5aeda00 elementor-widget elementor-widget-text-editor\" data-id=\"5aeda00\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;Tb5K-1767691115932&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;OMWR-1767691115940&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;8Dhq-1767691115931&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Increased\u00a0complexity\u00a0of\u00a0queue\u00a0management\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;Eiav-1767691115934&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;OMWR-1767691115940&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;pAvs-1767691115933&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Uneven\u00a0load\u00a0between\u00a0flash\u00a0memory\u00a0channels\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:2}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;tTq9-1767691115936&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;OMWR-1767691115940&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;5Bbk-1767691115935&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Interference\u00a0from\u00a0write\u00a0amplification\u00a0and\u00a0garbage\u00a0collection&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><ul><li>Increased complexity of queue management<\/li><li>Uneven load between flash memory channels<\/li><li>Interference from write amplification and garbage collection<\/li><\/ul><\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ce71499 elementor-widget elementor-widget-text-editor\" data-id=\"ce71499\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;Sq0P-1767691115938&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;aVTZ-1767691115937&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Therefore,\u00a0AI\u00a0SSDs\u00a0often\u00a0need\u00a0to\u00a0introduce\u00a0more\u00a0aggressive\u00a0concurrent\u00a0scheduling\u00a0strategies\u00a0at\u00a0the\u00a0firmware\u00a0level,\u00a0while\u00a0also\u00a0managing\u00a0flash\u00a0memory\u00a0resources\u00a0more\u00a0finely.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>Therefore, AI SSDs often need to introduce more aggressive concurrent scheduling strategies at the firmware level, while also managing flash memory resources more finely.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9edca07 elementor-widget elementor-widget-heading\" data-id=\"9edca07\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">System Co-Design for GPUs and Accelerators<\/h3>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-08f762b elementor-widget elementor-widget-text-editor\" data-id=\"08f762b\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;n9Cj-1767691138171&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;Xffr-1767691138170&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;In\u00a0classic\u00a0server\u00a0architecture,\u00a0there\u00a0is\u00a0usually\u00a0no\u00a0direct\u00a0data\u00a0path\u00a0between\u00a0the\u00a0SSD\u00a0and\u00a0the\u00a0GPU.\u00a0Data\u00a0must\u00a0first\u00a0pass\u00a0through\u00a0the\u00a0CPU\u00a0and\u00a0system\u00a0memory\u00a0before\u00a0being\u00a0sent\u00a0to\u00a0the\u00a0GPU.\u00a0The\u00a0problems\u00a0of\u00a0this\u00a0architecture\u00a0are\u00a0amplified\u00a0in\u00a0AI\u00a0servers.\u00a0As\u00a0the\u00a0number\u00a0of\u00a0GPUs\u00a0increases\u00a0and\u00a0single-card\u00a0computing\u00a0power\u00a0continues\u00a0to\u00a0improve,\u00a0the\u00a0efficiency\u00a0issues\u00a0of\u00a0this\u00a0indirect\u00a0path\u00a0become\u00a0more\u00a0pronounced.\u00a0To\u00a0solve\u00a0this\u00a0problem,\u00a0AI\u00a0SSDs\u00a0are\u00a0beginning\u00a0to\u00a0make\u00a0new\u00a0attempts\u00a0at\u00a0the\u00a0system\u00a0level,\u00a0such\u00a0as:&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>In classic server architecture, there is usually no direct data path between the SSD and the GPU. Data must first pass through the CPU and system memory before being sent to the GPU. The problems of this architecture are amplified in AI servers. As the number of GPUs increases and single-card computing power continues to improve, the efficiency issues of this indirect path become more pronounced. To solve this problem, AI SSDs are beginning to make new attempts at the system level, such as:<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2cd56f7 elementor-widget elementor-widget-text-editor\" data-id=\"2cd56f7\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;hIn4-1767691138173&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;wV10-1767691138180&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;CVwb-1767691138172&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Supporting\u00a0GPU-direct\u00a0or\u00a0near-direct\u00a0data\u00a0access\u00a0modes.\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;wald-1767691138175&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;wV10-1767691138180&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;ooJU-1767691138174&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Reducing\u00a0unnecessary\u00a0CPU\u00a0involvement.\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:2}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;K3hR-1767691138177&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;wV10-1767691138180&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;0W5H-1767691138176&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Optimizing\u00a0the\u00a0data\u00a0transmission\u00a0path\u00a0between\u00a0storage\u00a0and\u00a0accelerators.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><ul><li>Supporting GPU-direct or near-direct data access modes.<\/li><li>Reducing unnecessary CPU involvement.<\/li><li>Optimizing the data transmission path between storage and accelerators.<\/li><\/ul><\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-14b9406 elementor-widget elementor-widget-text-editor\" data-id=\"14b9406\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;bKpH-1767691138179&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;XUbQ-1767691138178&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;These\u00a0designs\u00a0do\u00a0not\u00a0necessarily\u00a0mean\u00a0completely\u00a0bypassing\u00a0the\u00a0CPU,\u00a0but\u00a0rather\u00a0reducing\u00a0data\u00a0copying\u00a0and\u00a0transfer\u00a0times\u00a0in\u00a0appropriate\u00a0scenarios\u00a0to\u00a0improve\u00a0overall\u00a0efficiency.\u00a0In\u00a0this\u00a0architecture,\u00a0storage\u00a0is\u00a0no\u00a0longer\u00a0just\u00a0a\u00a0\\&quot;passive\u00a0data\u00a0warehouse,\\&quot;\u00a0but\u00a0more\u00a0like\u00a0an\u00a0active\u00a0data\u00a0service\u00a0node\u00a0within\u00a0the\u00a0AI\u00a0system.\u00a0It\u00a0needs\u00a0to\u00a0understand\u00a0the\u00a0upper-level\u00a0access\u00a0patterns,\u00a0respond\u00a0quickly\u00a0to\u00a0concurrent\u00a0requests,\u00a0and\u00a0form\u00a0efficient\u00a0collaboration\u00a0with\u00a0computing\u00a0units.\u00a0This\u00a0is\u00a0also\u00a0why\u00a0the\u00a0design\u00a0of\u00a0AI\u00a0SSDs\u00a0often\u00a0requires\u00a0deep\u00a0collaboration\u00a0with\u00a0the\u00a0entire\u00a0machine\u00a0system,\u00a0and\u00a0even\u00a0GPU\u00a0manufacturers.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>These designs do not necessarily mean completely bypassing the CPU, but rather reducing data copying and transfer times in appropriate scenarios to improve overall efficiency. In this architecture, storage is no longer just a &#8220;passive data warehouse,&#8221; but more like an active data service node within the AI system. It needs to understand the upper-level access patterns, respond quickly to concurrent requests, and form efficient collaboration with computing units. This is also why the design of AI SSDs often requires deep collaboration with the entire machine system, and even GPU manufacturers.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-daa82bc elementor-widget elementor-widget-heading\" data-id=\"daa82bc\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">Large Capacity and High Density<\/h3>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c341e81 elementor-widget elementor-widget-text-editor\" data-id=\"c341e81\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;C7rY-1767691177622&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;w0iI-1767691177621&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;As\u00a0large\u00a0model\u00a0parameter\u00a0sizes\u00a0continue\u00a0to\u00a0grow,\u00a0with\u00a0individual\u00a0models\u00a0often\u00a0reaching\u00a0hundreds\u00a0of\u00a0GB\u00a0or\u00a0even\u00a0several\u00a0TB,\u00a0storage\u00a0systems\u00a0must\u00a0have\u00a0sufficient\u00a0capacity\u00a0and\u00a0density\u00a0to\u00a0support\u00a0actual\u00a0deployment.\u00a0Furthermore,\u00a0AI\u00a0systems\u00a0often\u00a0need\u00a0to\u00a0store\u00a0multiple\u00a0versions\u00a0of\u00a0models,\u00a0training\u00a0data,\u00a0intermediate\u00a0results,\u00a0vector\u00a0databases,\u00a0indexes,\u00a0etc.\u00a0This\u00a0makes\u00a0high\u00a0capacity\u00a0and\u00a0high\u00a0density\u00a0another\u00a0important\u00a0characteristic\u00a0of\u00a0AI\u00a0SSDs.\u00a0However,\u00a0high\u00a0capacity\u00a0is\u00a0not\u00a0\\&quot;free.\\&quot;\u00a0Increasing\u00a0flash\u00a0density\u00a0often\u00a0comes\u00a0with\u00a0costs:&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>As large model parameter sizes continue to grow, with individual models often reaching hundreds of GB or even several TB, storage systems must have sufficient capacity and density to support actual deployment. Furthermore, AI systems often need to store multiple versions of models, training data, intermediate results, vector databases, indexes, etc. This makes high capacity and high density another important characteristic of AI SSDs. However, high capacity is not &#8220;free.&#8221; Increasing flash density often comes with costs:<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-472ec8b elementor-widget elementor-widget-text-editor\" data-id=\"472ec8b\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;08uS-1767691177624&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;rtYU-1767691177631&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;uA8e-1767691177623&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Increased\u00a0access\u00a0latency\u00a0per\u00a0flash\u00a0die.\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;CbJg-1767691177626&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;rtYU-1767691177631&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;eR5F-1767691177625&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Limited\u00a0concurrent\u00a0performance.\\n&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{&quot;index&quot;:2}},{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;JjGq-1767691177628&quot;,&quot;name&quot;:&quot;list-item&quot;,&quot;data&quot;:{&quot;version&quot;:1,&quot;listId&quot;:&quot;rtYU-1767691177631&quot;,&quot;listLevel&quot;:1,&quot;listType&quot;:&quot;unordered&quot;},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;Hdvm-1767691177627&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Challenges\u00a0to\u00a0endurance\u00a0and\u00a0reliability.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div><ul><li>Increased access latency per flash die.<\/li><li>Limited concurrent performance.<\/li><li>Challenges to endurance and reliability.<\/li><\/ul><\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f6ea38e elementor-widget elementor-widget-text-editor\" data-id=\"f6ea38e\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;qrtT-1767691177630&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;G1WD-1767691177629&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Therefore,\u00a0while\u00a0pursuing\u00a0capacity,\u00a0AI\u00a0SSDs\u00a0also\u00a0need\u00a0to\u00a0minimize\u00a0the\u00a0performance\u00a0loss\u00a0caused\u00a0by\u00a0high\u00a0density\u00a0through\u00a0architectural\u00a0and\u00a0scheduling\u00a0designs.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>Therefore, while pursuing capacity, AI SSDs also need to minimize the performance loss caused by high density through architectural and scheduling designs.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7008138 elementor-widget elementor-widget-heading\" data-id=\"7008138\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Industry Status and Future Trends of AI SSD<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7fe709f elementor-widget elementor-widget-text-editor\" data-id=\"7fe709f\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;RcS9-1767691189622&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;2mTh-1767691189623&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;If\u00a0viewed\u00a0from\u00a0the\u00a0perspective\u00a0of\u00a0the\u00a0technology\u00a0lifecycle,\u00a0AI\u00a0SSD\u00a0is\u00a0still\u00a0in\u00a0a\u00a0stage\u00a0of\u00a0\\&quot;early\u00a0adoption\u00a0and\u00a0rapid\u00a0exploration.\\&quot;\u00a0On\u00a0one\u00a0hand,\u00a0AI\u00a0SSD\u00a0is\u00a0not\u00a0just\u00a0a\u00a0marketing\u00a0term\u00a0that\u00a0remains\u00a0at\u00a0the\u00a0conceptual\u00a0level;\u00a0AI\u00a0workloads\u00a0already\u00a0exist\u00a0and\u00a0are\u00a0growing\u00a0rapidly\u00a0in\u00a0data\u00a0centers,\u00a0cloud\u00a0computing\u00a0platforms,\u00a0and\u00a0large\u00a0enterprises.\u00a0On\u00a0the\u00a0other\u00a0hand,\u00a0AI\u00a0SSD\u00a0has\u00a0not\u00a0yet\u00a0formed\u00a0a\u00a0completely\u00a0unified\u00a0and\u00a0standardized\u00a0product\u00a0form.\u00a0True\u00a0SSDs\u00a0designed\u00a0entirely\u00a0around\u00a0AI\u00a0are\u00a0still\u00a0mainly\u00a0limited\u00a0to\u00a0customized,\u00a0pre-research,\u00a0and\u00a0high-end\u00a0enterprise\u00a0products,\u00a0and\u00a0are\u00a0some\u00a0distance\u00a0away\u00a0from\u00a0widespread\u00a0adoption.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>If viewed from the perspective of the technology lifecycle, AI SSD is still in a stage of &#8220;early adoption and rapid exploration.&#8221; On one hand, AI SSD is not just a marketing term that remains at the conceptual level; AI workloads already exist and are growing rapidly in data centers, cloud computing platforms, and large enterprises. On the other hand, AI SSD has not yet formed a completely unified and standardized product form. True SSDs designed entirely around AI are still mainly limited to customized, pre-research, and high-end enterprise products, and are some distance away from widespread adoption.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-1e04802 elementor-widget elementor-widget-text-editor\" data-id=\"1e04802\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;GH15-1767691201219&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;3KOT-1767691201218&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;In\u00a0the\u00a0advancement\u00a0of\u00a0AI\u00a0SSD,\u00a0the\u00a0roles\u00a0within\u00a0the\u00a0traditional\u00a0storage\u00a0industry\u00a0chain\u00a0are\u00a0changing.\u00a0In\u00a0the\u00a0past,\u00a0SSD\u00a0manufacturers\u00a0focused\u00a0more\u00a0on\u00a0the\u00a0performance\u00a0and\u00a0reliability\u00a0of\u00a0single\u00a0devices.\u00a0In\u00a0the\u00a0AI\u00a0scenario,\u00a0they\u00a0need\u00a0to\u00a0participate\u00a0earlier\u00a0in\u00a0system-level\u00a0design,\u00a0collaborate\u00a0with\u00a0server,\u00a0GPU,\u00a0and\u00a0cloud\u00a0platform\u00a0providers,\u00a0and\u00a0perform\u00a0deep\u00a0optimization\u00a0for\u00a0specific\u00a0AI\u00a0workloads.\u00a0This\u00a0is\u00a0blurring\u00a0the\u00a0boundaries\u00a0between\u00a0storage\u00a0original\u00a0equipment\u00a0manufacturers,\u00a0controller\u00a0manufacturers,\u00a0and\u00a0system\u00a0integrators.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>In the advancement of AI SSD, the roles within the traditional storage industry chain are changing. In the past, SSD manufacturers focused more on the performance and reliability of single devices. In the AI scenario, they need to participate earlier in system-level design, collaborate with server, GPU, and cloud platform providers, and perform deep optimization for specific AI workloads. This is blurring the boundaries between storage original equipment manufacturers, controller manufacturers, and system integrators.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-55d6391 elementor-widget elementor-widget-text-editor\" data-id=\"55d6391\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;7f3g-1767691201221&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;LTeL-1767691201220&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;Future\u00a0AI\u00a0SSDs\u00a0will\u00a0likely\u00a0no\u00a0longer\u00a0be\u00a0just\u00a0\\&quot;a\u00a0device\u00a0plugged\u00a0into\u00a0a\u00a0PCIe\u00a0slot,\\&quot;\u00a0but\u00a0will\u00a0be\u00a0more\u00a0deeply\u00a0co-designed\u00a0with\u00a0computing\u00a0resources\u00a0at\u00a0the\u00a0system\u00a0level.\u00a0This\u00a0may\u00a0be\u00a0reflected\u00a0in:\u00a0more\u00a0direct\u00a0data\u00a0paths,\u00a0fewer\u00a0intermediate\u00a0copies,\u00a0and\u00a0tighter\u00a0software-hardware\u00a0collaboration.\u00a0This\u00a0will\u00a0further\u00a0weaken\u00a0the\u00a0boundary\u00a0between\u00a0storage\u00a0and\u00a0computation.\u00a0On\u00a0the\u00a0other\u00a0hand,\u00a0as\u00a0AI\u00a0systems\u00a0scale,\u00a0\\&quot;handing\u00a0all\u00a0computation\u00a0to\u00a0the\u00a0GPU\\&quot;\u00a0is\u00a0not\u00a0necessarily\u00a0the\u00a0optimal\u00a0solution.\u00a0In\u00a0some\u00a0scenarios,\u00a0having\u00a0the\u00a0storage\u00a0device\u00a0handle\u00a0some\u00a0data\u00a0processing\u00a0or\u00a0preprocessing\u00a0tasks\u00a0can\u00a0help\u00a0reduce\u00a0the\u00a0overall\u00a0system\u00a0load.\u00a0Although\u00a0such\u00a0ideas\u00a0are\u00a0still\u00a0in\u00a0the\u00a0exploratory\u00a0stage,\u00a0they\u00a0have\u00a0become\u00a0a\u00a0focus\u00a0of\u00a0industry\u00a0attention.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>Future AI SSDs will likely no longer be just &#8220;a device plugged into a PCIe slot,&#8221; but will be more deeply co-designed with computing resources at the system level. This may be reflected in: more direct data paths, fewer intermediate copies, and tighter software-hardware collaboration. This will further weaken the boundary between storage and computation. On the other hand, as AI systems scale, &#8220;handing all computation to the GPU&#8221; is not necessarily the optimal solution. In some scenarios, having the storage device handle some data processing or preprocessing tasks can help reduce the overall system load. Although such ideas are still in the exploratory stage, they have become a focus of industry attention.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f1f0b11 conclusion elementor-widget elementor-widget-text-editor\" data-id=\"f1f0b11\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<article data-content=\"[{&quot;type&quot;:&quot;block&quot;,&quot;id&quot;:&quot;MV6Q-1767691277385&quot;,&quot;name&quot;:&quot;paragraph&quot;,&quot;data&quot;:{&quot;version&quot;:1},&quot;nodes&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;id&quot;:&quot;lHEX-1767691277384&quot;,&quot;leaves&quot;:[{&quot;text&quot;:&quot;AI\u00a0SSD\u00a0is\u00a0not\u00a0a\u00a0new\u00a0species\u00a0that\u00a0fundamentally\u00a0changes\u00a0what\u00a0storage\u00a0is,\u00a0but\u00a0a\u00a0reshaping\u00a0of\u00a0the\u00a0storage\u00a0role\u00a0centered\u00a0around\u00a0AI\u00a0workloads.\u00a0Its\u00a0emergence\u00a0stems\u00a0from\u00a0the\u00a0fundamental\u00a0changes\u00a0AI\u00a0brings\u00a0to\u00a0data\u00a0access\u00a0patterns;\u00a0its\u00a0value\u00a0lies\u00a0not\u00a0in\u00a0the\u00a0exaggerated\u00a0improvement\u00a0of\u00a0individual\u00a0metrics,\u00a0but\u00a0in\u00a0the\u00a0continuous\u00a0improvement\u00a0of\u00a0system\u00a0efficiency;\u00a0its\u00a0future\u00a0will\u00a0not\u00a0replace\u00a0all\u00a0SSDs,\u00a0but\u00a0will\u00a0become\u00a0an\u00a0indispensable\u00a0part\u00a0of\u00a0AI\u00a0infrastructure.\u00a0If\u00a0the\u00a0GPU\u00a0is\u00a0the\u00a0\\&quot;engine\\&quot;\u00a0of\u00a0the\u00a0AI\u00a0system,\u00a0then\u00a0the\u00a0AI\u00a0SSD\u00a0is\u00a0more\u00a0like\u00a0the\u00a0fuel\u00a0system\u00a0that\u00a0provides\u00a0stable,\u00a0continuous\u00a0supply.\u00a0It\u00a0may\u00a0not\u00a0be\u00a0the\u00a0most\u00a0conspicuous,\u00a0but\u00a0once\u00a0it\u00a0falls\u00a0behind,\u00a0the\u00a0entire\u00a0system\u00a0is\u00a0affected.&quot;,&quot;marks&quot;:[]}]}],&quot;state&quot;:{}}]\"><div>AI SSD is not a new species that fundamentally changes what storage is, but a reshaping of the storage role centered around AI workloads. Its emergence stems from the fundamental changes AI brings to data access patterns; its value lies not in the exaggerated improvement of individual metrics, but in the continuous improvement of system efficiency; its future will not replace all SSDs, but will become an indispensable part of AI infrastructure. If the GPU is the &#8220;engine&#8221; of the AI system, then the AI SSD is more like the fuel system that provides stable, continuous supply. It may not be the most conspicuous, but once it falls behind, the entire system is affected.<\/div><\/article>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>\u0644\u0627 \u064a\u062a\u0639\u0644\u0642 \u062c\u0648\u0647\u0631 \u0627\u0644\u0630\u0643\u0627\u0621 \u0627\u0644\u0627\u0635\u0637\u0646\u0627\u0639\u064a SSD \u0628\u0640 \"\u0645\u0627 \u0625\u0630\u0627 \u0643\u0627\u0646 \u0647\u0646\u0627\u0643 \u0630\u0643\u0627\u0621 \u0627\u0635\u0637\u0646\u0627\u0639\u064a \u062f\u0627\u062e\u0644 \u0645\u062d\u0631\u0643 \u0623\u0642\u0631\u0627\u0635 \u0627\u0644\u062d\u0627\u0644\u0629 \u0627\u0644\u0635\u0644\u0628\u0629\"\u060c \u0628\u0644 \u064a\u062a\u0639\u0644\u0642 \u0628\u0640 \"\u0645\u0627 \u0625\u0630\u0627 \u0643\u0627\u0646 \u0645\u062d\u0631\u0643 \u0623\u0642\u0631\u0627\u0635 \u0627\u0644\u062d\u0627\u0644\u0629 \u0627\u0644\u0635\u0644\u0628\u0629 \u0642\u062f \u0648\u064f\u0644\u062f \u0644\u0623\u0639\u0628\u0627\u0621 \u0639\u0645\u0644 \u0627\u0644\u0630\u0643\u0627\u0621 \u0627\u0644\u0627\u0635\u0637\u0646\u0627\u0639\u064a\". \u0648\u0628\u0634\u0643\u0644 \u0623\u0643\u062b\u0631 \u062f\u0642\u0629\u060c \u0641\u0625\u0646 SSD \u0644\u0644\u0630\u0643\u0627\u0621 \u0627\u0644\u0627\u0635\u0637\u0646\u0627\u0639\u064a \u0647\u0648 \u0646\u0648\u0639 \u0645\u0646 \u0623\u062c\u0647\u0632\u0629 \u0627\u0644\u062a\u062e\u0632\u064a\u0646 \u0630\u0627\u062a \u0627\u0644\u062d\u0627\u0644\u0629 \u0627\u0644\u0635\u0644\u0628\u0629 \u0627\u0644\u0645\u062d\u0633\u0651\u0646\u0629 \u0628\u0639\u0645\u0642 \u0644\u0633\u064a\u0646\u0627\u0631\u064a\u0648\u0647\u0627\u062a \u0627\u0644\u062a\u062f\u0631\u064a\u0628 \u0648\u0627\u0644\u0627\u0633\u062a\u062f\u0644\u0627\u0644 \u0648\u062e\u062f\u0645\u0629 \u0627\u0644\u0628\u064a\u0627\u0646\u0627\u062a \u0641\u064a \u0627\u0644\u0630\u0643\u0627\u0621 \u0627\u0644\u0627\u0635\u0637\u0646\u0627\u0639\u064a. <\/p>","protected":false},"author":4,"featured_media":14389,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"set","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[52],"tags":[],"class_list":["post-14357","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry-news"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/posts\/14357","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/comments?post=14357"}],"version-history":[{"count":47,"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/posts\/14357\/revisions"}],"predecessor-version":[{"id":14406,"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/posts\/14357\/revisions\/14406"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/media\/14389"}],"wp:attachment":[{"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/media?parent=14357"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/categories?post=14357"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.oscoo.com\/ar\/wp-json\/wp\/v2\/tags?post=14357"}],"curies":[{"name":"\u062f\u0628\u0644\u064a\u0648 \u0628\u064a","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}