The Future of AI is Data and Synthetic Data Industry

On March 7, Yao Qian, the director of the Science and Technology Supervision Bureau of the CSRC, wrote in China Finance that it was recommended to focus on the…

The Future of AI is Data and Synthetic Data Industry

On March 7, Yao Qian, the director of the Science and Technology Supervision Bureau of the CSRC, wrote in China Finance that it was recommended to focus on the development of synthetic data industry based on AIGC technology. With higher efficiency, lower cost and higher quality as the “incremental expansion” of the data element market, it helps to create data advantages for the future development of AI. In terms of strengthening the high-quality supply of data elements, we should make overall plans for self-reliance and opening-up. Consider establishing filtered domestic mirror sites for specific data sources such as Wikipedia and Reddit for use by domestic data processors.

Yao Qian, Director of the Science and Technology Regulatory Bureau of the CSRC: Focus on the development of synthetic data industry based on AIGC technology

Interpretation of the news:


In his article published on March 7, Yao Qian, the director of the Science and Technology Supervision Bureau of the China Securities Regulatory Commission (CSRC), emphasized the importance of developing the synthetic data industry based on Artificial Intelligence and Generative Adversarial Networks (AIGC) technology. He highlighted that this industry can help in creating data advantages for the future development of AI. This is because synthetic data production can achieve higher efficiency, lower cost, and higher quality, which can lead to an “incremental expansion” of the data element market.

Yao Qian further emphasizes the need for a stronger focus on the high-quality supply of data elements. He suggests making overall plans for self-reliance and opening-up. This implies that China should not rely solely on foreign sources of data but rather encourage the production and supply of data within the country. To strengthen the local supply chain, he recommends establishing filtered domestic mirror sites that can be used by data processors. For instance, they could create domestic mirrors of data sources such as Wikipedia and Reddit, which are primarily controlled by foreign entities.

The suggested approach’s core objective is to make it easier for data processors in China to access quality data sources while at the same time avoiding the risks of security breaches or the violation of political and cultural values. In China, access to foreign data sources is often limited, mainly due to concerns around cybersecurity, data privacy, and potential political influence. The domestic mirror sites aim to mitigate the risk of accessing data from foreign sources, reducing external reliance and enhancing domestic control.

In conclusion, the rapid development of AI requires access to vast amounts of diverse data. Thus, the production and supply of data sources are becoming a critical factor in determining the success and competitiveness of AI development. However, self-reliance, data security, and data privacy concerns make the acquisition of quality data sources a challenging task. Therefore, developing the synthetic data industry based on AIGC technology and establishing filtered domestic mirror sites can be a viable solution. This will not only help China to acquire the required data for AI development but also strengthen its position in the global AI market.

This article and pictures are from the Internet and do not represent Fpips's position. If you infringe, please contact us to delete:https://www.fpips.com/5567/

It is strongly recommended that you study, review, analyze and verify the content independently, use the relevant data and content carefully, and bear all risks arising therefrom.