{"id":1546,"date":"2021-04-27T10:00:00","date_gmt":"2021-04-27T01:00:00","guid":{"rendered":"https:\/\/blog.testworks.co.kr\/en\/?p=1546"},"modified":"2022-12-28T15:41:40","modified_gmt":"2022-12-28T06:41:40","slug":"the-importance-of-gan-in-creating-ai-training-datasets","status":"publish","type":"post","link":"https:\/\/blog.aiworkx.ai\/en\/the-importance-of-gan-in-creating-ai-training-datasets\/","title":{"rendered":"The Importance of GAN in Creating AI Training Datasets"},"content":{"rendered":"\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large is-resized\"><img decoding=\"async\" src=\"https:\/\/blog.testworks.co.kr\/en\/wp-content\/uploads\/sites\/3\/2022\/12\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Cover-Image.png\" alt=\"\" class=\"wp-image-1835\" width=\"1200\" srcset=\"https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2022\/12\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Cover-Image.png 800w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2022\/12\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Cover-Image-300x187.png 300w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2022\/12\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Cover-Image-768x479.png 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/figure><\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-1 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-2 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<p><\/p>\n\n\n\n<div style=\"height:40px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p style=\"font-size:18px\">At the International Space 
Conference (IAC)<span class=\"has-inline-color has-vivid-cyan-blue-color\"><strong>[i]<\/strong><\/span>, held in Guadalajara, Mexico on September 27, 2016, SpaceX and Tesla CEO Elon Musk said, \u201cWe will send our first rover to Mars by 2018 and build our first space colony in 2024.\u201d In the Iron Man movies, the AI assistant Jarvis handles complex tasks, from designing, manufacturing, assembling, and piloting a robot suit to managing everyday life, in response to the gestures and voice of the character played by Robert Downey Jr. With the efforts and imagination of scientists around the world, it seems that we will be able to enjoy the fruits of technology in the not-too-distant future.<\/p>\n\n\n\n<p style=\"font-size:18px\">In June 2015, shortly after the launch of Google Photos, it emerged that the service had classified a photo of a Black man living in the United States and his Black friend as &#8216;gorillas&#8217;, leading to a controversy over racism. Google promised to correct it immediately, but in 2018 the American technology magazine Wired reported that Google had resolved the problem simply by deleting the relevant keywords and classifications from the system. In April 2021 (the time of writing), I had a short conversation with an AI speaker that did not work properly for anything except \u201cTell me the weather today,\u201d boarded a human-driven &#8216;non-autonomous vehicle&#8217;, and went to work at the Testworks headquarters in Jamsil. There is still a large gap between people&#8217;s expectations for artificial intelligence and the reality <strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[ii]<\/span><\/strong>. 
As someone whose job is to develop AI models, I find this embarrassing.<\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-3 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"366\" height=\"345\" src=\"https:\/\/blog.testworks.co.kr\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-1.png\" alt=\"\" class=\"wp-image-1547\" srcset=\"https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-1.png 366w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-1-300x283.png 300w\" sizes=\"auto, (max-width: 366px) 100vw, 366px\" \/><figcaption><strong>[Figure 1] The Gartner Hype Cycle shows the progression of a technology from expectations to reality<\/strong><\/figcaption><\/figure><\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-4 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<p style=\"font-size:18px\">So why, despite the innovative announcements made every year by world-class scholars and CEOs of global IT companies, have our lives not changed much? Data, the foundation of AI training, is essential to the success of the &#8216;4th Industrial Revolution&#8217; led by AI. In industries that have not yet benefited from AI innovation, developers tend to say &#8216;Give us AI too&#8217;, while machine learning engineers respond &#8216;Give us data first&#8217;. 
I would like to show that Generative Adversarial Network (GAN) <strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[iii]<\/span><\/strong> technology is one way to solve the problem of insufficient data.<\/p>\n\n\n\n<p style=\"font-size:18px\">Ian Goodfellow, who first proposed the GAN, likened it to a game between the police and a counterfeiter. The generator (the counterfeiter) trains repeatedly with the goal of producing counterfeit bills that can deceive the discriminator (the police), until the synthesized data reaches a level where the police cannot distinguish real from fake. This technology plays two important roles in preparing data for AI training.<\/p>\n\n\n\n<div style=\"height:40px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h1 class=\"wp-block-heading\">First, it solves the problem of sparse data through quantitative and qualitative advancement of data.<\/h1>\n\n\n\n<p style=\"font-size:18px\">To build a data set for AI training, two steps are required: 1) data collection and 2) data annotation. 
If either of these steps is incomplete, it is difficult to train the AI model as expected.<\/p>\n\n\n\n<p class=\"has-medium-font-size\"><strong>1) Examples of collection difficulties<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\" style=\"font-size:18px\"><li>Data on special situations such as snow\/rain\/yellow dust\/accidents, needed to improve autonomous driving AI.<\/li><li>Video data from surveillance cameras for developing a surveillance system for a secure military area.<\/li><li>Images of private parts for developing an AI that diagnoses sexually transmitted diseases.<\/li><li>Cancer cell data for training a cancer-diagnosis AI, since cancer patients cannot be created on demand.<\/li><li>Damaged-vehicle data for calculating accident estimates, since it is impractical to wreck vehicles just to collect it.<\/li><\/ul>\n\n\n\n<p class=\"has-medium-font-size\"><strong>2) Examples of labeling difficulties<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\" style=\"font-size:18px\"><li>Medical and legal data, which are difficult to label without professional knowledge.<\/li><li>Foreign-language pronunciation data, which requires hiring native speakers to evaluate it.<\/li><li>Sensor data values that cannot be verified by the human eye.<\/li><li>Data whose labels depend on the evaluator&#8217;s subjective judgment (e.g., character expression, style \u2013 Dandy\/Chic).<\/li><li>Data that workers may find disturbing to process (e.g., pornography, violent scenes).<\/li><\/ul>\n\n\n\n<p style=\"font-size:18px\">By using a GAN, a training data set can be built by augmenting a small set of collected and labeled data. 
In a study <strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[iv]<\/span><\/strong> conducted in Tel Aviv, Israel, researchers worked closely with hospitals and medical institutions for 6 years to collect liver lesion data for training an AI model that diagnoses liver lesions (cysts, metastases, hemangiomas). The result was a total of 182 images. Just 182 images!<\/p>\n\n\n\n<p style=\"font-size:18px\">If an AI model is trained on only this small amount of data, it will suffer from bias and overfitting, and it will struggle to perform robustly on varied new inputs. The researchers therefore used a GAN to synthesize tens of thousands of new images with the characteristics of each lesion type (cyst, metastasis, hemangioma), trained the model on them, and obtained improved sensitivity (No Augmentation: 57%, Simple Augmentation: 78.6%, GAN Augmentation: 85.7%).<\/p>\n\n\n\n<p style=\"font-size:18px\">Simple augmentation here refers to translating, rotating, flipping, and scaling the image data.<\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-5 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"415\" src=\"https:\/\/blog.testworks.co.kr\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-2-1024x415.png\" alt=\"\" class=\"wp-image-1548\" srcset=\"https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-2-1024x415.png 1024w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-2-300x122.png 300w, 
https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-2-768x312.png 768w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-2-1536x623.png 1536w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-2-1920x779.png 1920w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-2.png 1999w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption><strong>[Figure 2] Results of the Tel Aviv study on synthetic-data-based AI diagnostics<\/strong><\/figcaption><\/figure><\/div>\n\n\n\n<div style=\"height:40px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-6 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<h1 class=\"wp-block-heading\">Second, it solves data usability problems through data privacy protection.<\/h1>\n\n\n\n<p style=\"font-size:18px\">GANs can also be used to obtain training data that is easy to collect and label but difficult to use. On July 14, 2020, the Korean government announced the \u2018Korean New Deal Comprehensive Plan\u2019<strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[v]<\/span><\/strong>. One of the 10 major tasks of this Korean version of the New Deal is a \u2018data dam\u2019. A data dam stores a wide range of data in a &#8216;dam&#8217; and makes it available where it is needed. Currently, \u2018de-identification\u2019 plays the role of the dam&#8217;s &#8216;waterway&#8217;. 
Even at this moment, an incalculable amount of data is being collected from the CCTVs and vehicle cameras installed everywhere. To use large-scale data containing personal information for industry and academic research without restrictions, both the unstructured data (face, voice, etc.) and the structured data (name, address, resident registration number, etc.) need to be anonymized.<\/p>\n\n\n\n<p style=\"font-size:18px\">However, data processed with traditional methods that focus on removing personal information (blurring, pixelation, etc.) cannot be used for AI research (individual detection\/identification, abnormal behavior\/situation recognition, emotion recognition, etc.). It is therefore necessary to anonymize data irreversibly, so that individuals cannot be recognized by the human eye while the data can still be used to train and test AI with minimal performance degradation.<\/p>\n\n\n\n<p style=\"font-size:18px\">Recently, de-identification through face synthesis has been emerging. GAN-based image synthesis technology <strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[vi]<\/span><\/strong> such as DeepFake de-identifies personal information by superimposing a virtual face on an original photo or video. Unlike existing methods (blurring, pixelation, etc.), it preserves key information that can be meaningful for R&amp;D, such as the expression and pose of the original person, while replacing the original person in the image with a newly synthesized one. Researchers at the Technical University of Munich demonstrated CIAGAN (Conditional Identity Anonymization Generative Adversarial Networks) <strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[vii]<\/span><\/strong> technology at CVPR 2020. 
In addition to synthesizing a person with a new identity, it improved the usability of the data through a de-identification process designed to selectively adjust key personal attributes (age, gender, body characteristics, etc.).<\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-7 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"602\" height=\"213\" src=\"https:\/\/blog.testworks.co.kr\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-3.png\" alt=\"\" class=\"wp-image-1549\" srcset=\"https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-3.png 602w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-3-300x106.png 300w\" sizes=\"auto, (max-width: 602px) 100vw, 602px\" \/><figcaption><strong>[Figure 3] CIAGAN model from the Technical University of Munich (CVPR 2020)<\/strong><\/figcaption><\/figure><\/div>\n\n\n\n<p><\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-8 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-9 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<p><\/p>\n\n\n\n<p style=\"font-size:18px\">In the Testworks R&amp;D lab, I help professors, researchers, and CEOs from industry, academia, and private and public institutions who struggle to secure 
data sets for AI training, by generating synthetic data for them. Listening to \u201cthe Voice of Customers\u201d in the industry\/research field, we provide:<\/p>\n\n\n\n<ol class=\"wp-block-list\" type=\"1\" style=\"font-size:18px\"><li>the minimum amount of real data required for synthetic data generation<\/li><li>the time required for data synthesis<\/li><li>the synthetic data itself<\/li><li>a quantitative analysis of the synthesized data (FID, PSNR, SSIM, etc.)<\/li><\/ol>\n\n\n\n<p><\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-10 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<p><\/p>\n\n\n\n<p style=\"font-size:18px\">Recently, I have been examining, discussing, and researching the following technologies with my team members.<\/p>\n\n\n\n<ul class=\"wp-block-list\" style=\"font-size:18px\"><li>Domain Adaptation: Technology for synthesizing data for special situations (snow\/rain\/night\/frost\/dust\/moisture, etc.)<\/li><li>Super Resolution: Technology for improving data quality by increasing the resolution of data<\/li><li>Semantic Synthesis: Technology for additionally synthesizing a specific desired object<\/li><li>Image Inpainting: Technology that removes a specific area within an image and fills it in so that the result looks natural<\/li><\/ul>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-11 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"850\" height=\"404\" src=\"https:\/\/blog.testworks.co.kr\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-4.png\" alt=\"\" class=\"wp-image-1551\" 
srcset=\"https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-4.png 850w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-4-300x143.png 300w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-4-768x365.png 768w\" sizes=\"auto, (max-width: 850px) 100vw, 850px\" \/><figcaption><strong>[Figure 4] Result of Domain Adaptation experiment under study at Testworks<\/strong><\/figcaption><\/figure><\/div>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"452\" src=\"https:\/\/blog.testworks.co.kr\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-5.png\" alt=\"\" class=\"wp-image-1552\" srcset=\"https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-5.png 1000w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-5-300x136.png 300w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/20210820_The-Importance-of-GAN-in-Creating-AI-Training-Datasets_Figure-5-768x347.png 768w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><figcaption><strong>[Figure 5] Super Resolution test result under study at Testworks<\/strong><\/figcaption><\/figure><\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-12 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<p style=\"font-size:18px\">Some people are concerned about the various side 
effects of GANs, which blur the boundary between real and fake. Others cite the saying \u2018There is more than one way to skin a cat\u2019 and embrace the advantages of fake data. I expect that GANs, used appropriately, will not only revitalize the data ecosystem but also serve as a catalyst for building datasets for training AI.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-13 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-14 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-15 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<p><\/p>\n\n\n\n<div style=\"height:40px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<hr class=\"wp-block-separator is-style-wide\"\/>\n\n\n\n<p class=\"has-normal-font-size\"><strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[i]<\/span><\/strong> IAC (International Astronautical Congress):&nbsp;<a href=\"https:\/\/www.iafastro.org\/events\/iac\/\">https:\/\/www.iafastro.org\/events\/iac\/<\/a><\/p>\n\n\n\n<p class=\"has-normal-font-size\"><strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[ii]<\/span><\/strong>&nbsp;Gartner Top 10 Strategic Technology Trends for 2020:&nbsp;<a href=\"https:\/\/www.gartner.com\/smarterwithgartner\/gartner-top-10-strategic-technology-trends-for-2020\/\">https:\/\/www.gartner.com\/smarterwithgartner\/gartner-top-10-strategic-technology-trends-for-2020\/<\/a><\/p>\n\n\n\n<p class=\"has-normal-font-size\"><strong><span class=\"has-inline-color 
has-vivid-cyan-blue-color\">[iii]<\/span><\/strong>&nbsp;Goodfellow, Ian J., et al. \u201cGenerative adversarial networks.\u201d&nbsp;<em>arXiv preprint arXiv:1406.2661<\/em>&nbsp;(2014)<\/p>\n\n\n\n<p class=\"has-normal-font-size\"><strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[iv]<\/span><\/strong>&nbsp;Frid-Adar, Maayan, et al. \u201cGAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification.\u201d&nbsp;<em>Neurocomputing<\/em>&nbsp;321 (2018): 321-331.<\/p>\n\n\n\n<p class=\"has-normal-font-size\"><span class=\"has-inline-color has-vivid-cyan-blue-color\"><strong>[v]<\/strong><\/span>&nbsp;<a href=\"http:\/\/www.knewdeal.go.kr\/\">http:\/\/www.knewdeal.go.kr\/<\/a><\/p>\n\n\n\n<p class=\"has-normal-font-size\"><strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[vi]<\/span><\/strong>&nbsp;Rossler, Andreas, et al. \u201cFaceforensics++: Learning to detect manipulated facial images.\u201d&nbsp;<em>Proceedings of the IEEE\/CVF International Conference on Computer Vision<\/em>. 2019.<\/p>\n\n\n\n<p class=\"has-normal-font-size\"><strong><span class=\"has-inline-color has-vivid-cyan-blue-color\">[vii]<\/span><\/strong>&nbsp;Maximov, Maxim, Ismail Elezi, and Laura Leal-Taix\u00e9. \u201cCiagan: Conditional identity anonymization generative adversarial networks.\u201d&nbsp;<em>Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition<\/em>. 
2020.<\/p>\n\n\n\n<hr class=\"wp-block-separator is-style-wide\"\/>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-19 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-16 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-17 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-18 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-media-text alignwide is-stacked-on-mobile\" style=\"grid-template-columns:20% auto\"><figure class=\"wp-block-media-text__media\"><img loading=\"lazy\" decoding=\"async\" width=\"696\" height=\"514\" src=\"https:\/\/blog.testworks.co.kr\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/\uae40\ud615\ubcf5_\ucc45\uc784\uc5f0\uad6c\uc6d0_\ud14c\uc2a4\ud2b8\uc6cd\uc2a4.jpg\" alt=\"\" class=\"wp-image-1568 size-full\" srcset=\"https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/\uae40\ud615\ubcf5_\ucc45\uc784\uc5f0\uad6c\uc6d0_\ud14c\uc2a4\ud2b8\uc6cd\uc2a4.jpg 696w, https:\/\/blog.aiworkx.ai\/en\/wp-content\/uploads\/sites\/3\/2021\/08\/\uae40\ud615\ubcf5_\ucc45\uc784\uc5f0\uad6c\uc6d0_\ud14c\uc2a4\ud2b8\uc6cd\uc2a4-300x222.jpg 300w\" sizes=\"auto, (max-width: 696px) 100vw, 696px\" \/><\/figure><div class=\"wp-block-media-text__content\">\n<p><strong>Hyeongbok<\/strong> <strong>Kim <\/strong><\/p>\n\n\n\n<p><strong>Senior Researcher, AI Model Development Team<\/strong><\/p>\n\n\n\n<p>Harbin 
Institute of Technology, Computer Science and Technology, PhD program<\/p>\n\n\n\n<p>He returned to Korea due to COVID-19 while conducting AI research and currently works on the Testworks AI development team. He is interested in contributing to society through technology.<\/p>\n<\/div><\/div>\n\n\n\n<p><\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-20 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-21 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-22 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\"><\/div>\n<\/div>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Solving the problem of sparse data through quantitative and qualitative advancement of 
data<\/p>\n","protected":false},"author":1,"featured_media":1580,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4],"tags":[23,26,428,430,432,84,433,425,426,427,429,431,138],"class_list":["post-1546","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-insight","tag-ai-data-2","tag-ai-data-processing","tag-ai-model-development","tag-augmentation","tag-data-collection","tag-data-labeling","tag-data-processing","tag-gan","tag-generative-adversarial-network","tag-hyeongbok-kim","tag-privacy-protection","tag-synthesis","tag-testworks"],"_links":{"self":[{"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/posts\/1546","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/comments?post=1546"}],"version-history":[{"count":22,"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/posts\/1546\/revisions"}],"predecessor-version":[{"id":1836,"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/posts\/1546\/revisions\/1836"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/media\/1580"}],"wp:attachment":[{"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/media?parent=1546"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/categories?post=1546"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.aiworkx.ai\/en\/wp-json\/wp\/v2\/tags?post=1546"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}