Building a scene-specific synthetic data generator with Omniverse Replicator
Päivämäärä
2024Tekijänoikeudet
© The Author(s)
In today’s world of AI, the amount of training data is a critical factor in the success of model training. Especially in cases where data acquisition is difficult due to rare occurrence of events or annotation cost, synthetic data can be used to supplement data needs. In computer vision, some tasks require pixel-wise annotation which, if done by hand, is labor intensive and error-prone. In this study, we use eDSR methodology to design and evaluate a synthetic data generator, to serve as a reference generator for those who seek to start synthetic visual data generation from scratch. A generator, combining an Omniverse Replicator Python script and 3D assets, is developed and the quality of the synthetic data outputs is measured by training three different neural networks to predict segmentation masks from a real-world scene. In addition to the generator, a model of scene-specific synthetic data generation pipeline is presented, to complement the reference generator as a source of knowledge for newcomers in the field. Two major processes in synthetic data generator building are observed to be domain gap bridging and domain randomization. Domain gap bridging aims to increase the visual similarity in the synthetic scene and the real world, while domain randomization aims to increase the data distribution. Because the main benefit of synthetic data is minimal annotation cost, the optimization of generation speed should be integrated in the development process. The Python code developed is available in: https://github.com/jkuhno/reference-SDGenerator
...
Metadata
Näytä kaikki kuvailutiedotKokoelmat
- Pro gradu -tutkielmat [29559]
Lisenssi
Samankaltainen aineisto
Näytetään aineistoja, joilla on samankaltainen nimeke tai asiasanat.
-
Rapid automatized naming and learning disabilities : does RAN have a specific connection to reading or not : a replication
Heikkilä, Riikka (2006)The aim of this study was to replicate the study of Waber, Wolff, Forbes, and Weiler (2000), in which the specificity of naming speed deficits (NSD) to reading disability (RD) was examined. 193 children (ages 8 to 11) ... -
Using deep learning to generate synthetic B-mode musculoskeletal ultrasound images
Cronin, Neil J.; Finni, Taija; Seynnes, Olivier (Elsevier, 2020)Background and Objective Deep learning approaches are common in image processing, but often rely on supervised learning, which requires a large volume of training images, usually accompanied by hand-crafted labels. As ... -
On dynamics of parvoviral replication protein NS1
Niskanen, Einari (University of Jyväskylä, 2010) -
Lack of evidence of mimivirus replication in human PBMCs
Abrahão, Jônatas; Silva, Lorena; Oliveira, Danilo; De Freitas Almeida, Gabriel (Elsevier Masson, 2018)The Acanthamoeba polyphaga mimivirus (APMV) was first isolated during a pneumonia outbreak in Bradford, England, and since its discovery many research groups devoted efforts to understand whether this virus could be ... -
Commentary: Misguided Effort with Elusive Implications, and Sifting Signal from Noise with Replication Science
Hagger, Martin; Chatzisarantis, Nikos L. D. (Frontiers Research Foundation, 2016)
Ellei toisin mainittu, julkisesti saatavilla olevia JYX-metatietoja (poislukien tiivistelmät) saa vapaasti uudelleenkäyttää CC0-lisenssillä.