[Generative AI] Generative AI | Capabilities of Generative AI

Recent Posts

Recent Comments

Tags more

Archives

Today

Total

Code&Data Insights

[Generative AI] Generative AI | Capabilities of Generative AI 본문

Artificial Intelligence/Natural Language Processing

[Generative AI] Generative AI | Capabilities of Generative AI

paka_corn 2024. 3. 23. 23:25

Generative AI

: 트레이닝 데이터를 기반으로 다양한 포맷(text, images, code, audio, video)의 새로운 컨텐츠를 생성해주는 AI 모델.

Generative AI models are trained on substantial datasets of existing content and learn to generate new content similar to the data they were trained on.

- Generative AI can start from the prompt,

prompt can be Text/ Image/ Video/etc and then,

it generates a new content as text/images/audio/video/code/data.

- we use generative AI tools

for increase productivity / add tangible value to work / save money / maximize the brand value

- building blocks of generative AI include GANs, VAEs, transformers, and diffusion models

Generative AI Models

Large Language Models : Process and generate text

1) ChatGPT- text generation

2) DALL-E- image generation

3) Synthesia - video generation

4) Copilot - code generation

Capabilities of Generative AI

1) Text generation

LLMs

- generate human-like text

- learn patterns and structures from data sets

- generate coherent and contextually relevant

=> text completion, responses, conversation, explanations, summarization

=> Question answering, translation, image and text pairing, conversational interactions

(ex) ChatGPT(OpenAI), PaLM(Google)

2) Image generation

Leveraging Deep Learning techniques

- GANs, VAEs

=> StyleGAN : High quality resolution novel images

=> DeepArt : Complex and detailed artwork from a sketch

=> Novel images based on textual descriptions

Applications

- Training data

- Medical imaging

- Scientific visualization

3) Audio generation

- Musical compositions

- Text-to-speech (TTS) audio

- Synthetic voices

- Natural-sounding speech

=> WaveGAN : raw audio waveforms , realistic sounds

=> OpenAI's MuseNet : original music in various genres and instrumentations

=> Google's Tacotron2, Mozilla TTS : Advanced TTS, Highly realistic synthetic speech(tone, pitch, pronounciation)

4) Video generation

- Create basic animations to complex scenes

- Transform images into dynamic videos

- Incorporate temporal coherence

- Exhibit smooth motion and transitions

=> Textual prompt-based new videos

=> Users can specify desired content

=> Users guide video generation process

5) Code generation

- Code Snippets

- Functions

- Complete programs

- debug

=> Gihub Copilot, IBM Watson Code Assistant

6) Data generation and augmentation

- Generate new data(samples)

=> Augment existing datasets (image/text/tabular data/statistical distribution)

=> Increase the diversity and variability of data

7) Virtual worlds

- Virtual avatars simulating (realistic behaviours)

- Complex virtual environment (realistic textures)

=> Metaverse

Reference : [IBM|Coursera] Generative AI: Introduction and Applications

'Artificial Intelligence > Natural Language Processing' 카테고리의 다른 글

[The Gradient] Car-GPT: Could LLMs finally make self-driving cars happen? (0)	2024.04.01
[Generative AI] Generate Text using Generative AI (ChatGPT-3.5) (1)	2024.03.26
[NLP] Large Language Model (LLM) (0)	2024.03.23
[Deep NLP] Attention \| Transformer (0)	2023.12.26
[Deep NLP] Word Embeddings \| Word2Vec (1)	2023.12.26

'Artificial Intelligence/Natural Language Processing' Related Articles

Comments

Code&Data Insights

[Generative AI] Generative AI | Capabilities of Generative AI 본문

[Generative AI] Generative AI | Capabilities of Generative AI

Generative AI

Generative AI Models

Capabilities of Generative AI

'Artificial Intelligence > Natural Language Processing' 카테고리의 다른 글

티스토리툴바