OpenAI has introduced a new version of its text-to-image tool, DALL-E. The new version uses ChatGPT, OpenAI’s widely used AI chatbot, to simplify the process of writing image prompts.
Modern AI image generators turn text descriptions, or prompts, into images in a wide range of styles, from photorealistic to fantastical. Writing an effective prompt can be difficult, however, to the point that “prompt engineering” has emerged as a profession in its own right.
OpenAI’s latest offering, DALL-E 3, integrates ChatGPT to help refine prompts. Subscribers to OpenAI’s premium ChatGPT plans, ChatGPT Plus and ChatGPT Enterprise, can enter a request for an image, fine-tune it in conversation with the chatbot, and receive the results directly within the chat application.
ChatGPT can enhance prompts, even when they consist of just a few words, by making them more descriptive and offering additional guidance to the DALL-E 3 model.
The ChatGPT integration is not the only enhancement in DALL-E 3. OpenAI reports that DALL-E 3 generates higher-quality images that more closely match the prompt, particularly when the prompt is long.
It also performs better on content that has historically been difficult for image-generating models, such as text rendered within images and depictions of human hands.
Beyond these improvements, DALL-E 3 incorporates novel mechanisms aimed at mitigating algorithmic bias and enhancing safety. For instance, DALL-E 3 will reject requests that seek images resembling the work of living artists or featuring public figures.
Additionally, artists can now opt out of having specific pieces, or all of their artwork, used to train future iterations of OpenAI’s text-to-image models. (OpenAI, along with some of its competitors, faces legal scrutiny for allegedly using artists’ copyrighted work to train its generative AI image models.)
The launch of DALL-E 3 occurs amidst intensifying competition in the field of generative AI, particularly in the domain of image synthesis. Competitors like Midjourney and Stability AI continue to refine their image-generating models, increasing the competitive pressure on OpenAI to maintain its leadership.
OpenAI plans to release DALL-E 3 to premium ChatGPT users first, in October, and then to research laboratories and API customers. The company has not said whether or when it will release a free web tool like those that accompanied DALL-E 2 and the original DALL-E model.
from Firstpost Tech Latest News https://ift.tt/MRgo24O