Latest news with #Qwen-VL

Alibaba launches Qwen-VLo to rival ChatGPT-4o in AI image generation

India Today

a day ago

Business
India Today

Alibaba launches Qwen-VLo to rival ChatGPT-4o in AI image generation

Chinese tech company Alibaba has announced its new AI model, Qwen-VLo, which aims to take on rivals like ChatGPT-4o in the area of image generation. This new model can understand user instructions more accurately and generate high-quality images based on that understanding. The company revealed details of the model in a blog its previous image-focused models such as Qwen-VL, the newly introduced Qwen-VLo is said to be much better at handling complex prompts and producing precise results. One of the major improvements is that it can make specific changes to images — like changing colours or backgrounds — without altering unrelated parts of the image. This was a common problem with earlier versions, where minor edits often led to unnecessary changes in the overall is designed to understand the context behind a user's request. So, if a user asks for an image to resemble a certain weather condition or be drawn in a particular art style, the model can respond accordingly. It can even create images that look like they belong to a certain time period, which gives it the flexibility to be used for creative tasks. The model also supports multiple languages apart from Chinese and English, making it more useful to users across different regions. While the full list of supported languages has not been revealed, the addition signals Alibaba's intention to reach a wider global key feature that sets Qwen-VLo apart is its ability to take in more than one image at a time. In simple terms, users can upload different objects or elements and ask the model to combine them. For example, a user can upload a picture of a basket and separate images of products like soap or shampoo and ask the AI to place those items inside the basket. This feature, however, is still in development and hasn't been made fully available also gives users the ability to resize images into various formats — including square, portrait, and widescreen — using dynamic resolution training. The images are created step-by-step from top to bottom and left to right, which helps with better control and accuracy during has pointed out that the model is currently in its early stage, and users might experience some issues like inconsistency or results that don't fully match the instructions. However, the company says improvements are ongoing. It is also exploring the use of image segmentation and detection maps to improve the model's understanding of objects and scenes within an company believes that in the future, AI models like Qwen-VLo could be capable of not just generating beautiful images, but also expressing ideas and emotions through visuals.- Ends

Alibaba releases Qwen-VLo, its latest AI image model rivaling OpenAI's GPT-4o

Indian Express

2 days ago

Business
Indian Express

Alibaba releases Qwen-VLo, its latest AI image model rivaling OpenAI's GPT-4o

Alibaba has launched a new AI image generation model called Qwen-VLo that is said to have the ability to understand context and generate images based on that understanding. 'Today, we are excited to introduce a new model, Qwen VLo, a unified multimodal understanding and generation model. This newly upgraded model not only 'understands' the world but also generates high-quality recreations based on that understanding, truly bridging the gap between perception and creation,' the company said in a blog post published on June 26. Unlike previous Alibaba models such as Qwen-VL, Qwen-VLo can offer the user more detailed images with significantly more accuracy. While previous models altered unrelated details within the image when the user requested only minor changes (such as colour), Qwen-VLo is able to preserve the original structure of the image and make the requested changes to it, as per the e-commerce giant. The model is also able to understand open-ended requests, such as artistic style, weather changes, or even making the image bear resemblance to a specific time period. Alibaba also announced that the model would support multiple languages besides Chinese and English. One of the model's notable features is Multiple Image Input. The model takes existing images provided by the user, alters the text within them, and is even able to manipulate them to become part of the generated image. For instance, in an example given by the company, the user provided images of individual bathing products and a basket, then asked Qwen-VLo to put the products into the basket. However, this feature has not been officially rolled out within the model yet. Qwen-VLo makes use of dynamic resolution training, allowing the user to re-size their images as per required dimensions, including 1:1, 3:4, and 16:9. The model also uses a progressive top-to-bottom, left-to-right generation process, which helps in tasks requiring fine control. However, in its blog post, the company has said that the model is still in the preview stage and users could encounter errors such as inconsistency and non-compliance. The company further theorised that its AI models could be capable of conveying ideas and meanings through the images it creates in the future. Alibaba also proposed model generating segmentation/ detection maps to further improve the performance of Qwen-VLo. Widely known for its e-commerce business in China, Alibaba has thrown its hat into the AI race. The company's CEO, Eddie Wu, even said that Alibaba is now fully focused on AI model development and aims to build AI systems with human-level intellectual capabilities. (This article has been curated by Purv Ashar, who is an intern with The Indian Express)

Latest news with #Qwen-VL

Alibaba launches Qwen-VLo to rival ChatGPT-4o in AI image generation

Alibaba releases Qwen-VLo, its latest AI image model rivaling OpenAI's GPT-4o

Get Started Now: Download the App