TechCrunch News 01月28日
Viral AI company DeepSeek releases new image model family
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

DeepSeek公司发布了名为Janus Pro的全新多模态AI模型,该系列模型参数量从10亿到70亿不等,并声称其性能超越了OpenAI的DALL-E 3。这些模型可在Hugging Face平台下载,采用MIT许可证,允许商业用途。Janus Pro采用自回归框架,既能分析也能生成图像,在GenEval和DPG-Bench基准测试中,最大的Janus Pro 7B模型表现优于DALL-E 3等其他模型。尽管Janus Pro目前仅支持384x384分辨率的小尺寸图像,但考虑到其紧凑的模型大小,其性能依然令人印象深刻。DeepSeek的语言模型因其高效的计算技术,引发了人们对美国在AI领域领先地位以及AI芯片需求可持续性的讨论。

🚀DeepSeek发布Janus Pro系列多模态AI模型,参数量从10亿到70亿,性能据称超越OpenAI的DALL-E 3。

🖼️Janus Pro采用“自回归框架”,可分析并生成图像,在GenEval和DPG-Bench测试中,Janus Pro 7B模型表现突出,胜过DALL-E 3等模型。

⚖️尽管Janus Pro目前仅支持384x384小尺寸图像,但其紧凑的模型大小和性能表现,使其成为下一代统一多模态模型的有力竞争者。

💰DeepSeek的语言模型采用高效计算技术,引发了人们对美国AI领导地位及AI芯片需求的讨论,其背后有量化交易公司High-Flyer Capital Management的资金支持。

DeepSeek, the viral AI company, has released a new set of multimodal AI models that it claims can outperform OpenAI’s DALL-E 3.

The models, which are available for download from the AI dev platform Hugging Face, are a part of a new model family that DeepSeek is calling Janus Pro. They range in size from 1 billion parameters to 7 billion parameters. Parameters roughly correspond to a model’s problem-solving skills, and models with more parameters generally perform better than those with fewer parameters.

Janus Pro is under an MIT license, meaning it can be used commercially without restriction.

Image outputs from DeepSeek’s Janus Pro models.Image Credits:DeepSeek

Janus Pro, which DeepSeek describes as a “novel autoregressive framework,” can both analyze and create new images. According to the company, on two AI evaluation benchmarks, GenEval and DPG-Bench, the largest Janus Pro model, Janus Pro 7B, beats DALL-E 3 as well as models such as PixArt-alpha, Emu3-Gen, and Stability AI‘s Stable Diffusion XL.

Some of those models are on the older side, granted. And Janus Pro can only analyze and generate small images — images 384 x 384 in resolution. But the Janus Pro family’s performance is impressive, considering the models’ compact sizes.

“Janus Pro surpasses previous unified model and matches or exceeds the performance of task-specific models,” DeepSeek writes in a post on Hugging Face. “The simplicity, high flexibility, and effectiveness of Janus Pro make it a strong candidate for next-generation unified multimodal models.”

DeepSeek’s new Janus Pro models compared with the competition.Image Credits:DeepSeek

DeepSeek, a Chinese AI lab funded largely by the quantitative trading firm High-Flyer Capital Management, broke into the mainstream consciousness this week after its chatbot app rose to the top of the Apple App Store charts. DeepSeek’s language models, which were trained using compute-efficient techniques, have led many Wall Street analystsand technologists — to question whether the U.S. can maintain its lead in the AI race, and whether the demand for AI chips will sustain.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

DeepSeek Janus Pro 多模态AI DALL-E 3 人工智能
相关文章