• Skip to navigation
  • Skip to content

logo

  • Company

    Connecting every possibility
    with technology and services

    • About
    • Affiliated Companies
    • Our History
    • Brand Resources
    • Partnerships
    • Contact
    NAVER 사옥
  • Service

    From Search to Discovery
    On-Service AI

    • Featured Services
    • Portal
    • Tools
    • Search
    • Advertisement
    • Commerce
    • Cloud
    • Fintech
    • 1784
    • Contents
    • Data Center GAK
    • Community
    • 전체 서비스
    • Map
    NAVER 서비스
  • Tech

    Everyday Powered by Innovation
    Everyday Tech

    • Innovation
    • HyperCLOVA X
    • Spatial AI
    • Robotics
    • Immersive Media
    NAVER 기술
  • ESG

    Better Future
    connected by NAVER

    • NAVER Sustainability
    • Sustainable Management
    • Social
    • Tech for People
    • Environment
    • Principle
    • ESG Resources
    NAVER 지속가능성
  • IR

    IR

    • IR Updates
    • Corporate Governance
    • IR Calendar
    • Financial Information
    • Financial Information
    • IR Resources
    NAVER 투자정보
  • Media

    Media

    • News
    • Media Events
    • NAVER Reports
    NAVER 뉴스룸
  • Story

    NAVER Story

    All
  • Careers
통합검색 입력 폼
  • 한눈에 보는 네이버 전체 서비스 소개

    한눈에 보는 네이버
    전체 서비스 소개

  • 네이버 로고 아이덴티티 브랜드 리소스

    네이버 로고 아이덴티티
    브랜드 리소스

  • 5,400만+ 유저를 고객으로 네이버 광고 검색 상품

    5,400만+ 유저를 고객으로
    네이버 광고 검색 상품

  • NAVER Auunal Report ESG Library

    한눈에 보는 네이버
    전체 서비스 소개

  • NAVER Brand Resource Logo and color

    네이버 로고 아이덴티티
    브랜드 리소스

  • NAVER MAP Connecting online and offline

    5,400만+ 유저를 고객으로
    네이버 광고 검색 상품

logo
logo
  • Company
    • About
    • Affiliated Companies
    • Our History
    • Brand Resources
    • Partnerships
    • Contact
  • Service
    • Featured Services
    • Portal
    • Tools
    • Search
    • Advertisement
    • Commerce
    • Cloud
    • Fintech
    • 1784
    • Contents
    • Data Center GAK
    • Community
    • 전체 서비스
    • Map
  • Tech
    • Innovation
    • HyperCLOVA X
    • Spatial AI
    • Robotics
    • Immersive Media
  • ESG
    • NAVER Sustainability
    • Sustainable Management
    • Social
    • Tech for People
    • Environment
    • Principle
    • ESG Resources
  • IR
    • IR Updates
    • Corporate Governance
    • IR Calendar
    • Financial Information
    • Financial Information
    • IR Resources
  • Media
    • News
    • Media Events
    • NAVER Reports
  • Story
  • Careers
Tech

NAVER Unveils HyperCLOVA X–based Image and Speech Processing Technology, Advancing to “Multimodal Generative AI”

2024.08.22
공유하기

NAVER Unveils HyperCLOVA X–based Image and Speech Processing Technology, Advancing to “Multimodal Generative AI”

공유하기

NAVER Unveils HyperCLOVA X–based Image and Speech Processing Technology, Advancing to “Multimodal Generative AI”

- From inferring situations in photos to analyzing tables and graphs, it is also possible to solve math shape problems, expanding the scope of CLOVA X as a productivity enhancement tool

- HyperCLOVA X–based voice multimodal technology has also been introduced on the Tech Blog: featuring natural conversation powered by a large language model

August 22, 2024

NAVER’s conversational AI agent, CLOVA X, will add visual information processing capabilities through a service update on the 27th. In addition, NAVER unveiled generative AI–based speech synthesis technology through the Tech Blog of CLOVA’s official website on the 20th. NAVER is advancing its competitiveness in generative AI technology by upgrading its HyperCLOVA X model to a “multimodal” AI that can process not only text but also images and voice simultaneously.

From inferring situations in photos to analyzing tables and graphs, recognizing products, and explaining their contents—expanding the scope of CLOVA X as a productivity enhancement tool

CLOVA X’s image understanding feature has been updated, enabling users to interact with AI based on information extracted from images uploaded to the CLOVA X dialog and queries entered. CLOVA X is capable of performing various tasks, such as describing phenomena in photos or inferring situations. It can also understand and analyze tables and graphs in the form of images or pictures. It is expected to be used for logical writing, code writing, translation, and other tasks and will be further utilized as a productivity enhancement tool based on its image understanding ability.

In particular, NAVER’s excellent know-how in AI-based document processing and character recognition technology, combined with HyperCLOVA X, a large language model (LLM) knowledgeable in various fields, will provide more accurate and reliable services. After it received 1,480 questions from the Republic of Korea GED exam in the form of images, which it was made to solve, CLOVA X showed a correct answer rate of about 84%, higher than the 78% rate of the OpenAI GPT-4o.

HyperCLOVA X–based voice multimodal technology has also been introduced on the Tech Blog: featuring natural conversation powered by a large language model

In addition, NAVER unveiled its HyperCLOVA X–based voice AI technology through the Tech Blog on CLOVA’s official website on the 20th. More advanced than the existing speech recognition and speech synthesis technology, this model utilizes the superior contextual understanding and directive interpretation capabilities of a LLM to improve language structure and pronunciation accuracy, as well as emotional expression.

NAVER, which has proven its technological competitiveness with various voice AI services such as “CLOVA Note” for voice recording, “CLOVA CareCall” for AI call support for the elderly, and “CLOVA Dubbing” for AI voice synthesis, is looking to provide more convenient services through its voice multimodal LLM technology. On its Tech Blog, NAVER presented the possibility of combining various services with a voice multimodal LLM, such as real-time voice translation, language learning, and counseling.

“HyperCLOVA X, which started as a LLM, is evolving into a large vision language model with image understanding capabilities and, finally, a voice multimodal LLM,” said Sung Nako, Head of Hyperscale AI at NAVER CLOUD. “We will introduce HyperCLOVA X’s advanced capabilities to various NAVER services, including CLOVA X, a conversational AI agent, to create new user value and offer it as an enterprise AI solution, further expanding the HyperCLOVA X ecosystem.”

Meanwhile, NAVER will actively practice “AI safety” in the process of upgrading HyperCLOVA X to a multimodal LLM and applying it to its services. Building on its AI Safety Framework (ASF), which was unveiled in June and evaluates the potential risks of AI systems, NAVER plans to continue to review voice AI technology, in particular, to provide safer services.

​

​

HyperCLOVA XMultimodalNAVER
Download all images
Read More

Related content

  • NAVER 2026.03.31
    NAVER D2SF Makes Follow-On Investment in Soundable Health, an AI-Driven Health Tech Startup Scaling in the U.S. Market
    NAVERNAVER D2SF
  • NAVER 2026.03.31
    NAVER D2SF Makes Follow-On Investment in Nuvilab, AI Nutrition Analytics Startup
    NAVERNAVER D2SF
  • NAVER 2026.03.10
    NAVER D2SF Invests in Anyware Robotics, a Physical AI Startup Starting from Logistics Automation
    NAVERNAVER D2SF
  • NAVER 2026.03.10
    NAVER D2SF Invests in Physical AI Startup ‘Khameleon’
    NAVERNAVER D2SF
  • NAVER 2026.02.05
    NAVER D2SF Invests in Cashmere, a Data Infrastructure Platform Bridging Premium Content and AI
    NAVERNAVER D2SF
  • NAVER 2026.02.05
    NAVER D2SF Invests in AI Workflow Automation Platform ‘CNAPS.AI’
    NAVERNAVER D2SF
Previous slide
Next slide
We the Navigators
  • Partner Support
    • Naver Advertisement
    • Naver Smartstore
    • Naver Smartplace
    • Naver Business School
    • Naver Impact
    • SME Support
  • Developer Support
    • NAVER Developers
    • Open API
    • Opensource
    • NAVER D2
    • NAVER D2SF
  • Resource Center
    • IR Resources
    • ESG Resources
    • NAVER Reports
    • Brand Resources
  • Major Affiliates
    • NAVER CLOUD
    • SNOW
    • NAVER LABS
    • NAVER WEBTOON
    • NAVER FINANCIAL
  • blog link
  • naverTV link
  • instagram link
  • youtube link
  • ffinicial link
  • Contact
  • Partnerships
  • 고객센터
  • Integrity Line Integrity Line
  • 개인정보 처리방침
  • 이용약관
  • 운영정책
  • Contact
  • Partnerships
  • Integrity Line

©NAVER CORP.