Spring Boot與百度AI語音識別API集成實踐

作者：編程疏影 2024-05-31 08:12:19

通過這篇文章，我們詳細介紹了如何在Spring Boot 3.x項目中集成百度AI語音識別API。我們探討了API的特點、配置方法、創建REST API以實現語音識別功能、以及優化和調試的最佳實踐。

本專題系統講解了如何利用SpringBoot集成音頻識別技術，涵蓋了從基礎配置到復雜應用的方方面面。通過本文，讀者可以了解到在智能語音填單、智能語音交互、智能語音檢索等場景中，音頻識別技術如何有效提升人機交互效率。無論是本地存儲檢索，還是云服務的集成，豐富的應用實例為開發者提供了全面的解決方案。繼續深入研究和實踐這些技術，將有助于推動智能應用的廣泛普及和發展，提升各類業務的智能化水平。

Spring Boot與百度AI語音識別API集成實踐

百度AI語音識別API是目前國內領先的語音識別服務之一，具備以下幾個顯著特點：

高準確率：依托百度大規模的語音庫和深度學習技術，能夠提供高準確率的語音識別結果。
多場景應用：支持遠場、近場、多語種等多種場景的語音識別應用，覆蓋電話客服、語音助手、智能音箱等多種應用場景。
靈活接入：提供HTTP接口，方便開發者在各種語言和框架中集成。
實時性：支持實時語音識別，對于需要實時反饋的應用場景非常適用。

配置并對接百度AI語音識別API

要使用百度AI語音識別API，首先需要在百度AI開放平臺上注冊賬號并創建應用，獲取API Key和Secret Key。

獲取API Key和Secret Key：

登錄百度AI開放平臺，創建一個語音識別應用，記錄下分配的API Key和Secret Key。

Spring Boot項目配置：

在項目的application.properties文件中添加以下配置：

baidu.ai.appId=your_app_id
   baidu.ai.apiKey=your_api_key
   baidu.ai.secretKey=your_secret_key

配置百度AI客戶端：

需要引入百度AI的SDK，創建一個配置類來初始化客戶端。

引入依賴：
在pom.xml文件中添加百度語音識別SDK依賴。

<dependency>
       <groupId>com.baidu.aip</groupId>
       <artifactId>java-sdk</artifactId>
       <version>4.15.1</version>
   </dependency>

創建配置類：

import com.baidu.aip.speech.AipSpeech;
   import org.springframework.beans.factory.annotation.Value;
   import org.springframework.context.annotation.Bean;
   import org.springframework.context.annotation.Configuration;

   @Configuration
   public class BaiduAIConfig {

       @Value("${baidu.ai.appId}")
       private String appId;

       @Value("${baidu.ai.apiKey}")
       private String apiKey;

       @Value("${baidu.ai.secretKey}")
       private String secretKey;

       @Bean
       public AipSpeech aipSpeech() {
           AipSpeech client = new AipSpeech(appId, apiKey, secretKey);
           // 可選：設置連接超時參數
           client.setConnectionTimeoutInMillis(2000);
           client.setSocketTimeoutInMillis(60000);
           return client;
       }
   }

創建語音識別和轉換功能的REST API

接下來，我們將創建一個REST API，用于接收語音并通過百度AI語音識別API進行轉換。

創建Spring Boot主類：

import org.springframework.boot.SpringApplication;
   import org.springframework.boot.autoconfigure.SpringBootApplication;

   @SpringBootApplication
   public class SpeechRecognitionApplication {
       public static void main(String[] args) {
           SpringApplication.run(SpeechRecognitionApplication.class, args);
       }
   }

實現語音識別的REST API：

import com.baidu.aip.speech.AipSpeech;
   import com.baidu.aip.speech.AipSpeechResponse;
   import org.springframework.beans.factory.annotation.Autowired;
   import org.springframework.http.ResponseEntity;
   import org.springframework.http.MediaType;
   import org.springframework.web.bind.annotation.*;
   import org.springframework.web.multipart.MultipartFile;

   import java.io.IOException;
   import org.json.JSONObject;

   @RestController
   @RequestMapping("/api/speech")
   public class SpeechRecognitionController {

       @Autowired
       private AipSpeech aipSpeech;

       @PostMapping(value = "/recognize", consumes = MediaType.MULTIPART_FORM_DATA_VALUE)
       public ResponseEntity<String> recognizeSpeech(@RequestParam("audio") MultipartFile audioFile) throws IOException {
           // 將音頻文件轉為字節數組
           byte[] audioData = audioFile.getBytes();
           
           // 執行語音識別
           JSONObject response = aipSpeech.asr(audioData, "pcm", 16000, null);
           
           // 檢查返回結果中的錯誤碼
           if (response.getInt("err_no") != 0) {
               return ResponseEntity.status(500).body(response.toString());
           }

           // 返回識別結果
           return ResponseEntity.ok(response.toString(4));
       }
   }

注意：

audioFile.getBytes()方法將上傳的音頻文件轉換為字節數組。
aipSpeech.asr方法接受音頻數據、音頻格式（如pcm）、采樣率（如16000）以及其他可選參數。
response對象中包含了識別結果，如果err_no不為0，則表示識別出錯。

測試 REST API：

可以使用工具（如Postman）或者編寫測試類來測試上述API。

import org.junit.jupiter.api.Test;
   import org.springframework.beans.factory.annotation.Autowired;
   import org.springframework.boot.test.context.SpringBootTest;
   import org.springframework.mock.web.MockMultipartFile;
   import org.springframework.test.web.servlet.MockMvc;
   import org.springframework.test.web.servlet.request.MockMvcRequestBuilders;

   import static org.springframework.test.web.servlet.result.MockMvcResultMatchers.status;

   @SpringBootTest
   class SpeechRecognitionApplicationTests {

       @Autowired
       private MockMvc mockMvc;

       @Test
       void testRecognizeSpeech() throws Exception {
           MockMultipartFile file = new MockMultipartFile(
                   "audio", "test.pcm", "audio/wav", new byte[]{/* test audio data */});

           mockMvc.perform(MockMvcRequestBuilders.multipart("/api/speech/recognize")
                           .file(file))
                   .andExpect(status().isOk());
       }
   }

項目中的優化與調試方法

優化連接和請求：

適當設置連接超時和讀取超時，以提高請求的響應速度和穩定性。

錯誤處理：

增加錯誤處理邏輯，例如網絡錯誤、API調用錯誤，并提供詳細的錯誤信息。

@PostMapping(value = "/recognize", consumes = MediaType.MULTIPART_FORM_DATA_VALUE)
   public ResponseEntity<String> recognizeSpeech(@RequestParam("audio") MultipartFile audioFile) {
       try {
           byte[] audioData = audioFile.getBytes();
           JSONObject response = aipSpeech.asr(audioData, "pcm", 16000, null);

           if (response.getInt("err_no") != 0) {
               return ResponseEntity.status(500).body("Error: " + response.optString("err_msg"));
           }

           return ResponseEntity.ok(response.toString(4));
       } catch (IOException e) {
           return ResponseEntity.status(500).body("File read error: " + e.getMessage());
       } catch (Exception e) {
           return ResponseEntity.status(500).body("Unexpected error: " + e.getMessage());
       }
   }

日志記錄

使用日志記錄API調用過程中的關鍵事件和錯誤信息，方便調試和定位問題。

import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.http.MediaType;
import org.springframework.http.ResponseEntity;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestMapping;
import org.springframework.web.bind.annotation.RequestParam;
import org.springframework.web.bind.annotation.RestController;
import org.springframework.web.multipart.MultipartFile;
import org.json.JSONObject;
import com.baidu.aip.speech.AipSpeech;

import java.io.IOException;

@RestController
@RequestMapping("/api/speech")
public class SpeechRecognitionController {

    private static final Logger logger = LoggerFactory.getLogger(SpeechRecognitionController.class);

    @Autowired
    private AipSpeech aipSpeech;

    @PostMapping(value = "/recognize", consumes = MediaType.MULTIPART_FORM_DATA_VALUE)
    public ResponseEntity<String> recognizeSpeech(@RequestParam("audio") MultipartFile audioFile) {
        try {
            byte[] audioData = audioFile.getBytes();
            logger.info("接收到的音頻文件大小: {}", audioData.length);

            JSONObject response = aipSpeech.asr(audioData, "pcm", 16000, null);

            if (response.getInt("err_no") != 0) {
                logger.error("語音識別錯誤: {}", response.optString("err_msg"));
                return ResponseEntity.status(500).body("錯誤: " + response.optString("err_msg"));
            }

            logger.info("識別結果: {}", response.toString(4));
            return ResponseEntity.ok(response.toString(4));
        } catch (IOException e) {
            logger.error("文件讀取錯誤", e);
            return ResponseEntity.status(500).body("文件讀取錯誤: " + e.getMessage());
        } catch (Exception e) {
            logger.error("意外錯誤", e);
            return ResponseEntity.status(500).body("意外錯誤: " + e.getMessage());
        }
    }
}

總結

通過這篇文章，我們詳細介紹了如何在Spring Boot 3.x項目中集成百度AI語音識別API。我們探討了API的特點、配置方法、創建REST API以實現語音識別功能、以及優化和調試的最佳實踐。希望這篇文章能夠幫助你在實際項目中實現高效的語音識別功能。

責任編輯：武曉燕來源：路條編程

百度 AI 語音

成人免费xxxxx在线视频软件_久久精品久久久_亚洲国产精品久久久_天天色天天色_亚洲人成一区_欧美一级欧美三级在线观看

Spring Boot與百度AI語音識別API集成實踐