Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

live_voice_call多说几句就会卡住,无法接受语音 #21

Open
changwu opened this issue Feb 10, 2025 · 4 comments
Open

live_voice_call多说几句就会卡住,无法接受语音 #21

changwu opened this issue Feb 10, 2025 · 4 comments

Comments

@changwu
Copy link

changwu commented Feb 10, 2025

如题,在log中,并没有看到我发送语音和识别的消息。附件是录屏,才说两句就无法输入了。

我代码部署在ubuntu 22上,客户端访问是win11+谷歌chrome。

请问这是什么问题呢?

@changwu
Copy link
Author

changwu commented Feb 10, 2025

Image

@luminghao-bytedance
Copy link
Collaborator

@changwu 可以提供一些问题复现时,后端输出的日志信息吗

@yakel
Copy link

yakel commented Feb 12, 2025

遇到了同样的问题,看起来是 ASR 识别后返回 text 是空的。
贴下我的日志,后面是有在说话的:

2025-02-12 09:36:32,916 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:32,916 - WARNING - asr audio data, len=3212
2025-02-12 09:36:32,917 - INFO - Sent Data INFO ASR SEVER data len=1404
2025-02-12 09:36:32,979 - INFO - Received asr server response: sequence=16 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1505)
response: sequence=16 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1505)
2025-02-12 09:36:32,980 - INFO - asr buffer incremented: 2, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,021 - INFO - Received asr server response: sequence=17 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1606)
response: sequence=17 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1606)
2025-02-12 09:36:33,021 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,063 - INFO - Received asr server response: sequence=18 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1706)
response: sequence=18 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1706)
2025-02-12 09:36:33,064 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,089 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,089 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,089 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,090 - INFO - Sent Data INFO ASR SEVER data len=1412
2025-02-12 09:36:33,176 - INFO - Received asr server response: sequence=19 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1806)
response: sequence=19 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1806)
2025-02-12 09:36:33,177 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,178 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,178 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,178 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,179 - INFO - Sent Data INFO ASR SEVER data len=1375
2025-02-12 09:36:33,267 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,267 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,267 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,268 - INFO - Sent Data INFO ASR SEVER data len=1322
2025-02-12 09:36:33,357 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,357 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,357 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,358 - INFO - Sent Data INFO ASR SEVER data len=1268
2025-02-12 09:36:33,372 - INFO - Received asr server response: sequence=20 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1907)
response: sequence=20 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=1907)
2025-02-12 09:36:33,372 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,421 - INFO - Received asr server response: sequence=21 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2007)
response: sequence=21 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2007)
2025-02-12 09:36:33,422 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,446 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,447 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,447 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,447 - INFO - Sent Data INFO ASR SEVER data len=1266
2025-02-12 09:36:33,463 - INFO - Received asr server response: sequence=22 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2107)
response: sequence=22 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2107)
2025-02-12 09:36:33,464 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,537 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,537 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,537 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,538 - INFO - Sent Data INFO ASR SEVER data len=1285
2025-02-12 09:36:33,627 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,627 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,627 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,628 - INFO - Sent Data INFO ASR SEVER data len=1264
2025-02-12 09:36:33,649 - INFO - Received asr server response: sequence=23 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2208)
response: sequence=23 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2208)
2025-02-12 09:36:33,650 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,700 - INFO - Received asr server response: sequence=24 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2308)
response: sequence=24 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2308)
2025-02-12 09:36:33,700 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,717 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,717 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,717 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,718 - INFO - Sent Data INFO ASR SEVER data len=1274
2025-02-12 09:36:33,743 - INFO - Received asr server response: sequence=25 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2409)
response: sequence=25 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2409)
2025-02-12 09:36:33,743 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,807 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,807 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:33,808 - WARNING - asr audio data, len=3212
2025-02-12 09:36:33,809 - INFO - Sent Data INFO ASR SEVER data len=1293
2025-02-12 09:36:33,919 - INFO - Received asr server response: sequence=26 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2509)
response: sequence=26 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2509)
2025-02-12 09:36:33,920 - INFO - asr buffer incremented: 0, utterances: [Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]
2025-02-12 09:36:33,986 - INFO - Received asr server response: sequence=27 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2609)
response: sequence=27 last_package=False result=ASRResult(text='肉馅', utterances=[Utterance(definite=False, end_time=1140, start_time=840, text='肉馅', words=[Word(blank_duration=None, end_time=920, start_time=840, text='肉'), Word(blank_duration=None, end_time=1140, start_time=1060, text='馅')])]) audio=ASRAudioInfoRsp(duration=2609)
2025-02-12 09:36:33,987 - WARNING - ASR yield the buffer: 肉馅
2025-02-12 09:36:33,987 - INFO - Sending output event= SentenceRecognized, data len:0 , payload: sentence='肉馅'
2025-02-12 09:36:33,988 - INFO - need recreate tts client
2025-02-12 09:36:33,988 - INFO - with logID: 957e9682-8c5f-453a-af42-e4e6c330a467 , header: {'X-Tt-Logid': '957e9682-8c5f-453a-af42-e4e6c330a467', 'X-Api-Resource-Id': 'volc.service_type.10029', 'X-Api-Access-Key': 'EZMQzQ9_cr-9lfb_IS-T6B_KZ5-ODGVR', 'X-Api-App-Key': '4480852288', 'X-Api-Connect-Id': '84487dc7-234d-4614-915f-b69ace484ece'}
2025-02-12 09:36:33,989 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,989 - WARNING - service is InProgress, will ignore the incoming input
2025-02-12 09:36:33,996 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:33,996 - WARNING - service is InProgress, will ignore the incoming input
2025-02-12 09:36:34,083 - INFO - Dial server with LogID: 957e9682-8c5f-453a-af42-e4e6c330a467
2025-02-12 09:36:34,868 - INFO - HTTP Request: POST https://ark.cn-beijing.volces.com/api/v3/chat/completions "HTTP/1.1 200 OK"
2025-02-12 09:36:34,870 - WARNING - llm time cost: 0.6126649379730225
2025-02-12 09:36:35,389 - INFO - receive tts response: event=350 transcript=肉馅经典又好吃! audio len=0
2025-02-12 09:36:35,389 - WARNING - TTS start, the transcript: 肉馅经典又好吃!
2025-02-12 09:36:35,389 - INFO - Sending output event= TTSSentenceStart, data len:0 , payload: sentence='肉馅经典又好吃!'
2025-02-12 09:36:35,606 - INFO - receive tts response: event=352 transcript=None audio len=1389
2025-02-12 09:36:35,606 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,660 - INFO - receive tts response: event=352 transcript=None audio len=1728
2025-02-12 09:36:35,660 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,710 - INFO - receive tts response: event=352 transcript=None audio len=2112
2025-02-12 09:36:35,710 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,759 - INFO - receive tts response: event=352 transcript=None audio len=1920
2025-02-12 09:36:35,759 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,799 - INFO - receive tts response: event=352 transcript=None audio len=1920
2025-02-12 09:36:35,799 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,849 - INFO - receive tts response: event=352 transcript=None audio len=2112
2025-02-12 09:36:35,849 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,906 - INFO - receive tts response: event=352 transcript=None audio len=1920
2025-02-12 09:36:35,906 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,950 - INFO - receive tts response: event=352 transcript=None audio len=1536
2025-02-12 09:36:35,950 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,950 - INFO - receive tts response: event=352 transcript=None audio len=192
2025-02-12 09:36:35,950 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,951 - INFO - receive tts response: event=352 transcript=None audio len=2304
2025-02-12 09:36:35,951 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,952 - INFO - receive tts response: event=352 transcript=None audio len=1152
2025-02-12 09:36:35,952 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,952 - INFO - receive tts response: event=351 transcript=None audio len=0
2025-02-12 09:36:35,952 - WARNING - TTS time cost: 0.5627191066741943
2025-02-12 09:36:35,952 - INFO - Sending output event= TTSSentenceEnd, data len:18285 , payload: None
2025-02-12 09:36:35,952 - INFO - receive tts response: event=350 transcript=最爱啥肉馅饺子呀? audio len=0
2025-02-12 09:36:35,952 - WARNING - TTS start, the transcript: 最爱啥肉馅饺子呀?
2025-02-12 09:36:35,953 - INFO - Sending output event= TTSSentenceStart, data len:0 , payload: sentence='最爱啥肉馅饺子呀?'
2025-02-12 09:36:35,953 - INFO - receive tts response: event=352 transcript=None audio len=1389
2025-02-12 09:36:35,953 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,954 - INFO - receive tts response: event=352 transcript=None audio len=1728
2025-02-12 09:36:35,954 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,954 - INFO - receive tts response: event=352 transcript=None audio len=192
2025-02-12 09:36:35,955 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,955 - INFO - receive tts response: event=352 transcript=None audio len=1920
2025-02-12 09:36:35,955 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,969 - INFO - receive tts response: event=352 transcript=None audio len=2112
2025-02-12 09:36:35,970 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,970 - INFO - receive tts response: event=352 transcript=None audio len=1920
2025-02-12 09:36:35,970 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,970 - INFO - receive tts response: event=352 transcript=None audio len=1920
2025-02-12 09:36:35,970 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:35,986 - INFO - receive tts response: event=352 transcript=None audio len=2112
2025-02-12 09:36:35,986 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:36,018 - INFO - receive tts response: event=352 transcript=None audio len=1728
2025-02-12 09:36:36,019 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:36,019 - INFO - receive tts response: event=352 transcript=None audio len=192
2025-02-12 09:36:36,019 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:36,019 - INFO - receive tts response: event=352 transcript=None audio len=2496
2025-02-12 09:36:36,019 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:36,020 - INFO - receive tts response: event=352 transcript=None audio len=1152
2025-02-12 09:36:36,020 - INFO - receive tts response: event=352 transcript=None audio len=0
2025-02-12 09:36:36,020 - INFO - receive tts response: event=351 transcript=None audio len=0
2025-02-12 09:36:36,020 - WARNING - TTS time cost: 0.06763005256652832
2025-02-12 09:36:36,020 - INFO - Sending output event= TTSSentenceEnd, data len:18861 , payload: None
2025-02-12 09:36:36,022 - INFO - receive tts response: event=350 transcript= audio len=0
2025-02-12 09:36:36,022 - WARNING - TTS start, the transcript:
2025-02-12 09:36:36,022 - INFO - Sending output event= TTSSentenceStart, data len:0 , payload: sentence=' '
2025-02-12 09:36:36,022 - INFO - receive tts response: event=351 transcript=None audio len=0
2025-02-12 09:36:36,022 - WARNING - TTS time cost: 0.00040984153747558594
2025-02-12 09:36:36,022 - INFO - Sending output event= TTSSentenceEnd, data len:0 , payload: None
2025-02-12 09:36:36,035 - INFO - receive tts response: event=152 transcript=None audio len=0
2025-02-12 09:36:36,040 - INFO - Sending output event= TTSDone, data len:0 , payload: None
2025-02-12 09:36:36,110 - WARNING - ASR client closed
2025-02-12 09:36:36,110 - INFO - ASR client is disconnected, skip the audio input.
2025-02-12 09:36:37,111 - INFO - ASR client is disconnected, skip the audio input.
2025-02-12 09:36:38,113 - INFO - ASR client is disconnected, skip the audio input.
2025-02-12 09:36:39,116 - INFO - ASR client is disconnected, skip the audio input.
2025-02-12 09:36:40,117 - INFO - ASR client is disconnected, skip the audio input.
2025-02-12 09:36:41,104 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,105 - WARNING - need recreate asr conn
2025-02-12 09:36:41,118 - INFO - ASR client is disconnected, skip the audio input.
2025-02-12 09:36:41,227 - INFO - Connected to wss://openspeech.bytedance.com/api/v3/sauc/bigmodel, log_id: e96035eb-700a-49d5-8dc2-d939743a303d
2025-02-12 09:36:41,285 - INFO - Inited asr client: sequence=None last_package=False result=None audio=None
2025-02-12 09:36:41,286 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,286 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,287 - INFO - Sent Data INFO ASR SEVER data len=1528
2025-02-12 09:36:41,287 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,287 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,287 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,287 - INFO - Sent Data INFO ASR SEVER data len=1429
2025-02-12 09:36:41,288 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,288 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,288 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,288 - INFO - Sent Data INFO ASR SEVER data len=1386
2025-02-12 09:36:41,366 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,367 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,367 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,368 - INFO - Sent Data INFO ASR SEVER data len=2484
2025-02-12 09:36:41,456 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,457 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,457 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,457 - INFO - Sent Data INFO ASR SEVER data len=2882
2025-02-12 09:36:41,547 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,547 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,548 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,548 - INFO - Sent Data INFO ASR SEVER data len=2869
2025-02-12 09:36:41,636 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,637 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,637 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,637 - INFO - Sent Data INFO ASR SEVER data len=2561
2025-02-12 09:36:41,726 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,726 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,726 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,727 - INFO - Sent Data INFO ASR SEVER data len=2908
2025-02-12 09:36:41,816 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,816 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,816 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,816 - INFO - Sent Data INFO ASR SEVER data len=3016
2025-02-12 09:36:41,996 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:41,996 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:41,996 - WARNING - asr audio data, len=3212
2025-02-12 09:36:41,996 - INFO - Sent Data INFO ASR SEVER data len=3001
2025-02-12 09:36:42,086 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,086 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,086 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,087 - INFO - Sent Data INFO ASR SEVER data len=2943
2025-02-12 09:36:42,119 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,119 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,120 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,120 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,121 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,121 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,121 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,121 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,121 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,121 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,133 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,176 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,176 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,176 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,177 - INFO - Sent Data INFO ASR SEVER data len=2969
2025-02-12 09:36:42,224 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,266 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,266 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,266 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,267 - INFO - Sent Data INFO ASR SEVER data len=2880
2025-02-12 09:36:42,313 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,356 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,356 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,357 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,357 - INFO - Sent Data INFO ASR SEVER data len=2693
2025-02-12 09:36:42,402 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,446 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,447 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,447 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,447 - INFO - Sent Data INFO ASR SEVER data len=1940
2025-02-12 09:36:42,493 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,537 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,537 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,537 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,538 - INFO - Sent Data INFO ASR SEVER data len=1490
2025-02-12 09:36:42,584 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,627 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,627 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,627 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,628 - INFO - Sent Data INFO ASR SEVER data len=1386
2025-02-12 09:36:42,671 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,717 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,717 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,717 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,718 - INFO - Sent Data INFO ASR SEVER data len=1362
2025-02-12 09:36:42,761 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,898 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,899 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,899 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,900 - INFO - Sent Data INFO ASR SEVER data len=1387
2025-02-12 09:36:42,943 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:42,987 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:42,987 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:42,987 - WARNING - asr audio data, len=3212
2025-02-12 09:36:42,988 - INFO - Sent Data INFO ASR SEVER data len=1338
2025-02-12 09:36:43,035 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,077 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,077 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,077 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,078 - INFO - Sent Data INFO ASR SEVER data len=1457
2025-02-12 09:36:43,120 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,167 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,167 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,167 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,168 - INFO - Sent Data INFO ASR SEVER data len=2029
2025-02-12 09:36:43,211 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,257 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,257 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,257 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,258 - INFO - Sent Data INFO ASR SEVER data len=2675
2025-02-12 09:36:43,303 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,347 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,347 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,347 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,348 - INFO - Sent Data INFO ASR SEVER data len=2890
2025-02-12 09:36:43,391 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,436 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,437 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,437 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,438 - INFO - Sent Data INFO ASR SEVER data len=2503
2025-02-12 09:36:43,484 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,527 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,527 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,527 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,528 - INFO - Sent Data INFO ASR SEVER data len=2886
2025-02-12 09:36:43,574 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,617 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,617 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,617 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,618 - INFO - Sent Data INFO ASR SEVER data len=2889
2025-02-12 09:36:43,662 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,798 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,798 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,798 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,799 - INFO - Sent Data INFO ASR SEVER data len=2959
2025-02-12 09:36:43,846 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,888 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,888 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,888 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,889 - INFO - Sent Data INFO ASR SEVER data len=3004
2025-02-12 09:36:43,933 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:43,976 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:43,977 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:43,977 - WARNING - asr audio data, len=3212
2025-02-12 09:36:43,977 - INFO - Sent Data INFO ASR SEVER data len=2516
2025-02-12 09:36:44,022 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,067 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,067 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,067 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,067 - INFO - Sent Data INFO ASR SEVER data len=1894
2025-02-12 09:36:44,109 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,157 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,157 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,158 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,158 - INFO - Sent Data INFO ASR SEVER data len=1388
2025-02-12 09:36:44,201 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,248 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,249 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,249 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,250 - INFO - Sent Data INFO ASR SEVER data len=1393
2025-02-12 09:36:44,301 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,336 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,337 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,337 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,337 - INFO - Sent Data INFO ASR SEVER data len=1382
2025-02-12 09:36:44,382 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,426 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,427 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,427 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,427 - INFO - Sent Data INFO ASR SEVER data len=1401
2025-02-12 09:36:44,469 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,517 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,517 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,517 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,518 - INFO - Sent Data INFO ASR SEVER data len=1431
2025-02-12 09:36:44,559 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,698 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,699 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,699 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,700 - INFO - Sent Data INFO ASR SEVER data len=1386
2025-02-12 09:36:44,741 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,787 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,787 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,787 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,788 - INFO - Sent Data INFO ASR SEVER data len=1426
2025-02-12 09:36:44,830 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,877 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,877 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,877 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,879 - INFO - Sent Data INFO ASR SEVER data len=1408
2025-02-12 09:36:44,919 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:44,967 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:44,967 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:44,967 - WARNING - asr audio data, len=3212
2025-02-12 09:36:44,968 - INFO - Sent Data INFO ASR SEVER data len=1392
2025-02-12 09:36:45,021 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,057 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,057 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,057 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,058 - INFO - Sent Data INFO ASR SEVER data len=1394
2025-02-12 09:36:45,102 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,147 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,147 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,147 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,148 - INFO - Sent Data INFO ASR SEVER data len=1504
2025-02-12 09:36:45,191 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,238 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,238 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,238 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,239 - INFO - Sent Data INFO ASR SEVER data len=1566
2025-02-12 09:36:45,284 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,327 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,327 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,327 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,328 - INFO - Sent Data INFO ASR SEVER data len=1548
2025-02-12 09:36:45,372 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,417 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,417 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,417 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,418 - INFO - Sent Data INFO ASR SEVER data len=1441
2025-02-12 09:36:45,463 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,598 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,599 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,599 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,600 - INFO - Sent Data INFO ASR SEVER data len=1406
2025-02-12 09:36:45,645 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,687 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,687 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,687 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,688 - INFO - Sent Data INFO ASR SEVER data len=2221
2025-02-12 09:36:45,732 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,777 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,777 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,777 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,777 - INFO - Sent Data INFO ASR SEVER data len=3043
2025-02-12 09:36:45,827 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,867 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,867 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,867 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,868 - INFO - Sent Data INFO ASR SEVER data len=2781
2025-02-12 09:36:45,914 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:45,957 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:45,957 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:45,957 - WARNING - asr audio data, len=3212
2025-02-12 09:36:45,958 - INFO - Sent Data INFO ASR SEVER data len=3029
2025-02-12 09:36:46,004 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:46,047 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:46,047 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:46,048 - WARNING - asr audio data, len=3212
2025-02-12 09:36:46,049 - INFO - Sent Data INFO ASR SEVER data len=3017
2025-02-12 09:36:46,094 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:46,137 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:46,137 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:46,137 - WARNING - asr audio data, len=3212
2025-02-12 09:36:46,138 - INFO - Sent Data INFO ASR SEVER data len=2285
2025-02-12 09:36:46,188 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:46,227 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:46,227 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:46,227 - WARNING - asr audio data, len=3212
2025-02-12 09:36:46,228 - INFO - Sent Data INFO ASR SEVER data len=1566
2025-02-12 09:36:46,272 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:46,319 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:46,319 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:46,319 - WARNING - asr audio data, len=3212
2025-02-12 09:36:46,320 - INFO - Sent Data INFO ASR SEVER data len=1442
2025-02-12 09:36:46,362 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:46,498 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:46,498 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:46,498 - WARNING - asr audio data, len=3212
2025-02-12 09:36:46,499 - INFO - Sent Data INFO ASR SEVER data len=1555
2025-02-12 09:36:46,541 - INFO - Received asr server response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
response: sequence=None last_package=False result=ASRResult(text='', utterances=[]) audio=ASRAudioInfoRsp(duration=None)
2025-02-12 09:36:46,587 - INFO - Received input event: UserAudio, payload: UserAudio, data len:3212
2025-02-12 09:36:46,587 - WARNING - receive input, event=UserAudio payload=None data_len=3212
2025-02-12 09:36:46,587 - WARNING - asr audio data, len=3212
2025-02-12 09:36:46,588 - INFO - Sent Data INFO ASR SEVER data len=1448

@Zpenya
Copy link

Zpenya commented Feb 12, 2025

遇到了同样的问题,看起来是 ASR 识别后返回 text 是空的。

我也有同样的问题,基本说两三句就不返回asr结果了,但是一直在收音。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants