北京邮电大学学报(社会科学版) ›› 2022, Vol. 24 ›› Issue (6): 7-20.doi: 10.19722/j.cnki.1008-7729.2022.0109

• 网络文化 • 上一篇    下一篇

基于文本挖掘的优质带货直播的内容特征研究

马林烨(1991—),女,河北衡水人,博士在读   

  1. 1.北京邮电大学 经济管理学院,北京100876;2.北京邮电大学 现代邮政学院,北京100876;
    3.中国移动政企客户分公司,北京100032
  • 收稿日期:2022-09-16 出版日期:2022-12-30 发布日期:2023-01-10
  • 作者简介:马林烨(1991—),女,河北衡水人,博士在读
  • 基金资助:
    教育部首批新文科研究与改革实践项目(2021090003)

Content Characteristics of High-quality Live Streaming Based on Text Mining

  1. 1.School of Economics and Telecommunications, Beijing University of Posts and Telecommunications, 
    Beijing 100876, China;2.School of Modern Post, Beijing University of Posts and Telecommunications,
     Beijing 100876, China;3.Government and Enterprise Customer Branch of China Mobile, 
    Beijing 100032, China
  • Received:2022-09-16 Online:2022-12-30 Published:2023-01-10

摘要: 为了探究优质带货直播的内容特征,对直播视频生成的字幕文本进行文本挖掘。首先,利用LDA模型提取主题,将优质带货直播的内容特征分为交互性、个性化、可视性、专业性、可靠性和娱乐性,并重新定义。然后,选定合适的机器学习算法预测主题概率。最后,利用统计分析给出优质带货直播内容特征的一般分布规律。研究发现:直播购物内容具有实用、娱乐和社交价值;体验品的介绍时长高于搜索品,体验品在专业性、娱乐性上高于搜索品,可视化上低于搜索品并进一步发现服装和化妆品确实有类似特征。另外,相较于成熟的直播间,新直播间的商品介绍时长缩短,响应性提高。结合前人研究成果得出合理解释,有助于指导新主播入门和优化平台直播流程。

关键词: 直播购物, 文本挖掘, 内容特征, LDA模型

Abstract: To explore the content characteristics of high-quality live streaming, text mining is conducted on the subtitle text. Firstly, the LDA model was used to extract themes, and the content of high-quality live streaming was divided into responsiveness, personalization, visualization, professionalism, reliability and entertainment, and the new definitions of them were given. Then, a suitable machine learning algorithm was used to predict the topic probability. Finally, the probability distribution of the content characteristics of high-quality live streaming was given by statistical analysis. The research showed that live streaming content provided utilitarian, hedonic and social value. The introduction duration of experience products was higher than that of search products, and experience products are higher than search products in terms of professionalism and entertainment, and lower in terms of visualization. In addition, the product introduction duration of the new live streaming studios was relatively shorter than that of mature ones and the new live streaming studios paid more attention to responsiveness. Some reasonable explanations were given for these findings based on previous research results. This study would help guide the work of new anchors and optimize the process for live streaming platforms.

Key words: live streaming shopping, text mining, content characteristic, LDA model

中图分类号: