基于HBase与Netty的煤矿微震时序大数据存储优化
DOI:
CSTR:
作者:
作者单位:

1.辽宁大学 信息学院, 辽宁 沈阳 110036 ; 2.辽宁煤电产业控股有限公司红阳三矿, 辽宁 辽阳 110101

作者简介:

丁琳琳(1983—),女,教授,从事大数据管理及图数据管理等方面研究。

通讯作者:

中图分类号:

TP392

基金项目:

国家自然科学基金项目(62072220);国家重点研发计划项目(2022YFC3004603);辽宁省自然科学基金计划项目(2022-KF-13-06)


Coal Mine Microseismic Timing Big Data Storage Optimization Based on HBase and Netty
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    在当前智能煤矿场景中,大量煤矿微震传感器所产生的时序数据呈爆炸式增长,进而对现有的存储系统及性能都有了更高的要求。目前已经存在基于分布式列族数据库HBase能够存储工业时序大数据的实例,但是由于没有考虑到特定业务场景中数据的特征关联问题,现有的策略仍然无法较好地满足煤矿微震波形时序数据的特定存储需求。针对上述问题,基于分布式存储系统HBase,利用煤矿微震波形时序数据的特征,提出了基于HBase与Netty的煤矿微震时序大数据存储性能优化(CM2TS-HBase),分成四个部分,分别为数据采集层、数据预处理层、数据中转层以及数据存储层。其中,数据采集层分为离线部分与实时部分,离线部分即存储在数据中心硬盘中的历史微震时序数据文件,实时部分即部署在某煤矿的多个微震波形传感器通过网络实时地向数据预处理层进行数据缓冲;数据预处理层实现对波形时序数据的文件进行对齐、解析以及序列化操作。根据煤矿微震波形时序数据特征提出了适用于微震波形时序数据的HBase数据表结构、预分区策略以及主键优化策略,有效地解决了数据存储过程中出现的数据热点问题以及数据分散问题;数据中转层提出了基于Netty与Redis的数据转发中间件平台为整个存储体系提供异步处理机制,较好地解决了高并发存储问题;数据存储层是基于分布式数据库HBase作为存储体系的底层存储媒介。最终根据真实数据集的存储耗时证明了相较于原生存储方法(HBase API)与基于金融时序数据存储优化(FTBase),CM2TS-HBase在煤矿微震时序数据的存储性能有了明显提高。

    Abstract:

    In the current intelligent coal mine scenario, the time-series data generated by a large number of coal mine microseismic sensors is growing explosively, which in turn has higher requirements for the existing storage system and performance. There are already instances where HBase, a distributed column family database, can store industrial time-series big data, the existing strategies are still unable to better meet the needs of coal mine microseismic waveform time-series data because they do not consider the feature association of data in specific business scenarios. specific storage requirements. In view of the above problems, based on the distributed storage system HBase, using the characteristics of coal mine microseismic waveform time series data, a coal mine microseismic time series big data storage performance optimization (CM2TS-HBase) based on HBase and Netty is proposed, which is divided into four parts, namely data acquisition layer, data preprocessing layer, data transfer layer, and data storage layer. Among them, the data acquisition layer is divided into offline part and real-time part. The offline part is the historical microseismic time series data files stored in the hard disk of the data center, and the real-time part is the multiple microseismic waveform sensors deployed in a coal mine. The layer performs data buffering; the data preprocessing layer implements alignment, parsing, and serialization operations on files of waveform time series data. According to the characteristics of coal mine microseismic waveform time series data, the HBase data table structure, pre-partition strategy and primary key optimization strategy suitable for microseismic waveform time series data are proposed, which effectively solves the data hot spot problem and data dispersion problem in the data storage process; data transfer layer A data forwarding middleware platform based on Netty and Redis is proposed to provide an asynchronous processing mechanism for the entire storage system, which better solves the problem of high concurrent storage; the data storage layer is based on the distributed database HBase as the underlying storage medium of the storage system. Finally, according to the time-consuming storage of real data sets, it is proved that compared with the native storage method (HBase API) and financial time-series data storage optimization (FT -HBase), CM2TS -HBase has significantly improved the storage performance of coal mine microseismic time-series data.

    参考文献
    相似文献
    引证文献
引用本文

丁琳琳,王智涵,顾英豪,等.基于HBase与Netty的煤矿微震时序大数据存储优化[J].中国矿山工程,2023,52(5):29-35.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-12-24
  • 出版日期:
文章二维码