kafka logManager类 kafka存储机制

russle

2015-08-26

关注关注

logManager类：管理kafka数据log的类，包括数据clean，flush等操作

Log类：每个tplog的对象

logSegment：每个tplog目录下的文件对象

filemessageSet：每个log file的管道类

base offset：在topic中的绝对offset值

offsetindex：每个log index的管道map类，存储相对offset值和文件position

按照partition分区topic，分发到各个机子上

partition上有多个log文件，每个log文件一个索引文件

log文件是实际的数据，索引文件是log文件里数据的相对偏移量和在log文件里的position，偏移量offset是一段数据生成一个offset，避免offset文件过大

1.初始化：

val RecoveryPointCheckpointFile = "recovery-point-offset-checkpoint"
  val LockFile = ".lock"
  val InitialTaskDelayMs = 30*1000
  private val logCreationOrDeletionLock = new Object
  private val logs = new Pool[TopicAndPartition, Log]()//所有log的对象,一个topicpartition 一个log对象

  //获得log文件，并获得文件channel锁
  createAndValidateLogDirs(logDirs)
  private val dirLocks = lockLogDirs(logDirs)
  private val recoveryPointCheckpoints = logDirs.map(dir => (dir, new OffsetCheckpoint(new File(dir, RecoveryPointCheckpointFile)))).toMap
  //遍历所有的log，生成Log对象，并且执行log clean（checkposition）
  loadLogs()

主要方法loadLogs：

if (cleanShutdownFile.exists) {//表示上次关闭kafka时，已经clean完，这次不需要clean
        debug(
          "Found clean shutdown file. " +
          "Skipping recovery for all logs in data directory: " +
          dir.getAbsolutePath)
      } else {
        // log recovery itself is being performed by `Log` class during initialization
        brokerState.newState(RecoveringFromUncleanShutdown)
      }

      //获得log下recover文件
      val recoveryPoints = this.recoveryPointCheckpoints(dir).read

      val jobsForDir = for {
        dirContent <- Option(dir.listFiles).toList
        logDir <- dirContent if logDir.isDirectory
      } yield {
        Utils.runnable {
          debug("Loading log '" + logDir.getName + "'")
          //从文件目录上获得topic和partition
          val topicPartition = Log.parseTopicPartitionName(logDir.getName)
          //从map中获得topic的自定义config，如果
          val config = topicConfigs.getOrElse(topicPartition.topic, defaultConfig)
          val logRecoveryPoint = recoveryPoints.getOrElse(topicPartition, 0L)

          val current = new Log(logDir, config, logRecoveryPoint, scheduler, time)
          val previous = this.logs.put(topicPartition, current)
          //判断是否有重复的topic+partition
          if (previous != null) {
            throw new IllegalArgumentException(
              "Duplicate log directories found: %s, %s!".format(
              current.dir.getAbsolutePath, previous.dir.getAbsolutePath))
          }
        }
      }
      //对每个logDir执行 上边的runnable，生成Log对象添加到log pool中
      jobs(cleanShutdownFile) = jobsForDir.map(pool.submit).toSeq

其中new Log方法，为初始化log file和index

主方法：loadSegments

1.处理swap文件，log则重新加载（rename），index则删除

2.加载log和index，恢复不存在的index

private def loadSegments() {
    // create the log directory if it doesn't exist
    dir.mkdirs()
    
    // first do a pass through the files in the log directory and remove any temporary files 
    // and complete any interrupted swap operations
    for(file <- dir.listFiles if file.isFile) {
      if(!file.canRead)
        throw new IOException("Could not read file " + file)
      val filename = file.getName
      if(filename.endsWith(DeletedFileSuffix) || filename.endsWith(CleanedFileSuffix)) {
        // if the file ends in .deleted or .cleaned, delete it
        file.delete()
      } else if(filename.endsWith(SwapFileSuffix)) {//文件用于swap时候，恢复log
        // we crashed in the middle of a swap operation, to recover:
        // if a log, swap it in and delete the .index file
        // if an index just delete it, it will be rebuilt
        //如果是index则删除，如果是log则重新加载（重命名），并删除已经存在的index
        val baseName = new File(Utils.replaceSuffix(file.getPath, SwapFileSuffix, ""))
        if(baseName.getPath.endsWith(IndexFileSuffix)) {
          file.delete()
        } else if(baseName.getPath.endsWith(LogFileSuffix)){
          // delete the index
          val index = new File(Utils.replaceSuffix(baseName.getPath, LogFileSuffix, IndexFileSuffix))
          index.delete()
          // complete the swap operation
          val renamed = file.renameTo(baseName)
          if(renamed)
            info("Found log file %s from interrupted swap operation, repairing.".format(file.getPath))
          else
            throw new KafkaException("Failed to rename file %s.".format(file.getPath))
        }
      }
    }

    // now do a second pass and load all the .log and .index files
    for(file <- dir.listFiles if file.isFile) {
      val filename = file.getName
      if(filename.endsWith(IndexFileSuffix)) {
        // if it is an index file, make sure it has a corresponding .log file 查看index log是否对应的 log，如果没有则删除
        val logFile = new File(file.getAbsolutePath.replace(IndexFileSuffix, LogFileSuffix))
        if(!logFile.exists) {
          warn("Found an orphaned index file, %s, with no corresponding log file.".format(file.getAbsolutePath))
          file.delete()
        }
      } else if(filename.endsWith(LogFileSuffix)) {
        // if its a log file, load the corresponding log segment
        // 文件名是start offset
        val start = filename.substring(0, filename.length - LogFileSuffix.length).toLong
        val hasIndex = Log.indexFilename(dir, start).exists
        //建立tplog中 每个日志文件对象 logsegment，包含filemessage，offsetindex，baseoffset值
        val segment = new LogSegment(dir = dir, 
                                     startOffset = start,
                                     indexIntervalBytes = config.indexInterval, 
                                     maxIndexSize = config.maxIndexSize,
                                     rollJitterMs = config.randomSegmentJitter,
                                     time = time)
        if(!hasIndex) {
          error("Could not find index file corresponding to log file %s, rebuilding index...".format(segment.log.file.getAbsolutePath))
          //重建index文件和内存索引,文件和内存索引是用的channel map机制
          segment.recover(config.maxMessageSize)
        }
        segments.put(start, segment)
      }
    }

    if(logSegments.size == 0) {
      // no existing segments, create a new mutable segment beginning at offset 0
      segments.put(0L, new LogSegment(dir = dir,
                                     startOffset = 0,
                                     indexIntervalBytes = config.indexInterval, 
                                     maxIndexSize = config.maxIndexSize,
                                     rollJitterMs = config.randomSegmentJitter,
                                     time = time))
    } else {
      recoverLog()
      // reset the index size of the currently active log segment to allow more entries
      activeSegment.index.resize(config.maxIndexSize)
    }

    // sanity check the index file of every segment to ensure we don't proceed with a corrupt segment
    for (s <- logSegments)
      s.index.sanityCheck()
  }

-----------------------------初始化完毕---------------------------------

startup方法中三个功能：

1.cleanupLogs

2.flushDirtyLogs

3.checkpointRecoveryPointOffsets

1.cleanupLogs

两个方法一个是超时(超时是modify时间)，一个是大小（大小是最老的小于diff）

private def cleanupExpiredSegments(log: Log): Int = {
    val startMs = time.milliseconds
    //参数为log manager开始时间-tplog的修改时间 和 配置retention时间 比较，超过则需要删除，返回true
    //删除的是最后一次修改时间超过retention time的
    log.deleteOldSegments(startMs - _.lastModified > log.config.retentionMs)
  }

/**
   * 删除规则，是tplog超过阈值，从最老的开始找，找到file的大小小于diff的时候删除
   * 如果当前log file大小大于diff，则停止（原则是等最后一个文件可删除）
   *  Runs through the log removing segments until the size of the log
   *  is at least logRetentionSize bytes in size
   */
  private def cleanupSegmentsToMaintainSize(log: Log): Int = {
    if(log.config.retentionSize < 0 || log.size < log.config.retentionSize)
      return 0//当配置小于0，或log大小小于配置
    var diff = log.size - log.config.retentionSize
    def shouldDelete(segment: LogSegment) = {
      if(diff - segment.size >= 0) {//如果需要删除的大小 大于或等于 logfile，则返回true
        diff -= segment.size
        true
      } else {
        false
      }
    }
    log.deleteOldSegments(shouldDelete)
  }

参数：

清理日志，距离上次修改时间大于config时间，则删除

val logCleanupIntervalMs = props.getLongInRange("log.retention.check.interval.ms", 5*60*1000, (1, Long.MaxValue))

log clean参数，达到log大小上限，log的position

val logRetentionBytes = props.getLong("log.retention.bytes", -1)

def deleteOldSegments(predicate: LogSegment => Boolean): Int = {
    // find any segments that match the user-supplied predicate UNLESS it is the final segment 
    // and it is empty (since we would just end up re-creating it
    val lastSegment = activeSegment
    //超时，并且包含segment，则删除，获得删除list segment
    val deletable = logSegments.takeWhile(s => predicate(s) && (s.baseOffset != lastSegment.baseOffset || s.size > 0))
    val numToDelete = deletable.size
    if(numToDelete > 0) {
      lock synchronized {
        // we must always have at least one segment, so if we are going to delete all the segments, create a new one first
        if(segments.size == numToDelete)
          roll()
        // remove the segments for lookups
        deletable.foreach(deleteSegment(_))//从segment集合中移除，修改文件名称为delete结尾，并异步删除
      }
    }
    numToDelete
  }

2.flushDirtyLogs

flush的message条数和时间间隔
    /* the maximum time in ms that a message in any topic is kept in memory before flushed to disk */
  val logFlushIntervalMs = props.getLong("log.flush.interval.ms", logFlushSchedulerIntervalMs)
  
  /**
   * Flush any log which has exceeded its flush interval and has unwritten messages.
   */
  private def flushDirtyLogs() = {
    debug("Checking for dirty logs to flush...")

    for ((topicAndPartition, log) <- logs) {
      try {
        val timeSinceLastFlush = time.milliseconds - log.lastFlushTime
        debug("Checking if flush is needed on " + topicAndPartition.topic + " flush interval  " + log.config.flushMs +
              " last flushed " + log.lastFlushTime + " time since last flush: " + timeSinceLastFlush)
        if(timeSinceLastFlush >= log.config.flushMs)
          log.flush
      } catch {
        case e: Throwable =>
          error("Error flushing topic " + topicAndPartition.topic, e)
      }
    }
  }
  
    @threadsafe
  def flush() {
    LogFlushStats.logFlushTimer.time {
      log.flush()
      index.flush()
    }
  }

3.checkpointRecoveryPointOffsets

checkpointRecoveryPointOffsets，标记logdir上的恢复点，避免启动时，需要恢复所有log，生成index

是按照logdir遍历，logdir中包含多个tplog

/**
   * Make a checkpoint for all logs in provided directory.
   */
  private def checkpointLogsInDir(dir: File): Unit = {
    //获得当前dir的所有tplog，value：Map【TopicAndPartition, Log】
    val recoveryPoints = this.logsByDir.get(dir.toString)
    if (recoveryPoints.isDefined) {
      //mapValues重新生成map的value，write参数（topicAndPartition：recoverPoint）；
      //write将tplog的offset写入recover文件的tmp文件中，删除旧文件，rename为recover文件 _是Log对象（value）
      this.recoveryPointCheckpoints(dir).write(recoveryPoints.get.mapValues(_.recoveryPoint))
    }
  }

logmanager里实现log compact功能

if(cleanerConfig.enableCleaner)
      cleaner.startup()//log compact

log kafka

russle

0 关注 0 粉丝 0 动态

关注关注

Kafka源码解析（一）---LogSegment以及Log初始化

我们先回想一下Kafka的日志结构是怎样的？Kafka 日志对象由多个日志段对象组成，而每个日志段对象会在磁盘上创建一组文件，包括消息日志文件、位移索引文件、时间戳索引文件以及已中止事务的索引文件。当然，如果你没有使用 Kafka 事务，已中止事务的索引文

jiaomrswang 2020-06-07

RabbitMQ如何保证消息的可靠投递？

String message = "this is info message " + i;autoAck=false: RabbitMQ会等待消费者显示回复确认消息后才从内存中移出消息。deliveryTag: 用来标识信道中投递的消息

zhuxue 2020-10-14

Linux后台执行命令：&与nohup的用法

大家可能有这样的体验：某个程序运行的时候，会产生大量的log，但实际上我们只想让它跑一下而已，log暂时不需要或者后面才有需要。所以在这样的情况下，我们希望程序能够在后台进行，也就是说，在终端上我们看不到它所打出的log。为了实现这个需求，我们介绍以下几种

zhangbingb 2020-09-21

Linux 入侵痕迹清理技巧

本文转载自微信公众号「 Bypass」，作者 Bypass 。在攻击结束后，如何不留痕迹的清除日志和操作记录，以掩盖入侵踪迹，这其实是一个细致的技术活。你所做的每一个操作，都要被抹掉;你所上传的工具，都应该被安全地删掉。编辑history记录文件，删除部分

HeronLinuxampARM 2020-09-14

为什么排序的复杂度为O(N log N)

基本上所有正而八经的算法教材都会解释像快速排序quicksort和堆排序heapsort这样的排序算法有多快，但并不需要复杂的数学就能证明你可以逐渐趋近的速度有多快。大多数计算机专业的科学家使用大写字母 O 标记来指代“趋近，直到到达一个常数比例因子”，这

美丽的泡沫 2020-09-08

Filebeat简介

Filebeat附带预构建的模块，这些模块包含收集、解析、充实和可视化各种日志文件格式数据所需的配置，每个Filebeat模块由一个或多个文件集组成，这些文件集包含摄取节点管道、Elasticsearch模板、Filebeat勘探者配置和Kibana仪表盘

goodstudy 2020-08-19

JS中DOM元素的操作

<button id="btn" class="btnlist" name="btn_n">点我一下</button>. innerHTML语法: ele.innerHTM

luvhl 2020-08-17

javascript解析json格式的数据方法详解

JSON 是一种简单的数据格式，比xml更轻巧。它是 JavaScript 原生格式，这意味着在 JavaScript 中处理 JSON 数据不需要任何特殊的 API 或工具包。那么如何用JavaScript来解析json呢？var o={“key”:”v

littleFatty 2020-08-16

MySQL是如何保证数据的完整性

数据的一致性和完整性对于在线业务的重要性不言而喻，如何保证数据不丢呢？今天我们就探讨下关于数据的完整性和强一致性，MySQL做了哪些改进。在Oracle和MySQL这种关系型数据库中，讲究日志先行策略,只要日志持久化到磁盘，就能保证MySQL异常重启后，数

gamestart0 2020-08-15

URML 2020-08-15

如何在JavaScript实现休眠或等待功能，实现sleep函数

JavaScript不具有 sleep() 函数，该函数会导致代码在恢复执行之前等待指定的时间段。JavaScript中没有 sleep() 方法，所以你可以尝试使用下一个最好的方法 setTimeout()。不幸的是，setTimeout() 不能像你

sfkong 2020-08-02

debian apach2 wsgi 自定义log logrotate 之后无权限访问

今天测试发现web打不开了，看下error.log发现是自定义log属主变成了root adm，apache2无法访问，后来搜索发现，apache2的日志由logrotate定期压缩备份清理，看了下/etc/logrotate.d/apache2 的配置有

82941732 2020-07-27

es6 Promise 对象、.then()

// resolve // 状态改成fulfilled. },=>{ // 第二个回调成功reject

whynotgonow 2020-07-26

filebeat配置文件

#input设置，支持Docker,Container,HTTP JSON,Log,Kafka,MQTT,NetFlow,Redis,TCP,DCP,Syslog,Stdin. #output设置，可以output到kafka,logstash,elast

偏头痛杨 2020-07-18

mysql的日志模块

连接器-------->分析器------->优化器--------->执行器-------->存储引擎 #如下图。一家商店有一个记账板，当赊账顾客多的时候，会临时记录在记账板上，避免频繁去记账本上查找更新对应顾客的信息。避免高峰

timewind 2020-07-04

Golang保存PostgreSQL数据至结构

db, err := sql.Open("postgres", "user=admin password=123456 dbname=test sslmode=disable"). if err != nil {.

89407707 2020-06-27

封装excel数据层代码，log模块导入

封装excel操作代码，提高复用率。整体封装思想阐述：。表内用例格式构建。首先获取表体第一行的数据组成的列表。之后逐条将表头与数据zip封包，之后转换为字典。从excel中读取的数据，除了数值，其他不管保存的时候什么格式，读取出来都是str. 解决该问参考

xiaoxiaoCNDS 2020-06-26

TypeScript（13）：元组

我们知道数组中元素的数据类型都一般是相同的，如果存储的元素数据类型不同，则需要使用元组。元组中元素使用索引来访问，第一个元素的索引值为 0，第二个为 1，以此类推第 n 个为 n-1，语法格式如下:. console.log // 返回元组的大小。m

lyjava 2020-06-26

TypeScript（06）：运算符

运算符用于执行程序代码运算，会针对一个以上操作数项目来进行运算。以上实例中 7、5 和 12 是操作数。关系运算符用于计算结果是否为 true 或者 false。逻辑运算符用于测定变量或值之间的逻辑。

ChaITSimpleLove 2020-06-25

nginx 日志切割

nginx 日志一般都是两种access.log error.log ，可以每个location 区域配置一份，也就是每个请求服务一个日志。它的日志不会自动切割，需要人为根据时间或者日志量切割。　　LOG_PATH=/opt/nginx/logs 　

Strongding 2020-06-25

安科网

kafka logManager类 kafka存储机制

russle

russle

相关推荐

Kafka源码解析（一）---LogSegment以及Log初始化

RabbitMQ如何保证消息的可靠投递？

Linux后台执行命令：&与nohup的用法

Linux 入侵痕迹清理技巧

为什么排序的复杂度为O(N log N)

Filebeat简介

JS中DOM元素的操作

javascript解析json格式的数据方法详解

MySQL是如何保证数据的完整性

mysql解决时区相关问题

如何在JavaScript实现休眠或等待功能，实现sleep函数

debian apach2 wsgi 自定义log logrotate 之后无权限访问

es6 Promise 对象、.then()

filebeat配置文件

mysql的日志模块

Golang保存PostgreSQL数据至结构

封装excel数据层代码，log模块导入

TypeScript（13）：元组

TypeScript（06）：运算符

nginx 日志切割

russle