当前位置：首页 > news >正文

大白话解析LevelDB：ShardedLRUCache

news 2025/8/16 3:26:10

文章目录

- Cache 接口定义
- ShardedLRUCache 的实现
- - ShardedLRUCache 的构造函数
  - ShardedLRUCache::Insert(const Slice& key, void* value, size_t charge, void (\*deleter)(const Slice& key, void* value))
  - ShardedLRUCache::Lookup(const Slice& key)
  - ShardedLRUCache::Release(Handle* handle)
  - ShardLRUCache::Erase(const Slice& key)
  - ShardedLRUCache::Value(Handle* handle)
  - ShardedLRUCache::NewId()
  - ShardedLRUCache::Prune()
  - ShardedLRUCache::TotalCharge()
- 总结

ShardedLRUCache是Cache的一种实现，所以在看ShardedLRUCache的实现之前，我们需要先了解下Cache的定义。

Cache 接口定义

我们可以先看下Cache的使用姿势，再来看Cache的接口定义。

Cache* cache = NewLRUCache(100);
Handle* handle = cache->Insert("key", "value", 5, nullptr);
cache->Erase("key"); // 即使这里将缓存项从 Cache 中移除了，但该缓存项还是会保留在内存中
cache->Value(handle); // 此时还是可以通过 handle 获取到缓存项的 Value
cache->Release(handle); // 这里将缓存项的引用计数减一，如果引用计数为 0，那么该缓存项会被销毁。

Cache的接口定义如下：

class LEVELDB_EXPORT Cache {public:Cache() = default;Cache(const Cache&) = delete;Cache& operator=(const Cache&) = delete;// 用户调用 Insert() 方法插入一个缓存项时，会同时传入一个 deleter。// 当 Cache 析构时，会调用所有缓存项的 deleter 来对 Cache 里的// 所有缓存项进行清理。virtual ~Cache();// 声明了一个抽象的 Handle 类型，用于表示 Cache 中的一个缓存项。struct Handle {};// 这个方法用于向 Cache 中插入一对 Key-Value，Cache 的实现类会在内部// 基于这对 Key-Value 生成一个 Handle 对象，将该 Handle 插入到 Cache// 中，并把这个 Handle 的指针返回给用户。//// 返回 Handle 指针，相当于这个 Cache 缓存项的引用计数加一，即使另外一个线程// 将这个缓存项从 Cache 中移除了，该缓存项还是会保留在内存中。// 用户需要在使用完这个缓存项后，调用 Release() 方法来将该缓存项的引用计数减一。// 可以通过以下例子来理解://      Cache* cache = NewLRUCache(100);//      Handle* handle = cache->Insert("key", "value", 5, nullptr);//      cache->Erase("key"); // 即使这里将缓存项从 Cache 中移除了，但该缓存项还是会保留在内存中//      cache->Value(handle); // 此时还是可以通过 handle 获取到缓存项的 Value//      cache->Release(handle); // 这里将缓存项的引用计数减一，如果引用计数为 0，那么该缓存项会被销毁。//// charge 参数表示这个缓存项的大小，因为缓存项里只存储了 value 的指针，所以计算出// 该缓存项的大小，需要用户告知 Cache。virtual Handle* Insert(const Slice& key, void* value, size_t charge,void (*deleter)(const Slice& key, void* value)) = 0;// 通过 key 查找 Cache 中的缓存项://      - 如果找到了，返回该缓存项的 Handle 指针。//      - 同时将该缓存项的引用计数加一。如果没有找到，返回 nullptr。virtual Handle* Lookup(const Slice& key) = 0;// 将缓存项的引用计数减一，如果引用计数为 0，那么该缓存项会被销毁。virtual void Release(Handle* handle) = 0;// 返回缓存项的 Value。virtual void* Value(Handle* handle) = 0;// 指定一个 key，从 Cache 中移除一个缓存项。// 如果该缓存项的引用计数为 0，那么该缓存项会被销毁。virtual void Erase(const Slice& key) = 0;// 返回一个新的数字，作为该 Cache 的 ID。virtual uint64_t NewId() = 0;// 移除 Cache 中所有没有正在被使用的缓存项。// 比如在一些内存紧张的情况下，客户端可能会希望把 Cache 里没有正在被使用的缓存项移除，// 腾出一些内存空间。virtual void Prune() {}// 计算 Cache 中所有缓存项的大小之和。virtual size_t TotalCharge() const = 0;
};

ShardedLRUCache 的实现

ShardedLRUCache 的构造函数

ShardedLRUCache内部有一个LRUCache数组，LRUCache shard_[kNumShards]。

意思是ShardedLRUCache不是一个大Cache，而是将这个大的Cache shard 为多个小Cache，每个小Cache叫做一个shard。

所以叫做ShardedLRUCache。

// capacity: ShardedLRUCache 的总容量
// 根据总容量计算每个 shard 里的 LRUCache 的容量。
explicit ShardedLRUCache(size_t capacity) : last_id_(0) {// 计算 per_shard: 每个 shard 的容量。// per_shard = ⌈capacity / kNumShards⌉ (向上取整)const size_t per_shard = (capacity + (kNumShards - 1)) / kNumShards;for (int s = 0; s < kNumShards; s++) {// 给每个 shard 里的 LRUCache 设置容量。shard_[s].SetCapacity(per_shard);}
}

ShardedLRUCache::Insert(const Slice& key, void* value, size_t charge, void (deleter)(const Slice& key, void value))

计算key的hash值，然后根据hash值选择一个shard，将key插入到该shard的LRUCache中。

Handle* Insert(const Slice& key, void* value, size_t charge,void (*deleter)(const Slice& key, void* value)) override {// 计算 key 的 hash 值，然后根据 hash 值选择一个 shard，// 将 key 插入到该 shard 的 LRUCache 中。const uint32_t hash = HashSlice(key);return shard_[Shard(hash)].Insert(key, hash, value, charge, deleter);
}

HashSlice(key)的实现如下:

static inline uint32_t HashSlice(const Slice& s) { return Hash(s.data(), s.size(), 0); }

继续看Hash(s.data(), s.size(), 0)的实现:

LevelDB 在MurMurHash 算法的基础上做了一点修改，主要是为了提高性能。

相比于其他 Hash 算法， MurmurHash 对于规律性较强的 Key，随机分布特征表现更良好。

uint32_t Hash(const char* data, size_t n, uint32_t seed) {// Similar to murmur hashconst uint32_t m = 0xc6a4a793;const uint32_t r = 24;const char* limit = data + n;uint32_t h = seed ^ (n * m);// Pick up four bytes at a timewhile (data + 4 <= limit) {uint32_t w = DecodeFixed32(data);data += 4;h += w;h *= m;h ^= (h >> 16);}// Pick up remaining bytesswitch (limit - data) {case 3:h += static_cast<uint8_t>(data[2]) << 16;FALLTHROUGH_INTENDED;case 2:h += static_cast<uint8_t>(data[1]) << 8;FALLTHROUGH_INTENDED;case 1:h += static_cast<uint8_t>(data[0]);h *= m;h ^= (h >> r);break;}return h;
}

LRUCache::Insert(key, hash, value, charge, deleter)的实现可移步参考大白话解析LevelDB：LRUCache;

ShardedLRUCache::Lookup(const Slice& key)

Handle* Lookup(const Slice& key) override {// 使用与 Insert 相同的 Hash 算法计算 key 的 hash 值，// 找到对应的 shard，然后在该 shard 的 LRUCache 中查找 key。const uint32_t hash = HashSlice(key);return shard_[Shard(hash)].Lookup(key, hash);
}

ShardedLRUCache::Release(Handle* handle)

void Release(Handle* handle) override {// Handle 中已经存好 hash 值了，不需要重新计算。// 找到对应的 shard，然后让该 shard 的 LRUCache 释放 handle。LRUHandle* h = reinterpret_cast<LRUHandle*>(handle);shard_[Shard(h->hash)].Release(handle);
}

我们可以移步到大白话解析LevelDB：LRUCache看下LRUHandle的定义，LRUCache::Insert(key, hash, value, charge, deleter)里会将key和hash生成一个LRUHandle缓存项，该缓存项里存储了非常多的信息。

所以只要拿到handle，就可以直接读取出该缓存项的hash值了，不需要重新计算。

ShardLRUCache::Erase(const Slice& key)

void Erase(const Slice& key) override {// 使用与 Insert 相同的 Hash 算法计算 key 的 hash 值，// 找到对应的 shard，然后让该 shard 的 LRUCache 移除 key。const uint32_t hash = HashSlice(key);shard_[Shard(hash)].Erase(key, hash);
}

ShardedLRUCache::Value(Handle* handle)

handle里已经存储了value，从handle中直接获取即可。

void* Value(Handle* handle) override { return reinterpret_cast<LRUHandle*>(handle)->value; }

ShardedLRUCache::NewId()

返回一个新的数字，作为该 Cache 的 ID。

uint64_t NewId() override {MutexLock l(&id_mutex_);return ++(last_id_);
}

ShardedLRUCache::Prune()

没啥好说的，挨个调用每个 shard 里LRUCache的LRUCache::Prune()。

void Prune() override {for (int s = 0; s < kNumShards; s++) {shard_[s].Prune();}
}

ShardedLRUCache::TotalCharge()

没啥好说的，挨个调用每个 shard 里LRUCache的LRUCache::TotalCharge()。

size_t TotalCharge() const override {size_t total = 0;for (int s = 0; s < kNumShards; s++) {total += shard_[s].TotalCharge();}return total;
}

总结

可以看到，ShardedLRUCache只是一个LRUCache的封装，包含多个LRUCache shard。

插入、查找、删除等操作都是基于LRUCache的操作，只是在操作之前，需要先计算出key的hash值，然后根据hash值选择一个shard，再在该shard的LRUCache中进行操作。

接下来我们移步看下LRUCache的实现: 大白话解析LevelDB：LRUCache。

查看全文

http://www.lryc.cn/news/311734.html

GDOI2024游记

学编程怎么样才能更快入手，编程怎么简单易学

Android 通知--判断通知是否有跳转

【计算机网络】IO多路转接之poll

性能比较：in和exists

【Java设计模式】五、建造者模式

nginx代理minio教程避坑过的教程避开SignatureDoesNotMatch

Linux进程详细介绍

2024年3月产品认证基础考试简答题及答案

cuda python torch 虚拟环境配置

激光炸弹刷题笔记

Vue3学习记录（三）--- 组合式API之生命周期和模板引用

Batch Normalization和Layer Normalization和Group normalization

命名实体识别NER（综合代码示例）

关于jQuery日历插件：daterangepicker

【贪心算法】最大子序和

爬虫逆向网站案例

蓝桥杯复习之前缀和

动态规划（算法竞赛、蓝桥杯）--背包DP求具体方案

go写mysql取得自增字段值

Vue 3的Composition API和vue2的不同之处

go语言基础 -- 面向对象编程

HarmonyOS—编译构建概述

下载element-ui 资源，图标 element-icons.woff，element-icons.ttf 无法解码文件字体

Flutter整体框架