Kindle Highlights
Preface
Kindle
的 Clippings.txt
里面有一大堆重复的(子集关系)标记, 并且还有很多无用的冗余信息!
所以写了一个简单的脚本去处理。
Content
todo
- 书籍分类
const cfg = {
file: "clippings.txt",
book: "重启咲良田"
}
import fs from "node:fs";
function trim(s: string) {
return s.replace(/(^。*)|(^\s*)|(\s*$)/g, "");
}
let filename = cfg.file;
let content = fs.readFileSync(filename, "utf8");
let lines = content.split("==========\n")
.filter(s=>s.startsWith(cfg.book))
.map(s => trim(s.split("\n").slice(-2)[0]))
.filter(s => s !== "");
// console.log(lines)
let result: string[] = [lines[0]];
// 取内容最全的一行
for (let i = 1; i < lines.length; ++i) {
let l = result.length - 1;
if (lines[i].includes(result[l]))
result[l] = lines[i];
else if (!result[l].includes(lines[i]))
result.push(lines[i]);
}
// console.log(result);
fs.writeFileSync(filename + ".bak", result.join("\n"), "utf8");
本博客所有文章除特别声明外,均采用 CC BY-NC-SA 4.0 许可协议。转载请注明来自 静谧之园!