From 921cf3649638d91f9d50ee7c7e2ddf1eb2600da8 Mon Sep 17 00:00:00 2001 From: Mat Kelly Date: Mon, 13 Dec 2021 11:30:53 -0500 Subject: [PATCH] Add FastWARC (#114) * Update README.md * Capitalize the description to appease the linter --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 43f324d..5ae4c96 100644 --- a/README.md +++ b/README.md @@ -139,6 +139,7 @@ This list of tools and software is intended to briefly describe some of the most ### WARC I/O Libraries +* [FastWARC](https://github.com/chatnoir-eu/chatnoir-resiliparse) - A high-performance WARC parsing library (Python). * [HadoopConcatGz](https://github.com/helgeho/HadoopConcatGz) - A Splitable Hadoop InputFormat for Concatenated GZIP Files (and `*.warc.gz`). *(Stable)* * [jwarc](https://github.com/iipc/jwarc) - Reading and write WARC files with a typesafe API (Java). * [Jwat](https://sbforge.org/display/JWAT/JWAT) - Libraries and tools for reading/writing/validating WARC/ARC/GZIP files (Java). *(Stable)*