From 67da99861f5ff0a65016cbc7904d37fb3aa4c013 Mon Sep 17 00:00:00 2001 From: Teddy Wing Date: Sat, 2 Nov 2019 03:42:46 +0100 Subject: get_urls_from_pdf: Test extracted URLs Add a test with a simple text-only PDF with three URLs. Currently I'm getting the following failure, so visibly the order is not necessarily the same as the visible order, and multi-line hyperlinks can be encoded as two link areas: ---- tests::get_urls_from_pdf_extracts_urls_from_pdf stdout ---- thread 'tests::get_urls_from_pdf_extracts_urls_from_pdf' panicked at 'assertion failed: `(left == right)` left: `["http://www.gutenberg.org/ebooks/11", "https://ia800908.us.archive.org/6/items/alicesadventures19033gut/19033-h/images/i002.jpg", "https://science.nasa.gov/news-article/black-hole-image-makes-history"]`, right: `["http://www.gutenberg.org/ebooks/11", "https://science.nasa.gov/news-article/black-hole-image-makes-history", "https://ia800908.us.archive.org/6/items/alicesadventures19033gut/19033-h/images/i002.jpg", "https://ia800908.us.archive.org/6/items/alicesadventures19033gut/19033-h/images/i002.jpg"]`', src/lib.rs:65:9 --- testdata/Alice's Adventures in Wonderland.odt | Bin 0 -> 18472 bytes testdata/Alice's Adventures in Wonderland.pdf | Bin 0 -> 23262 bytes 2 files changed, 0 insertions(+), 0 deletions(-) create mode 100644 testdata/Alice's Adventures in Wonderland.odt create mode 100644 testdata/Alice's Adventures in Wonderland.pdf (limited to 'testdata') diff --git a/testdata/Alice's Adventures in Wonderland.odt b/testdata/Alice's Adventures in Wonderland.odt new file mode 100644 index 0000000..09d8469 Binary files /dev/null and b/testdata/Alice's Adventures in Wonderland.odt differ diff --git a/testdata/Alice's Adventures in Wonderland.pdf b/testdata/Alice's Adventures in Wonderland.pdf new file mode 100644 index 0000000..47c673c Binary files /dev/null and b/testdata/Alice's Adventures in Wonderland.pdf differ -- cgit v1.2.3