Noah Hellman
242a64e3b4
attr: mv wildcard State use to Parser methods
...
avoid potential name conflict
2023-04-05 21:17:33 +02:00
Noah Hellman
f5724fcc9c
html: rm FilteredEvents
...
no longer useful as no peeking is needed, use simple early exit instead
2023-04-05 21:17:33 +02:00
Noah Hellman
4cacb61a68
jotdown_wasm: rm unmatched closing div
2023-04-05 21:17:33 +02:00
Noah Hellman
172f555272
attr: step one char at a time
...
make sure attr can keep track of all state so one char can be provided
at a time
this allows not restarting from beginning if we find out we need more
chars to finish parsing attributes
2023-04-05 21:17:33 +02:00
Noah Hellman
336927faef
html: avoid peek of next event
...
Try to make rendering of each event independent.
The only case where we need to peek is when a backref link should be
added to the last paragraph within a footnote.
Before, when exiting a paragraph, we would peek and add the link before
emitting the close tag if the next event is a the footnote end.
Now, the paragraph end event skips emitting a paragraph close tag if it
is within a footnote. The next event will then always close the
paragraph, and if it is a footnote end, it will add the backref link
before closing.
2023-04-05 21:17:33 +02:00
Noah Hellman
d175ab4f47
jotdown_wasm: rm debug print
2023-04-05 21:17:33 +02:00
Noah Hellman
e1b12ba642
attr: rename State::{Attribute->Key}
...
more accurate name
2023-04-05 21:17:33 +02:00
Noah Hellman
86ee4ee520
lex: rm lex::Kind::Whitespace
...
Whitespace tokens do not necessarily create new events but they work as
a delimiter for words with attributes and affect some container
delimiters. Now when we can read the source from inline, we can instead
inspect for whitespace when needed.
Removing the whitespace token allows the lexer to continue a lot longer
without stopping. E.g. a typical line in a paragraph with no special
characters can turn into a single token.
2023-04-05 21:17:33 +02:00
Noah Hellman
9454a2e393
inline: apply word attribute when flushing event buf
...
fixes issue with e.g `[text]({.cls})` where attributes get immediately
applied to `[text](` where link should have priority.
2023-04-05 21:17:33 +02:00
Noah Hellman
08ef15655b
inline: extract merge_str_events
...
decrease size of inline::Parser::next, make more readable
2023-04-05 21:17:33 +02:00
Noah Hellman
1b7bb25519
inline: allow reading src
2023-04-05 21:17:33 +02:00
Noah Hellman
8382fe122f
lib: derive Clone for Parser
...
should now be safe to do
2023-04-05 21:17:33 +02:00
Noah Hellman
3a1a3996e9
inline: take str per line instead of full inline iter
...
gets rid of DiscontinousChars which is large and requires cloning on
peek
resolves #4
2023-04-05 21:17:33 +02:00
Noah Hellman
8169feb1f6
inline: impl links w/o lookahead
...
needed to get rid of DiscontinuousChars
2023-04-05 21:17:33 +02:00
Noah Hellman
66d821f03e
inline: replace Delim by Opener
...
no need to have a delimiter for closer, only opener needs to be stored
in stack
2023-04-05 21:17:33 +02:00
Noah Hellman
ed5aed3759
inline: impl verbatim w/o lookahead
...
avoid lookahead for containers that can cross newline
needed to get rid of DiscontinuousChars iter
2023-04-05 21:17:33 +02:00
Noah Hellman
e8e551fd8b
inline: separate lex, span to separate Input object
...
easier handling of mutable pointers, can borrow self.input instead of
whole self
can e.g. borrow mutable state while still eating new tokens
2023-04-05 21:17:33 +02:00
Noah Hellman
5d6d0e0840
inline: use lex kinds
...
import Sequence and use Delimiter, Symbol consistently
2023-04-05 21:17:33 +02:00
Noah Hellman
f192ea2aa6
inline: use ? to flatten methods
2023-04-05 21:17:33 +02:00
Noah Hellman
491c5f2866
inline: always push events from parse methods
...
in order to more conveniently allow pushing in arbitrary order
parse methods now return an Option<()> that functions kind of like a
std::ops::ControlFlow. Some(()) means the token was parsed, None means
continue parsing.
2023-04-05 21:17:33 +02:00
Noah Hellman
9429f90307
inline: reuse event buffer between blocks
...
make sure not to allocate a new buffer on each block
2023-04-05 21:17:33 +02:00
Noah Hellman
1e5e56c463
only assert in debug builds
...
these are primarily used to detect bugs during e.g. fuzzing.
most of these asserts have negligible impact on performance, but if they
are not debug asserts it is not obvious that they dont affect
performance of release builds
2023-04-05 21:17:33 +02:00
Noah Hellman
3f011b9367
bench: add inline-attrs
2023-04-05 21:17:33 +02:00
Noah Hellman
443d6323c7
make: only rm symlink .dj files
2023-04-05 21:17:33 +02:00
Noah Hellman
d171cbb516
inline: add quote test
2023-04-05 21:17:33 +02:00
Noah Hellman
0a501fec10
lib: add test attr_inline_consecutive{,_invalid}
2023-04-05 21:17:33 +02:00
Noah Hellman
718f2df60e
lib: add test link_reference_multiline
2023-04-05 21:17:33 +02:00
Noah Hellman
66fd099af1
lib: add test attr_inline_multiline
2023-04-05 21:17:33 +02:00
Noah Hellman
722f549ffd
clippy: allow blocks_in_if_conditions everywhere
2023-04-05 21:17:33 +02:00
Noah Hellman
a603ea2124
lib: Add SpanLinkTag::Unresolved variant
...
keep the tag for unresolved links, and allow distinguishing between
`[tag][tag with empty url]` and `[tag][non-existent tag]`.
closes #26
2023-04-05 21:17:32 +02:00
Noah Hellman
05a4992d99
lib: don't prepend mailto to url in autolink Event
...
better to provide the original url, the event is already tagged as email
also avoids a string allocation
2023-04-05 21:17:04 +02:00
Noah Hellman
62e73100a6
bug fix: set LinkType to Email for email autolinks
2023-03-20 23:39:51 +01:00
Noah Hellman
e458955d00
PR #25 Fuzz for invalid HTML
...
Merge branch 'fuzz_html'
closes #25
2023-03-20 23:37:54 +01:00
Noah Hellman
ac49c92445
changelog: fix release link
2023-03-18 15:09:59 +01:00
Noah Hellman
6266e6eb49
ci: add fuzz html step
2023-03-17 19:01:29 +01:00
Noah Hellman
14065177ae
attr: fix name/key/value validation
...
match reference implementation
2023-03-17 18:57:36 +01:00
Noah Hellman
0719b2de65
block: fix class attribute parsing
...
match reference implementation
2023-03-17 18:57:36 +01:00
Noah Hellman
33d8215a2a
html: fix invalid html for footnote inside image
2023-03-17 18:57:36 +01:00
Noah Hellman
c6022004bb
html: fix alt text on nested images
2023-03-17 18:57:35 +01:00
Noah Hellman
fc374be56c
html: escape img src values
2023-03-17 18:57:10 +01:00
Noah Hellman
5768b24907
html: escape quotes in img alt text
2023-03-17 18:45:20 +01:00
Noah Hellman
bd831058f7
make: add afl_tmin target
...
minimize all failing cases to help debugging
2023-03-17 18:45:20 +01:00
Noah Hellman
0d884a65d5
afl: add debug feature
...
leave out debug prints when actually fuzzing to increase fuzz
performance
2023-03-17 18:45:20 +01:00
Noah Hellman
9a7c57f524
afl: add html target, checking for invalid html
2023-03-17 18:45:20 +01:00
Noah Hellman
8f70f596b9
afl: add main file
...
for testing that crashes have been resolved
previously, binary of the main crate was used, but targets may have more
validation than simply checking for panics
2023-03-17 18:45:20 +01:00
Noah Hellman
3e56903885
afl: mv parse target impl to lib
2023-03-17 18:45:20 +01:00
Noah Hellman
2606e2f4fc
afl: gen -> parse, parse only
2023-03-17 18:45:20 +01:00
Noah Hellman
648a6dbef2
lib: derive Clone for Event and Container
2023-03-16 20:05:20 +01:00
Noah Hellman
16491a4a99
PR #19 Support escapes in attributes
2023-03-12 10:25:31 +01:00
kmaasrud
e3f39d4b88
feat: support escapes in attributes
...
Related issue: #1
2023-03-12 08:19:56 +01:00