Questions: Structure aware fuzzing, using internal functions and more #32

pauldreik · 2019-09-29T09:43:13Z

Hi!
I have played around with the fuzzers and have several variations and new fuzzers on my private repo. I wanted to file an issue here to have a discussion if and how to upstream this. I also have some thoughts, so I figured I post everything here in case anyone want to comment.

To prove this is not just hot air, here are the issues I have found, all luckily pretty harmless:

doh: fix undefined behaviour in swap curl#4350 (undefined behaviour)
Fix harmless two byte buffer write overflow in doh_encode curl#4352 (buffer overflow in doh)
doh: clean up dangling DOH handles on easy close curl#4366 (doh resource leak)
cookie: pass in the correct cookie amount to qsort() curl#4386 (qsort on wild cookies)
urlapi: avoid index underflow for short ipv6 hostnames curl#4389 (index underflow)
doh: respect timeout and restrict debug builds to http and https curl#4406 (timeout in doh)
cookie: avoid harmless use after free curl#4454 (harmless use after free)

Cmake and C++

To make debugging and development easy, I use C++ (it's my primary language) and cmake because it is easy to get an IDE (I use qtcreator) to navigate the code, debug and build it. It is easy to work with, since one can easily set a breakpoint in the debugger when trying to get the fuzzer to reach a certain place in the code. I also refactored the fuzzer to use RAII style so the cleanup is easier.

Using boost asio instead of cstyle select

I use asio to asynchronously wait for network events. Since I am used to it and it is the most well known way to network in C++, it is a good choice. I think I also managed to handle timeouts in a way such that the fuzzing speed is improved, but this is a tricky topic since it involves interaction between curl, asio and waiting for the OS.
I think it is a good foundation to build on, since it is possible to for instance interface openssl through asio, which would open up for fuzzing the contents of https instead of the encrypted layer as the current fuzzers do.

Internal fuzzers

The existing fuzzers use curl as just any libcurl user. I wanted to stress test some functions directly, so I put some fuzzers which access curl from within. I do this by adding an optional section inside a curl CMakefile, which includes files from the curl-fuzzer repo. This will uglify curl, but it is by default off and placed within if (ENABLE_FUZZING) sections.

The benefit is that these fuzz targets can be very focused, and do not need to go through being invented by the TLV fuzzer and sent over the socket. Not all functions can be reached by the existing fuzzers.

I add internal fuzzers for

cookie handling
doh payload encode/decode
escape/unescape
netrc parsing

I think it may be a good idea to fuzz setting curloptions here, so one does not have to let that happen only through the existing fuzzers.

Fuzzing a single function like this, for instance the doh parser, is what libFuzzer is excellent at. It will not be limited by network timeouts or tearing up/down sockets.

These internal fuzzers cover almost everything within a few hours, so these are nice to have, but could perhaps run through a CI build instead of wasting slots at oss fuzz (unless they schedule fuzzing effort to avoid it, I do not know).

Structure aware fuzzing

The fuzz data uses TLV (type length value) which means the default mutation strategy will very likely break the content. Most of the input the fuzzer engine generates is most likely garbage, and has to be rejected while unpacking the TLV during setting curl options.

This is inefficient, but it also makes it difficult for the fuzzing engine to make meaningful input since both the type, content and alignment with other blocks have to match.

So I wrote a custom mutator, which parses the TLV while throwing away parts that don't make sense. With a list of blocks, it can now apply mutation on a single block, then serializing it again. It also implements the custom crossover, for mixing two test cases.

This works, but I have a hard time evaluating it since it takes a very long time to build up the corpus for the existing fuzzers. It sure finds new paths when starting from empty, so I figure if it can do that, it will also be good at exposing cases not yet known.

A proposal

I think it is good to discuss before sending pull requests. Here is my suggestion:

the existing fuzzers are left as is, with build system and all. they work well, and until "proven" it does not make sense to remove them
a new fuzzer build is made in parallel to the existing. It uses cmake.
the fuzzers which do what the existing ones do, but with the custom mutator, are added to the new fuzzer build
the internal fuzzers are added (but maybe not added by default to ossfuzz)

What do I want help with now

I would like to get either access to the corpus from oss fuzz or a copy of it, so I can see which parts of curl are not fuzzed and evaluate the new fuzzers I write. I think the link is this one, based on another project I work with. It needs login through one of the admins: https://console.cloud.google.com/storage/browser/curl-backup.clusterfuzz-external.appspot.com

Thanks,
Paul

The text was updated successfully, but these errors were encountered:

cmeister2 · 2019-09-29T11:25:01Z

Cmake and C++

To make debugging and development easy, I use C++ (it's my primary language) and cmake because it is easy to get an IDE (I use qtcreator) to navigate the code, debug and build it. It is easy to work with, since one can easily set a breakpoint in the debugger when trying to get the fuzzer to reach a certain place in the code. I also refactored the fuzzer to use RAII style so the cleanup is easier.

I think that given everything thus far is in Automake, adding CMake is going to be a tricky sell. I have no issue with moving towards a more RAII-style setup, however; when I wrote this originally I was coming from a very C oriented background.

Using boost asio instead of cstyle select

I use asio to asynchronously wait for network events. Since I am used to it and it is the most well known way to network in C++, it is a good choice. I think I also managed to handle timeouts in a way such that the fuzzing speed is improved, but this is a tricky topic since it involves interaction between curl, asio and waiting for the OS.
I think it is a good foundation to build on, since it is possible to for instance interface openssl through asio, which would open up for fuzzing the contents of https instead of the encrypted layer as the current fuzzers do.

Unsure about this. You might be used to this but most libcurl examples use select(). I'm basically worried about maintaining and extending any code added here.

Internal fuzzers

The existing fuzzers use curl as just any libcurl user. I wanted to stress test some functions directly, so I put some fuzzers which access curl from within. I do this by adding an optional section inside a curl CMakefile, which includes files from the curl-fuzzer repo. This will uglify curl, but it is by default off and placed within if (ENABLE_FUZZING) sections.

The benefit is that these fuzz targets can be very focused, and do not need to go through being invented by the TLV fuzzer and sent over the socket. Not all functions can be reached by the existing fuzzers.

I add internal fuzzers for
* cookie handling

* doh payload encode/decode

* escape/unescape

* netrc parsing
I think it may be a good idea to fuzz setting curloptions here, so one does not have to let that happen only through the existing fuzzers.

Fuzzing a single function like this, for instance the doh parser, is what libFuzzer is excellent at. It will not be limited by network timeouts or tearing up/down sockets.

These internal fuzzers cover almost everything within a few hours, so these are nice to have, but could perhaps run through a CI build instead of wasting slots at oss fuzz (unless they schedule fuzzing effort to avoid it, I do not know).

We have I think one internal fuzzer already (curl_fnmatch) which doesn't need anything special patching in libcurl. I'd obviously like to see more internal fuzzers! Hopefully these can be added in a similar way to the fnmatch fuzzer.

Structure aware fuzzing

The fuzz data uses TLV (type length value) which means the default mutation strategy will very likely break the content. Most of the input the fuzzer engine generates is most likely garbage, and has to be rejected while unpacking the TLV during setting curl options.

This is inefficient, but it also makes it difficult for the fuzzing engine to make meaningful input since both the type, content and alignment with other blocks have to match.

So I wrote a custom mutator, which parses the TLV while throwing away parts that don't make sense. With a list of blocks, it can now apply mutation on a single block, then serializing it again. It also implements the custom crossover, for mixing two test cases.

This works, but I have a hard time evaluating it since it takes a very long time to build up the corpus for the existing fuzzers. It sure finds new paths when starting from empty, so I figure if it can do that, it will also be good at exposing cases not yet known.

At this point in time it might be worth looking into data formats that are easier to fuzz. One of the nice things about TLVs obviously is that they allow null bytes, which are historically fun to fuzz with.

Are you talking about adding a custom mutator as per https://github.com/google/fuzzing/blob/master/docs/structure-aware-fuzzing.md ? If so, that is quite interesting. Would love to see a PR on that.

A proposal

I think it is good to discuss before sending pull requests. Here is my suggestion:
* the existing fuzzers are left as is, with build system and all. they work well, and until "proven" it does not make sense to remove them

Agreed

* a new fuzzer build is made in parallel to the existing. It uses cmake.

Honestly, not a fan of this. This just appears to be adding cmake into the mix because you want to use cmake, not for any technical reason. Having had exposure to all sorts of build systems, I'm reasonably certain that anything you can do in cmake you can do in Automake.

* the fuzzers which do what the existing ones do, but with the custom mutator, are added to the new fuzzer build

I think we should start with http, and in automake we just create a curl_http_cmut_fuzzer target, which just adds in a new file containing the mutator code. I can check the stats overnight between the two fuzzers.

* the internal fuzzers are added (but maybe not added by default to ossfuzz)

Assuming we can write these in such a way that doesn't require curl hackery, it'd be good to get these into the ossfuzz build regardless.

What do I want help with now

I would like to get either access to the corpus from oss fuzz or a copy of it, so I can see which parts of curl are not fuzzed and evaluate the new fuzzers I write. I think the link is this one, based on another project I work with. It needs login through one of the admins: https://console.cloud.google.com/storage/browser/curl-backup.clusterfuzz-external.appspot.com

Thanks,
Paul

I can perhaps update the corpus in the repository; it's been a while since I've done that. Alternatively you could just monitor the coverage report from oss-fuzz: https://storage.googleapis.com/oss-fuzz-coverage/curl/reports/20190927/linux/report.html. These URLs are apparently unsecured, so you could monitor the coverage in response to any fixes that get made.

pauldreik · 2019-09-29T13:47:59Z

The internal fuzzers

These have to go "inside" curl, since they need internal headers which are hidden otherwise (I might be wrong, but I am completely new to curl until I started fuzzing this). I use functions from doh.h, for instance, which are inside UNITTEST macros. I think I tried to hack around it, but then I needed curlconfig.h etc so I did it this way. No big harm, see below under the CMake section below.

I took a look at the fnmem and it seemed to be copied, I tried to mimick that for the doh fuzzer but could not get it working. Instead I went for adding to the existing cmake system in curl.

CMake

Honestly, not a fan of this. This just appears to be adding cmake into the mix because you want to use cmake, not for any technical reason. Having had exposure to all sorts of build systems, I'm reasonably certain that anything you can do in cmake you can do in Automake.

There is already CMake support in place in curl, although a bit sketchy, but it works well enough. To be able to get this to work with building the fuzzers from inside instead of outside like the existing, I had to do the following:

add C++ as language in curl/CMakeLists.txt
add this to curl/CMakeLists.txt

option(ENABLE_FUZZING "enables fuzzing" Off)

in curl/CMakeLists.txt

add

if(ENABLE_FUZZING)
    include(${CMAKE_SOURCE_DIR}/../curl-fuzzer/intree_fuzzer/CMakeLists.txt)
else()

The changes above can even be patched dynamically in the fuzzer build script, if one does not want to change curl in any way due to this. I do not know how many users the cmake curl build has and if they would be upset by adding C++ as a language?

The cmake for building the fuzzers (in curl-fuzzer/intree_fuzzer/CMakeLists.txt ) is not hard at all, it is about 100 sparse lines of vanilla cmake.

I understand the resistance to yet another build system but cmake enables easy working in an IDE as well as having separated build/source directories. It makes developing fuzzers much easier when having debugging, IDE and code completion working straight out of the box.

Now, curl already has cmake going and if we agree that the existing fuzzers should continue living in parallel, I do not see much harm in having these extra fuzzers build with cmake? The benefit for me is huge. I use autotools in another project I am the author and maintainer of, and it has taught me that cmake is a better option for me. Will an auto tools based solution be less than 100 lines of code?

Considering that the new fuzzers are hooking into the existing build system of curl instead of being a parallel build conf as the existing fuzzers, is there any benefit of writing a new autotools based build to avoid 100 lines of cmake?

How do you develop and debug the fuzzers, do you use an IDE or perhaps an emacs/gdb style setup? Everyone is different, I settled for using an IDE and that works good for me and won't stop anyone using gdb on the command line or some other editor of their choice.

TLV and custom mutation

At this point in time it might be worth looking into data formats that are easier to fuzz. One of the nice things about TLVs obviously is that they allow null bytes, which are historically fun to fuzz with.
I do not think it is possible to get it easier by switching format, the important thing is adding functionality so the mutation can be efficient. Keeping the TLV so one can exchange data between the existing and new fuzzers is great, because it means one can take corpus from one to the other, or let them fuzz in parallel sharing output folder. They can combine strengths that way.

Are you talking about adding a custom mutator as per https://github.com/google/fuzzing/blob/master/docs/structure-aware-fuzzing.md ? If so, that is quite interesting. Would love to see a PR on that.
Yes, that is exactly what I am doing.
Thanks for the prompt answer!

Boost Asio

I understand having the select based approach is good for maintenance. But if it would be possible to get https fuzzing and higher performance, wouldn't it be worth it? I image this would be a separate fuzzer, keeping the existing ones as is. So it is more of adding than replacing. Asio is not just another random library, it is being standardized into C++ and it is very well proven.

Corpus

I can perhaps update the corpus in the repository; it's been a while since I've done that. Alternatively you could just monitor the coverage report from oss-fuzz: https://storage.googleapis.com/oss-fuzz-coverage/curl/reports/20190927/linux/report.html. These URLs are apparently unsecured, so you could monitor the coverage in response to any fixes that get made.

Thanks for that link, I did not know those were public! That helps a lot, but is there anything preventing you from downloading the corpus and providing me with a copy? It makes things much easier for me. Or allowing me to add myself as cc on the oss-fuzz config, then I can make a PR on oss fuzz and then download it myself. I am not sure having the corpus in the repo is good, I do this myself but it tends to become huge diffs which makes it harder to navigate around the git diffs (suddenly there is 1000 pages of corpus change I have to ignore).

Http fuzzer

I can see from the link you sent that the the following areas are not fuzzed

proxy
time
partial uploads
some kinds of authentication (not counting the windows only)?

I have added proxy and time myself, but I need a good corpus to know if I can't reach the right places because of a poor corpus/lack of cpu time, or the fuzzer target is not correct.

pauldreik · 2019-09-29T13:58:31Z

I just saw that the existing fuzzers have made doh requests which I added 14 days ago #30, but still have not gotten a working reply! That says something about how long time it takes to fuzz. It would be lovely to see if structured aware fuzzing would be able to find that faster.

https://storage.googleapis.com/oss-fuzz-coverage/curl/reports/20190927/linux/src/curl/lib/doh.c.html#L622

cmeister2 · 2019-09-30T22:18:03Z

The internal fuzzers

These have to go "inside" curl, since they need internal headers which are hidden otherwise (I might be wrong, but I am completely new to curl until I started fuzzing this). I use functions from doh.h, for instance, which are inside UNITTEST macros. I think I tried to hack around it, but then I needed curlconfig.h etc so I did it this way. No big harm, see below under the CMake section below.

If you can provide the source of one of these fuzzers I can see if it can be added as is?

CMake

Honestly, not a fan of this. This just appears to be adding cmake into the mix because you want to use cmake, not for any technical reason. Having had exposure to all sorts of build systems, I'm reasonably certain that anything you can do in cmake you can do in Automake.

There is already CMake support in place in curl, although a bit sketchy, but it works well enough. To be able to get this to work with building the fuzzers from inside instead of outside like the existing, I had to do the following:
...

The changes above can even be patched dynamically in the fuzzer build script, if one does not want to change curl in any way due to this. I do not know how many users the cmake curl build has and if they would be upset by adding C++ as a language?

Unknown, @bagder might know.

The cmake for building the fuzzers (in curl-fuzzer/intree_fuzzer/CMakeLists.txt ) is not hard at all, it is about 100 sparse lines of vanilla cmake.

I understand the resistance to yet another build system but cmake enables easy working in an IDE as well as having separated build/source directories. It makes developing fuzzers much easier when having debugging, IDE and code completion working straight out of the box.

Now, curl already has cmake going and if we agree that the existing fuzzers should continue living in parallel, I do not see much harm in having these extra fuzzers build with cmake? The benefit for me is huge. I use autotools in another project I am the author and maintainer of, and it has taught me that cmake is a better option for me. Will an auto tools based solution be less than 100 lines of code?

Sorry, but personally I don't think it flies to introduce a new build system which is the only way of building a set of fuzzers. I'd be happy if you wanted to provide a CMake file as well as Automake rules, because that's how it's done in the main curl repository. I'm also happy if you want to provide the source of the fuzzers and have me write the Automake rules to compile them.

I'm not entirely sure why CMake requires you to hack around the libcurl CMake files either; can't CMake just link to the augmented libcurl compiled and installed during the build process?

Considering that the new fuzzers are hooking into the existing build system of curl instead of being a parallel build conf as the existing fuzzers, is there any benefit of writing a new autotools based build to avoid 100 lines of cmake?

I think it's more like 5 lines per fuzzer assuming the fuzzers can be written like fnmatch.

How do you develop and debug the fuzzers, do you use an IDE or perhaps an emacs/gdb style setup? Everyone is different, I settled for using an IDE and that works good for me and won't stop anyone using gdb on the command line or some other editor of their choice.

Personally, gdb and Sublime.

Boost Asio

I understand having the select based approach is good for maintenance. But if it would be possible to get https fuzzing and higher performance, wouldn't it be worth it? I image this would be a separate fuzzer, keeping the existing ones as is. So it is more of adding than replacing. Asio is not just another random library, it is being standardized into C++ and it is very well proven.

If it's a separate fuzzer then I don't really see a problem here apart from maintenance. Almost certainly worth getting in.

Corpus

I can perhaps update the corpus in the repository; it's been a while since I've done that. Alternatively you could just monitor the coverage report from oss-fuzz: https://storage.googleapis.com/oss-fuzz-coverage/curl/reports/20190927/linux/report.html. These URLs are apparently unsecured, so you could monitor the coverage in response to any fixes that get made.

Thanks for that link, I did not know those were public! That helps a lot, but is there anything preventing you from downloading the corpus and providing me with a copy? It makes things much easier for me. Or allowing me to add myself as cc on the oss-fuzz config, then I can make a PR on oss fuzz and then download it myself. I am not sure having the corpus in the repo is good, I do this myself but it tends to become huge diffs which makes it harder to navigate around the git diffs (suddenly there is 1000 pages of corpus change I have to ignore).

Probably worth getting @bagder's input here because I don't know what the policies are surrounding oss-fuzzing and HackerOne bounties. I've personally recused myself from submitting faults in curl due to fuzzing discovered issues; if I didn't I'd certainly have a head on the competition by having access to the corpus.

Http fuzzer

I can see from the link you sent that the the following areas are not fuzzed
* proxy

* time

* partial uploads

* some kinds of authentication (not counting the windows only)?
I have added proxy and time myself, but I need a good corpus to know if I can't reach the right places because of a poor corpus/lack of cpu time, or the fuzzer target is not correct.

In general I hand craft corpus entries myself if I know I'm trying to hit something specific; have you tried doing this?

cmeister2 · 2019-09-30T22:18:42Z

I just saw that the existing fuzzers have made doh requests which I added 14 days ago #30, but still have not gotten a working reply! That says something about how long time it takes to fuzz. It would be lovely to see if structured aware fuzzing would be able to find that faster.

https://storage.googleapis.com/oss-fuzz-coverage/curl/reports/20190927/linux/src/curl/lib/doh.c.html#L622

As per above - have you tried handcrafting a corpus file here?

pauldreik · 2019-10-02T06:16:38Z

Sorry, but personally I don't think it flies to introduce a new build system which is the only way of building a set of fuzzers. I'd be happy if you wanted to provide a CMake file as well as Automake rules, because that's how it's done in the main curl repository. I'm also happy if you want to provide the source of the fuzzers and have me write the Automake rules to compile them.

Thanks, that works fine with me!
I will put my fuzzers up, and it will be easier to discuss.

If it's a separate fuzzer then I don't really see a problem here apart from maintenance. Almost certainly worth getting in.

Probably worth getting @bagder's input here because I don't know what the policies are surrounding oss-fuzzing and HackerOne bounties. I've personally recused myself from submitting faults in curl due to fuzzing discovered issues; if I didn't I'd certainly have a head on the competition by having access to the corpus.

I understand your position on this. Not much to do then. In case you have an old corpus lying around, that would help as well.

In general I hand craft corpus entries myself if I know I'm trying to hit something specific; have you tried doing this?

Yes, and while possible I prefer to have the fuzzer being able to detect new input for itself - if it can't find input easily, it will probably not find other interesting input easily. Which was shown by the doh response example, which still had not found the reply even after 14 days.

As per above - have you tried handcrafting a corpus file here?

I might misremember since I have worked with X things in different directions, but I think I temporarily restricted the fuzzer to force use of doh and fuzzing only the response. This is a prime example of how setting a breakpoint in the debugger at the doh parse function makes it very nice to confirm that the fuzzer actually reaches there.

pauldreik · 2019-10-02T20:08:15Z

I tried to clean up my code a bit and add some comments, but I publish it here right now instead of polishing it, so we can continue the discussion. Should I perhaps make a pull request (not intended for merge), for ease of putting comments in the code etc?

I use Debian Buster and Ubuntu 18.04 for developing and running this.

Build instructions are here, I have not had time to do a fresh build on an independent machine so they might have some bug: https://github.com/pauldreik/curl-fuzzer/blob/paul/localfuzz_public0/intree_fuzzer/README.md

The asio fuzzer: https://github.com/pauldreik/curl-fuzzer/blob/paul/localfuzz_public0/intree_fuzzer/src/asiofuzzer/AsioHttpFuzzer.cpp

The internal function fuzzers: https://github.com/pauldreik/curl-fuzzer/tree/paul/localfuzz_public0/intree_fuzzer/src/insidefuzzers

The custom mutator: https://github.com/pauldreik/curl-fuzzer/blob/paul/localfuzz_public0/intree_fuzzer/src/networkfuzzers/TLVMutator.hpp

Here is the curl branch: https://github.com/pauldreik/curl/tree/paul/localfuzz_public0
Here is the curl-fuzzer branch: https://github.com/pauldreik/curl-fuzzer/tree/paul/localfuzz_public0

pauldreik · 2019-10-03T17:38:41Z

And while polishing up the fuzz data, I discovered another issue I had not seen before:
curl/curl#4454 (harmless use after free)

cmeister2 · 2019-10-08T10:09:54Z

Thanks for this - sorry I haven't responded sooner.

The autoformatter has made comparing the code a bit more tricky than expected; the current code is "curl"-style, so I don't know whether we want to continue to enforce that in this repository or not (I think we do; I should see if there's a clang-format that will enforce "curl"-style). Regardless, I think there's definitely a bunch of solid ideas here that can be worked on.

From my perspective I think I'd like to see a starting series of PRs here:

De-C-ifying the existing code (Turning FUZZ_DATA into a class and adding the sockettwin class)
Adding CMakeLists with regressible Travis testing to ensure that they don't become stale.

Separately I'd like to see if these internal function fuzzers can be simply added within the current architecture.

Actions: I think I may have some spare time this week on Wednesday evening to be able to look at the internal function fuzzers. I'd like to see whether it's possible to add them in a similar way to the existing fnmatch fuzzer (that is currently disabled, but should still compile). If so, getting these in ASAP as examples feels like a really good start.

pauldreik · 2019-10-20T11:00:41Z

No worries, I have been busy myself!

A clang format would be wonderful. Is it possible to use the one on the curl wiki?
Do you mean that you want me to make a PR for DE-C-ifying and adding CMakelist, or did you plan it yourself?
Where did you mean the travis testing should run, on this curl-fuzzer repo or the curl repo? The latter seems quite bogged down already...

Separately I'd like to see if these internal function fuzzers can be simply added within the current architecture.

Did you have a go at this? I guess the outcome of that influences decisions on going forward with cmake?

I am stuck on curl/curl#4463. I just added a much smaller test case to make it easier to troubleshoot (I don't know how to fix it myself). I think it is good to have proxy fuzzing so I do not want to disable it if I can, but I suspect it does not fit very well into the current structure of the fuzzer since the number of connections may be exhausted, especially if doh is used. With the asio fuzzer, it should work easier but that has other problems (ad hoc structure of the input data since I wanted to try some ideas).

cmeister2 · 2019-10-23T14:49:27Z

Will follow up shortly here - have not been well recently so am only just getting back to tech things.

pauldreik · 2019-10-26T08:44:47Z

Will follow up shortly here - have not been well recently so am only just getting back to tech things.

Feeling well is more important than fuzzing curl. Hope you get better soon, take your time!

cmeister2 · 2019-11-03T11:35:53Z

* A clang format would be wonderful. Is it possible to use the one [on the curl wiki](https://github.com/curl/curl/wiki/clang-format-style)?

Unsure to be honest. I have only used the perl script that does style-checking.

* Do you mean that you want me to make a PR for DE-C-ifying and adding CMakelist, or did you plan it yourself?

It'd be great if you could - it makes life a lot easier to reason about changes in small batches of well defined functional commits!

* Where did you mean the travis testing should run, on this curl-fuzzer repo or the curl repo? The latter seems quite bogged down already...

The curl-fuzzer repo. Ideally we'd have some way of compiling using CMakeLists if possible. Unfortunately this repository does share runners with Travis, so we'll still hit the problem of having too many runners in use.

Separately I'd like to see if these internal function fuzzers can be simply added within the current architecture.

Did you have a go at this? I guess the outcome of that influences decisions on going forward with cmake?

Aiming to do this today, I know I've been slack on this.

cmeister2 · 2019-11-03T13:15:51Z

I've had a look at the UT fuzzing; it looks like the fnmatch code was done pretty hackily (copy a header file into a shared folder).

What I might propose here is actually adding a feature to curl so that we expose an interface to a bunch of internal functions when a feature flag is passed at configuration time. This would require a (hopefully) simple bit of curl function, with a header file that gets installed to access all of these fuzzing interfaces.

I'll propose it on IRC and see if @bagder is interested.

pauldreik changed the title ~~Questions: Structure aware fuzzing, using internal functions~~ Questions: Structure aware fuzzing, using internal functions and more Sep 29, 2019

cmeister2 mentioned this issue Nov 5, 2019

Discussion: how to allow access to internal functions for fuzzing testing curl/curl#4561

Closed

pauldreik mentioned this issue Nov 16, 2019

doh: zero length hostname causes buffer read underflow curl/curl#4605

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions: Structure aware fuzzing, using internal functions and more #32

Questions: Structure aware fuzzing, using internal functions and more #32

pauldreik commented Sep 29, 2019 •

edited

Loading

cmeister2 commented Sep 29, 2019

Cmake and C++

Using boost asio instead of cstyle select

Internal fuzzers

Structure aware fuzzing

A proposal

What do I want help with now

pauldreik commented Sep 29, 2019 •

edited

Loading

pauldreik commented Sep 29, 2019

cmeister2 commented Sep 30, 2019

The internal fuzzers

CMake

Boost Asio

Corpus

Http fuzzer

cmeister2 commented Sep 30, 2019

pauldreik commented Oct 2, 2019 •

edited

Loading

pauldreik commented Oct 2, 2019 •

edited

Loading

pauldreik commented Oct 3, 2019

cmeister2 commented Oct 8, 2019

pauldreik commented Oct 20, 2019

cmeister2 commented Oct 23, 2019

pauldreik commented Oct 26, 2019

cmeister2 commented Nov 3, 2019

cmeister2 commented Nov 3, 2019

Questions: Structure aware fuzzing, using internal functions and more #32

Questions: Structure aware fuzzing, using internal functions and more #32

Comments

pauldreik commented Sep 29, 2019 • edited Loading

Cmake and C++

Using boost asio instead of cstyle select

Internal fuzzers

Structure aware fuzzing

A proposal

What do I want help with now

cmeister2 commented Sep 29, 2019

Cmake and C++

Using boost asio instead of cstyle select

Internal fuzzers

Structure aware fuzzing

A proposal

What do I want help with now

pauldreik commented Sep 29, 2019 • edited Loading

The internal fuzzers

CMake

TLV and custom mutation

Boost Asio

Corpus

Http fuzzer

pauldreik commented Sep 29, 2019

cmeister2 commented Sep 30, 2019

The internal fuzzers

CMake

Boost Asio

Corpus

Http fuzzer

cmeister2 commented Sep 30, 2019

pauldreik commented Oct 2, 2019 • edited Loading

pauldreik commented Oct 2, 2019 • edited Loading

pauldreik commented Oct 3, 2019

cmeister2 commented Oct 8, 2019

pauldreik commented Oct 20, 2019

cmeister2 commented Oct 23, 2019

pauldreik commented Oct 26, 2019

cmeister2 commented Nov 3, 2019

cmeister2 commented Nov 3, 2019

pauldreik commented Sep 29, 2019 •

edited

Loading

pauldreik commented Sep 29, 2019 •

edited

Loading

pauldreik commented Oct 2, 2019 •

edited

Loading

pauldreik commented Oct 2, 2019 •

edited

Loading