summaryrefslogtreecommitdiff
path: root/usr.bin/mandoc/cgi.c
AgeCommit message (Collapse)Author
2016-04-14do not rewrite short URIsIngo Schwarze
2016-04-13Give manuals in purely numerical sections priority over manuals ofIngo Schwarze
the same name in sections with an alphabetical suffix; same logic as in main.c rev. 1.171.
2016-03-18Make the SCRIPT_NAME logic simpler, safer, and make it actually work;Ingo Schwarze
in part based on ideas by bentley@. While here, improve the documentation.
2016-03-17make man(1) mode the default rather than apropos(1) modeIngo Schwarze
2016-03-17support short URIs for man.openbsd.orgIngo Schwarze
2015-12-25Don't retain the search query in the resulting manual links.Anthony J. Bentley
Clean, simple URLs are best. ok schwarze@
2015-11-07Modernization, no functional change intended:Ingo Schwarze
Use the POSIX function getline(3) rather than the slightly dangerous BSD function fgetln(3).
2015-11-05Use include files "header.html" and "footer.html" rather than aIngo Schwarze
compiled-in string. This is not a security risk, we read the file manpath.conf from the same directory, anyway. No error handling is needed; even if the files are absent, that's not an error. This is more flexible without causing complication of the code or the user interface. It helps the upcoming revamp of the online manual pages on man.NetBSD.org. Based on an idea by Jean-Yves Migeon <jeanyves dot migeon at free dot fr>, but implemented in a much simpler way.
2015-11-05simplify: use one stylesheet rather than two; from bentley@Ingo Schwarze
2015-10-22use the new function man_validate() here, tooIngo Schwarze
2015-10-20In order to become able to generate syntax tree nodes on the roff(7)Ingo Schwarze
level, validation must be separated from parsing and rewinding. This first big step moves calling of the mdoc(7) post_*() functions out of the parser loop into their own mdoc_validate() pass, while using a new mdoc_state() module to make syntax tree state handling available to both the parser loop and the validation pass.
2015-10-13Major character table cleanup:Ingo Schwarze
* Use ohash(3) rather than a hand-rolled hash table. * Make the character table static in the chars.c module: There is no need to pass a pointer around, we most certainly never want to use two different character tables concurrently. * No need to keep the characters in a separate file chars.in; that merely encourages downstream porters to mess with them. * Sort the characters to agree with the mandoc_chars(7) manual page. * Specify Unicode codepoints in hex, not decimal (that's the detail that originally triggered this patch). No functional change, minus 100 LOC, and i don't see a performance change.
2015-10-06modernize style: "return" is not a function; ok cmp(1)Ingo Schwarze
2015-04-18Profit from the unified struct roff_man and reduce the number ofIngo Schwarze
arguments of mparse_result() by one. No functional change. Written on the ICE Bruxelles-Koeln on the way back from p2k15.
2015-04-18Replace the structs mdoc and man by a unified struct roff_man.Ingo Schwarze
Almost completely mechanical, no functional change. Written on the train from Exeter to London returning from p2k15.
2015-03-27Actually use the new man.conf(5) "output" directive.Ingo Schwarze
Additional functionality, yet minus 45 lines of code.
2015-03-27Parse the new man.conf(5) "output" directive.Ingo Schwarze
The next step will be to actually use the parsed data.
2015-02-10trim trailing white space, no code change;Ingo Schwarze
from Svyatoslav Mishyn <juef at openmailboxd dot org>, Crux Linux
2015-01-15Fatal errors no longer exist.Ingo Schwarze
If a file can be opened, mandoc will produce some output; at worst, the output may be almost empty. Simplifies error handling and frees a message type for future use.
2014-11-26Make the code sipler and more robust by always assigning a valueIngo Schwarze
to q.manpath and dropping the (incomplete) later NULL checks.
2014-11-11In man(1) mode without -a, stop searching after the first manual treeIngo Schwarze
that contained at least one match in order to not prefer mdoc(1) from ports over mdoc(7). As a bonus, this results in a speedup.
2014-10-28Make the character table available to libroff so it can check theIngo Schwarze
validity of character escape names and warn about unknown ones. This requires mchars_spec2cp() to report unknown names again. Fortunately, that doesn't require changing the calling code because according to groff, invalid character escapes should not produce output anyway, and now that we warn about them, that's fine.
2014-10-07be a bit more patient, 1s is sometimes insufficient for legitimate queriesIngo Schwarze
2014-10-07Switch HTML output to polyglot HTML5; have only one single -Thml mode.Ingo Schwarze
Replace hard-coded widths and alignments with a minimal embedded stylesheet. Do not use <p> because it cannot appear inside block macros. Remove the "summary" attribute because it is not HTML5. Written by kristaps@ some months ago, finished during EuroBSDCon.
2014-09-14Support backslash-escaping of white space in the query expression,Ingo Schwarze
to be more similar to apropos(1) called from the shell. Missing feature reported by Marcus MERIGHI <mcmer dash openbsd at tor dot at> on misc@.
2014-08-27Add an implementation of man(1) into the /usr/bin/mandoc binary andIngo Schwarze
provide a unified set of command line options for mandoc(1), man(1), apropos(1), and whatis(1), each option doing the same for all four. Not adding any completely new options, only extending exiting ones from one tool to the others. New options are: * apropos & whatis -acfkw (in the past, these were man(1) only) * apropos & whatis -a -IOTW (in the past, mandoc(1) only) * mandoc -ac (in the past, man(1) only) * man -IOTW (in the past, mandoc(1) only) Before we can decide whether or not we want to replace src/usr.bin/man with this implementation, considerable bugfixing, testing, and performance measurements are needed, which i'd rather do in the tree than outside. Note that these bugs only affect the new man(1) mode, existing mandoc(1), apropos(1), and whatis(1) is fine. The new functionality in mandoc(1), apropos(1), and whatis(1) is fully enabled. To play with the new man(1), you can try: # mv /usr/bin/man /usr/bin/oman # ln -s /usr/bin/mandoc /usr/bin/man Positive feedback about the general direction from sthen@ and jmc@, and deraadt@ is not against it.
2014-08-26Unify console and CGI section titles, bringing them closer to groff,Ingo Schwarze
is an attempt to reduce confusion for users switching between media and between different operating systems. Not using the groff title for section 4 is intentional, it is just too awful ("Kernel Interfaces Manual"), and the difference from sections 2 and 9 would be too hard to see. Positive feedback from at least deraadt@ millert@ bluhm@ jca@. Previously also agreed with the general direction: jmc@ Nick@. The title for 3p was polished by bluhm@. The existing mismatches were originally noticed by jsg@.
2014-08-21limit CGI process execution time to make REDoS attacks less effective;Ingo Schwarze
attack surface pointed out by Sebastien Marie
2014-08-08mansearch.h uses uint64_t, so it needs stdint.h; found on LinuxIngo Schwarze
2014-07-25Choosing the right encoding is a tricky business...Ingo Schwarze
Printing query strings for URIs *always* needs URI-encoding, and when embedding the URI into an HTML document, it needs replacement of the "&" separators by "&amp;" *in addition to that*, not instead. Delete the function html_primtquery(), it was completely wrong. You can see the badness by entering "mandoc &sec=2" into the query input box before this patch and click "Submit". You come to the right page at first (...man.cgi?query=mandoc+%26sec%3D2&apropos=0&sec=0&...), but now the link to mandoc(1) is wrong: ...mandoc.1?query=mandoc &amp;sec=2&amp;... Clicking on that, the "&sec=2" disappears from the query input box and suddenly you have the first dropdown set to "2 - System Calls". Oops.
2014-07-25Sort the URI keys for .Xr links in the same order used by the search form,Ingo Schwarze
and leave out the manpath when it is the default. For building the HTML formatter options, do not use a static buffer.
2014-07-25oops, we must not try to validate a manpath we don't have;Ingo Schwarze
fixing an oversight introduced in rev. 1.17
2014-07-25We cannot easily control the order of the QUERY_STRING keys generatedIngo Schwarze
by the search form, it's just the order of the fields in the form. Actually, that's not too bad; the generated URI resembles the generating form. To minimize confusion for people looking at URIs, give the keys in the same order when generating URIs for search listings and search redirections, the latter being used instead of search listings that would have only one single entry. Also, if the manpath is the default, remove it form the generated URIs.
2014-07-25In generated .Xr links, avoid double encoding of ampersandsIngo Schwarze
and avoid empty arch= keys.
2014-07-25Even though this is not XHTML yet, remove some gratuitious violationsIngo Schwarze
of XHTML syntax. Also add some cosmetic newlines to the HTML code.
2014-07-25The names of all other struct query memebers match the correspondingIngo Schwarze
QUERY_STRING keys, so rename "expr" to "query". Also add some missing function prototypes. No functional change.
2014-07-25clean up pg_show() to not modify a string returned from getenv(3)Ingo Schwarze
2014-07-25Rewrite http_parse() completely:Ingo Schwarze
1. Make sure the last occurrence of each key is used, even if it is empty, in which case it resets the value to the default. 2. When there is an HTTP encoding error, skip the affected key-value pair only, but not all subsequent key-value pairs. 3. Do not modify a string returned from getenv(3). 4. Do not assume the NULL pointer is all null bits.
2014-07-24Sort result pages first by section number, then by name.Ingo Schwarze
By moving the sort from cgi.c to mansearch.c, we get two advantages: Easier access to the data needed for sorting, in particular the section number, and the apropos(1) command line utility profits as well. Feature requested by deraadt@.
2014-07-24Provide a dropdown entry "All Architectures" and make it the default.Ingo Schwarze
Still, amd64 remains the default in the following sense: If a man(1) mode search returns more than one page of the same name, prefer amd64 over other architectures for immediate display. ok deraadt@ daniel@
2014-07-22Security fix to prevent XSS attacks:Ingo Schwarze
Restrict the character set of strings passed into html_alloc(), in particular architecture names that come from the QUERY_STRING, but also SCRIPT_NAME and manpath.conf content for additional safety, and bail out safely on violations. Issue reported by Sebastien Marie <semarie-openbsd at latrappe dot fr>.
2014-07-21fix a minibug reported by kristaps@:Ingo Schwarze
preserve manpath and arch in .Xr links
2014-07-21Kristaps points out that the current HTTP/1.1 draft standard (RFCIngo Schwarze
2616) requires the Location: response-header field to be an absolute URI (14.30), and only the most recent proposed standard (RFC 7231), which is barely a month old, allows a relative Location: (7.1.2). While most modern browsers appear to support relative Location: headers, some may not, and it's maybe a bit early to rely on relative Location: headers. I'm not going back to the HTTP_HOST or SERVER_NAME CGI variables, though. While some CGI programs certainly require those, in which case both the CGI programmer and the web server admin have to be very careful to keep the system secure and reliable, man.cgi(8) does not really need them. We always know at compile time which domain we are running for, and for man.cgi(8), security and reliability are definitely much more important than flexibility. So make HTTP_HOST a compile-time definition for now.
2014-07-19Security fix:Ingo Schwarze
Validate the manpath up front and report a Bad Request if it is not listed in manpath.conf, such that clients can't probe which directories exist on the server. In case of configuration errors, consistently report Internal Server Error without disclosing any further information. Partially based on a patch from Sebastien Marie <semarie-openbsd at latrappe dot fr>, but avoiding a couple of issues with that patch and approaching the issue in a somewhat more rigorous way.
2014-07-19Security fix:Ingo Schwarze
Validate the name of the file to show before opening it. Only allow relative filenames starting with "man" or "cat" and containing neither "/.." nor "../". While here, correct the condition discarding an initial "./". Vulnerability found by Sebastien Marie <semarie-openbsd at latrappe dot fr>. Many thanks for sending a patch; however, i did not use it but made the checks even stricter.
2014-07-18Do not use the HTTP_HOST CGI variable,Ingo Schwarze
just make the HTTP redirect Location: relative. Less user input is good, it reduces the attack surface. Besides, this removes one global variable and 4 lines of code. Patch from Sebastien Marie <semarie-openbsd at latrappe dot fr>.
2014-07-18When the MAN_DIR/manpath.conf configuration file does not exist or is empty,Ingo Schwarze
log the problem, hand the pg_error_internal() error page to the client, and exit(3) in a controlled way instead of stumbling on and segfaulting later. Patch from Sebastien Marie <semarie-openbsd at latrappe dot fr>, messages tweaked by me.
2014-07-13Compatibility hack for the old "manpath=OpenBSD<blank>" query parameter format;Ingo Schwarze
unfortunate, more than 400 links needing this are scattered all around the www.openbsd.org website, and CVSweb needs this as well.
2014-07-13Make the calltree a bit easier to understand by giving theIngo Schwarze
functions that call resp_begin_html() names starting with "pg_" and those called after resp_begin_html() names with "resp_". No functional change, purely renaming functions.
2014-07-13make source vs. formatted guessing a bit more robustIngo Schwarze