summaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
authorKaz Kylheku <kaz@kylheku.com>2016-05-31 07:04:06 -0700
committerKaz Kylheku <kaz@kylheku.com>2016-05-31 08:17:42 -0700
commitd6ce2f2ab5dbf62e423f632aa9d727f55ac5911b (patch)
treedcdba78994e46ef0c02f7537806dba86545d7a71
parentf66b253353eeafd2c52f3e3834aefeb06012b284 (diff)
downloadtxr-d6ce2f2ab5dbf62e423f632aa9d727f55ac5911b.tar.gz
txr-d6ce2f2ab5dbf62e423f632aa9d727f55ac5911b.tar.bz2
txr-d6ce2f2ab5dbf62e423f632aa9d727f55ac5911b.zip
Eliminating the ! $ and - file-opening hacks.
Going forward, the "!command" and "$dir" are no longer recognized, except with the backward-compatibility -C option. The "-" string denoting standard input is only recognized from the command line, not in the @(next) directive; and "-" cannot be used in @(output) to denote standard output. The main problem is that these are hacks which intrude into the namespace. (What if a file is named "-", or begins with "!" or "$"?) Secondly, they are possible security holes. These prefixes are recognized even in computed strings, like "@(next var)" where var contains "!command". * Makefile (tst/tests/002/%): Run tests in this directory with the -C 142 option in TXR_OPTS, because they use the ! mechanism. * match.c (complex_open): New argument, from_cmdline, indicating whether the file name came from the command line or is internal. Function now only treats "-" as denoting standard input, if that came from the command line. The "!" and "$" prefixes are only recognized in compatibility with 142 or earlier. (v_next): Suppress old strategy of handling string sources by pushing them to the file list to let match_files open them. This is now done only under 142 compatibility. Instead, open directly, produce the list, and recurse into match_files with ready data. This is not only simpler, but also prevents "-" from being recognized in @(next), because complex_open is invoked directly, and the from_cmdline argument is specified as nil. (v_output): Don't rely on "-" denoting standard output; rather use the stream directly, which can be passed to complex_open. Pass nil to the new from_cmdline parameter of complex_open, so "-" isn't recognized. (open_data_source): Pass t for the from_cmdline parameter of complex_open, to have "-" recognized as denoting standard input. * txr.1: Removed documentation referring to !, and use of - in the @(next) and @(output) directives. Added notes to COMPATIBILITY section.
-rw-r--r--Makefile2
-rw-r--r--match.c34
-rw-r--r--txr.149
3 files changed, 49 insertions, 36 deletions
diff --git a/Makefile b/Makefile
index 84034cbb..0c443a1d 100644
--- a/Makefile
+++ b/Makefile
@@ -291,7 +291,7 @@ tst/tests/001/%: TXR_ARGS := tests/001/data
tst/tests/001/query-1.out: TXR_OPTS := -B
tst/tests/001/query-2.out: TXR_OPTS := -B
tst/tests/001/query-4.out: TXR_OPTS := -B
-tst/tests/002/%: TXR_OPTS := -DTESTDIR=tests/002
+tst/tests/002/%: TXR_OPTS := -C 142 -DTESTDIR=tests/002
tst/tests/004/%: TXR_ARGS := -a 123 -b -c
tst/tests/005/%: TXR_ARGS := tests/005/data
tst/tests/005/%: TXR_OPTS := -B
diff --git a/match.c b/match.c
index dcadde0b..db0f1845 100644
--- a/match.c
+++ b/match.c
@@ -1578,25 +1578,28 @@ static val txeval_allow_ub(val spec, val form, val bindings)
return do_txeval(spec, form, bindings, t);
}
-static val complex_open(val name, val output, val append, val nothrow)
+static val complex_open(val name, val output, val append,
+ val nothrow, val from_cmdline)
{
+ int old_hacky_open = opt_compat && opt_compat <= 142;
+
if (streamp(name)) {
return name;
} else {
val fc = car(name);
val result = nil;
- if (fc == chr('$') && output)
+ if (old_hacky_open && fc == chr('$') && output)
uw_throwf(query_error_s, lit("cannot output to directory: ~a"),
name, nao);
uw_catch_begin (if2(nothrow, cons(error_s, nil)), exc_sym, exc);
- if (fc == chr('-')) {
+ if (from_cmdline && fc == chr('-')) {
result = output ? std_output : std_input;
- } else if (fc == chr('!')) {
+ } else if (old_hacky_open && fc == chr('!')) {
result = open_command(cdr(name), output ? lit("w") : lit("r"));
- } else if (fc == chr('$')) {
+ } else if (old_hacky_open && fc == chr('$')) {
result = open_directory(cdr(name));
} else {
result = open_file(name,
@@ -2411,6 +2414,7 @@ static val v_next(match_files_ctx *c)
sem_error(specline, lit("next: keyword argument expected, not ~s"), first(args), nao);
{
+ int old_hacky_open = opt_compat && opt_compat <= 142;
val alist = improper_plist_to_alist(args, list(nothrow_k, nao));
val from_var = cdr(assoc(var_k, alist));
val list_expr = cdr(assoc(list_k, alist));
@@ -2474,7 +2478,7 @@ static val v_next(match_files_ctx *c)
if3(c->data, cons(c->data, c->data_lineno), t));
return nil;
}
- } else if (nothrow) {
+ } else if (old_hacky_open && nothrow) {
if (str) {
c->files = cons(cons(nothrow_k, str), c->files);
} else {
@@ -2485,7 +2489,7 @@ static val v_next(match_files_ctx *c)
}
c->files = cons(cons(nothrow_k, first(c->files)), rest(c->files));
}
- } else {
+ } else if (old_hacky_open) {
if (str) {
c->files = cons(str, c->files);
} else {
@@ -2494,6 +2498,16 @@ static val v_next(match_files_ctx *c)
sem_error(specline, lit("next: out of arguments"), nao);
c->files = cons(cons(nothrow_k, first(c->files)), rest(c->files));
}
+ } else {
+ val stream = complex_open(str, nil, nil, nothrow, nil);
+ cons_bind (new_bindings, success,
+ match_files(mf_file_data(*c, str,
+ lazy_stream_cons(stream), one)));
+
+ if (success)
+ return cons(new_bindings,
+ if3(c->data, cons(c->data, c->data_lineno), t));
+ return nil;
}
}
} else {
@@ -3193,7 +3207,7 @@ static val v_output(match_files_ctx *c)
val dest_spec = third(first_spec);
val nothrow = nil;
val append = nil;
- val dest = lit("-");
+ val dest = std_output;
val filter = nil;
val named_var = nil, continue_expr = nil, finish_expr = nil;
val alist;
@@ -3293,7 +3307,7 @@ static val v_output(match_files_ctx *c)
return next_spec_k;
}
- stream = complex_open(dest, t, append, nothrow);
+ stream = complex_open(dest, t, append, nothrow, nil);
debuglf(specline, lit("opening data sink ~a"), dest, nao);
@@ -3985,7 +3999,7 @@ static void open_data_source(match_files_ctx *c)
"since query starts with non-matching "
"directive."), name, nao);
} else {
- val stream = complex_open(name, nil, nil, nothrow);
+ val stream = complex_open(name, nil, nil, nothrow, t);
val spec = first(c->spec);
debuglf(spec, lit("opening data source ~a"), name, nao);
diff --git a/txr.1 b/txr.1
index 30a9a001..8d5906da 100644
--- a/txr.1
+++ b/txr.1
@@ -3395,25 +3395,6 @@ variable, which
is visible to the remainder of the entire query. The variable bindings
survive beyond the clause, but the data stream does not.
-The
-.code @(next)
-directive supports the file name conventions as the command
-line. The name
-.code -
-means standard input. Text which starts with a
-.code !
-is
-interpreted as a shell command whose output is read like a file. These
-interpretations are applied after variable substitution. If the file is
-specified as
-.codn @a ,
-but the variable a expands to
-.strn "!echo foo" ,
-then the output of
-the
-.str "echo foo"
-command will be processed.
-
.dir skip
The
@@ -7366,12 +7347,7 @@ the keyword list. If the destination is missing, then the
entire argument list is a keyword list.
The destination may be a string which gives the path name of
-a file to open for output. If the name is
-.code -
-it instead denotes standard output, and if it begins with
-.code !
-then the rest of the shell is treated as a shell command
-to which the output is piped. The destination string may be specified as a
+a file to open for output. The destination string may be specified as a
variable which holds text, as a string literal or as a quasiliteral
Alternatively, the destination may be a stream object.
@@ -42969,6 +42945,29 @@ of these version values, the described behaviors are provided if
is given an argument which is equal or lower. For instance
.code "-C 103"
selects the behaviors described below for version 105, but not those for 102.
+.IP 142
+Until version 142, the \*(TX pattern language supported a prefix
+convention on data sources. Data sources beginning with the character
+.code !
+were treated as system command pipes, and data sources beginning with
+.code $
+indicated that a directory is to be scanned. This convention was recognized
+both for command line arguments, the arguments of the
+.code @(next)
+directive, and of the
+.code @(output)
+directive, whether or not the argument was a literal or a computed
+value. This feature was dropped from the language after version 142.
+Also, until version 142, the
+.code @(next)
+directive recognized the name
+.str -
+as denoting standard input, and
+.code @(output)
+recognized it as standard output. These behaviors were also removed;
+versions after 142 recognize this convention only when it appears
+as a command line argument. All these old behaviors are provided
+if compatibility with 142 or earlier is requested.
.IP 139
After \*(TX 139, changes were implemented in the area of pseudo-random
number generation. Compatibility with 139 brings back the previous