join (Unix)

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
join
Original authorDouglas McIlroy
DeveloperAT&T Bell Laboratories
Initial releaseJanuary 1979; 47 years ago (1979-01)
Repository
  • {{URL|example.com|optional display text}}Lua error in Module:EditAtWikidata at line 29: attempt to index field 'wikibase' (a nil value).
Written inC
Engine
    Lua error in Module:EditAtWikidata at line 29: attempt to index field 'wikibase' (a nil value).
    Operating systemUnix, Unix-like, Plan 9
    PlatformCross-platform
    TypeCommand
    Licensecoreutils: GPLv3+
    Plan 9: MIT License

    join is a command in Unix and Unix-like operating systems that merges the lines of two sorted text files based on the presence of a common field. It is similar to the join operator used in relational databases but operating on text files.

    Overview

    [edit | edit source]

    The join command takes as input two text files and several options. If no command-line argument is given, this command looks for a pair of lines from the two files having the same first field (a sequence of characters that are different from space), and outputs a line composed of the first field followed by the rest of the two lines.

    The program arguments specify which character to be used in place of space to separate the fields of the line, which field to use when looking for matching lines, and whether to output lines that do not match. The output can be stored to another file rather than printed using redirection.

    As an example, the two following files list the known fathers and the mothers of some people. Both files have been sorted on the join field — this is a requirement of the program.

    george jim
    kumar gunaware
    
    albert martha
    george sophie
    

    The join of these two files (with no argument) would produce:

    george jim sophie
    

    Indeed, only "george" is common as a first word of both files.

    Examples

    [edit | edit source]

    To find Pythagorean triples of the form (k, k+1, n).

    $ join <(for i in {1..9999} ; do printf -- '%010d\t%s\t%s\t%s\n' "$((2*i*(i+1)+1))" "$i" "$((i+1))" ; done) <(for i in {1..9999} ; do printf -- '%010d\t%s\n' "$((i**2))" "$i" ; done)
    0000000025 3 4  5
    0000000841 20 21  29
    0000028561 119 120  169
    0000970225 696 697  985
    0032959081 4059 4060  5741
    

    History

    [edit | edit source]

    join is intended to be a relation database operator. It is part of the X/Open Portability Guide since issue 2 of 1987. It was inherited into the first version of POSIX.1 and the Single Unix Specification.[1][2]

    The version of join bundled in GNU coreutils was written by Mike Haertel.[3] The command is available as a separate package for Microsoft Windows as part of the UnxUtils collection of native Win32 ports of common GNU Unix-like utilities.[4]

    See also

    [edit | edit source]

    References

    [edit | edit source]
    1. ^ printf – Shell and Utilities Reference, The Single UNIX Specification, Version 5 from The Open Group
    2. ^ printf(1) – FreeBSD General Commands Manual
    3. ^ join(1) – Linux General Commands Manual from ManKier.com
    4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    [edit | edit source]