Fork

From Hidden Wiki
Jump to navigation Jump to search

In software engineering, a project fork happens when developers take a copy of source code from one software package and start independent development on it, creating a distinct and separate piece of software. The term often implies not merely a development branch, but also a split in the developer community, a form of schism.[1]

Free and open-source software is that which, by definition, may be forked from the original development team without prior permission, without violating copyright law. However, licensed forks of proprietary software (e.g. Unix) also happen.

Etymology

The word "fork" has been used to mean "to divide in branches, go separate ways" as early as the 14th century.[2] In the software environment, the word evokes the fork system call, which causes a running process to split itself into two (almost) identical copies that (typically) diverge to perform different tasks.[3]

In the context of software development, "fork" was used in the sense of creating a revision control "branch" by Eric Allman as early as 1980, in the context of SCCS:[4]

Template:Quotation

The term was in use on Usenet by 1983 for the process of creating a subgroup to move topics of discussion to.[5]

"Fork" is not known to have been used in the sense of a community schism during the origins of Lucid Emacs (now XEmacs) (1991) or the BSDs (1993–1994); Russ Nelson used the term "shattering" for this sort of fork in 1993, attributing it to John Gilmore.[6] However, "fork" was in use in the present sense by 1995 to describe the XEmacs split,[7] and was an understood usage in the GNU Project by 1996.[8]

Forking of free and open source software

Free and open source software may be legally forked without prior approval of those currently developing, managing, or distributing the software per both The Free Software Definition and The Open Source Definition:[9] Template:Quotation Template:Quotation

In free software, forks often result from a schism over different goals or personality clashes. In a fork, both parties assume nearly identical code bases, but typically only the larger group, or whoever controls the Web site, will retain the full original name and the associated user community. Thus, there is a reputation penalty associated with forking.[9] The relationship between the different teams can be cordial or very bitter.

Eric S. Raymond, in his essay Homesteading the Noosphere,[10] stated that "The most important characteristic of a fork is that it spawns competing projects that cannot later exchange code, splitting the potential developer community". He notes in the Jargon File:[11]

Template:Quotation

David A. Wheeler notes[9] four possible outcomes of a fork, with examples:

  1. The death of the fork. This is by far the most common case. It is easy to declare a fork, but considerable effort to continue independent development and support.
  2. A re-merging of the fork (e.g., egcs becoming "blessed" as the new version of gcc.)
  3. The death of the original (e.g. the X.Org Server succeeding and XFree86 dying.)
  4. Successful branching, typically with differentiation (e.g., OpenBSD and NetBSD.)

Distributed revision control (DVCS) tools have popularised a less emotive use of the term "fork", blurring the distinction with "branch".[12] With a DVCS such as Mercurial or Git, the normal way to contribute to a project, is to first create a personal branch of the repository, independent of the main repository, and later seek to have your changes integrated with it. Sites such as GitHub, Bitbucket and Launchpad provide free DVCS hosting expressly supporting independent branches, such that the technical, social and financial barriers to forking a source code repository are massively reduced, and GitHub uses "fork" as its term for this method of contribution to a project.

Forks often restart version numbering from 0.1 or 1.0 even if the original software was at version 3.0, 4.0, or 5.0. An exception is when the forked software is designed to be a drop-in replacement for the original project, e.g. MariaDB for MySQL[13] or LibreOffice for OpenOffice.org.

Forking proprietary software

In proprietary software, the copyright is usually held by the employing entity, not by the individual software developers. Proprietary code is thus more commonly forked when the owner needs to develop two or more versions, such as a windowed version and a command line version, or versions for differing operating systems, such as a word processor for IBM PC compatible machines and Macintosh computers. Generally, such internal forks will concentrate on having the same look, feel, data format, and behavior between platforms so that a user familiar with one can also be productive or share documents generated on the other. This is almost always an economic decision to generate a greater market share and thus pay back the associated extra development costs created by the fork.

A notable proprietary fork not of this kind is the many varieties of proprietary Unix—almost all derived from AT&T Unix under license and all called "Unix", but increasingly mutually incompatible.[14] See UNIX wars.

The BSD licenses permit forks to become proprietary software, and some sayTemplate:Who that commercial incentives thus make proprietisation almost inevitable. Examples include macOS (based on the proprietary NeXTSTEP and the open source FreeBSD), Cedega and CrossOver (proprietary forks of Wine, though CrossOver tracks Wine and contributes considerably), EnterpriseDB (a fork of PostgreSQL, adding Oracle compatibility features[15]), Supported PostgreSQL with their proprietary ESM storage system,[16] and Netezza's[17] proprietary highly scalable derivative of PostgreSQL. Some of these vendors contribute back changes to the community project, while some keep their changes as their own competitive advantages.

See also

References

1 }}
     | references-column-width 
     | references-column-count references-column-count-{{#if:1|30em}} }}
   | {{#if: 
     | references-column-width }} }}" style="{{#if: 30em
   | {{#iferror: {{#ifexpr: 30em > 1 }}
     | Template:Column-width
     | Template:Column-count }}
   | {{#if: 
     | Template:Column-width }} }} list-style-type: {{#switch: 
   | upper-alpha
   | upper-roman
   | lower-alpha
   | lower-greek
   | lower-roman = {{{group}}}
   | #default = decimal}};">
  1. "Schism", with its connotations, is a common usage, e.g. "the Lemacs/FSFmacs schism" Template:Webarchive (Jamie Zawinski, 2000), "Behind the KOffice split" Template:Webarchive (Joe Brockmeier, Linux Weekly News, 2010-12-14), "Copyright assignment - once bitten, twice shy" Template:Webarchive (Richard Hillesley, H-Online, 2010-08-06), "Forking is a feature" Template:Webarchive (Anil Dash, 2010-09-10), "The Great Software Schism" Template:Webarchive (Glyn Moody, Linux Journal, 2006-09-28), "To Fork Or Not To Fork: Lessons From Ubuntu and Debian" Template:Webarchive (Benjamin Mako Hill, 2005).
  2. Entry 'fork' in Online Etymology Dictionary Template:Webarchive
  3. "The term fork is derived from the POSIX standard for operating systems: the system call used so that a process generates a copy of itself is called fork()." Template:Cite conference
  4. Allman, Eric. "An Introduction to the Source Code Control System." Template:Webarchive Project Ingres, University of California at Berkeley, 1980.
  5. Can somebody fork off a "net.philosophy"? (John Gilmore, net.misc, 18 January 1983)
  6. Shattering — good or bad? (Russell Nelson, gnu.misc.discuss, 1 October 1993)
  7. Re: Hey Franz: 32K Windows SUCK!!!!! (Bill Dubuque, cu.cs.macl.info, 21 September 1995)
  8. Lignux? (Marcus G. Daniels, gnu.misc.discuss, 7 June 1996)
  9. 9.0 9.1 9.2 Why Open Source Software / Free Software (OSS/FS, FLOSS, or FOSS)? Look at the Numbers!: Forking Template:Webarchive (David A. Wheeler)
  10. Template:Cite web
  11. Forked Template:Webarchive (Jargon File), first added to v4.2.2 Template:Webarchive, 20 Aug 2000)
  12. e.g. Template:Cite web See also Template:Cite thesis
  13. Forked a project, where do my version numbers start? Template:Webarchive
  14. Fear of forking Template:Webarchive - An essay about forking in free software projects, by Rick Moen
  15. EnterpriseDB Template:Webarchive
  16. Fujitsu Supported PostgreSQL Template:Webarchive
  17. Netezza Template:Webarchive

External links

  • Right to Fork at Meatball Wiki
  • A PhD examining forking: (Nyman, 2015) "Understanding Code Forking in Open Source Software - An examination of code forking, its effect on open source software, and how it is viewed and practiced by developers"

Template:FOSS Template:Use dmy dates