Tcl Improvement Proposals: Check-in [9f0c27f8d1]

Many hyperlinks are disabled.
Use anonymous login to enable hyperlinks.

Overview

Comment:	Converted TIPs to Markdown
Downloads:	Tarball \| ZIP archive \| SQL archive
Timelines:	family \| ancestors \| descendants \| both \| trunk
Files:	files \| file ages \| folders
SHA3-256:	9f0c27f8d14faef5e50a194ccd93f0f562007dd158ef901bad2deb0f1fd63be0
User & Date:	mjanssen 2017-09-06 14:15:49

Context

2017-09-06
14:15		Added script to generate index check-in: e62d322a99 user: mjanssen tags: trunk
14:15		Converted TIPs to Markdown check-in: 9f0c27f8d1 user: mjanssen tags: trunk
14:15		Clean-up CVS import check-in: dce4aef4b9 user: mjanssen tags: trunk

Changes

Hide Diffs Unified Diffs Ignore Whitespace Patch

Name change from tip/0.tip to tip/0.md.

Name change from tip/1.tip to tip/1.md.

Name change from tip/10.tip to tip/10.md.

Name change from tip/100.tip to tip/100.md.

Name change from tip/101.tip to tip/101.md.

Name change from tip/102.tip to tip/102.md.

Name change from tip/103.tip to tip/103.md.

Name change from tip/104.tip to tip/104.md.

Name change from tip/105.tip to tip/105.md.

Name change from tip/106.tip to tip/106.md.

Name change from tip/107.tip to tip/107.md.

Name change from tip/108.tip to tip/108.md.

Name change from tip/109.tip to tip/109.md.

Name change from tip/11.tip to tip/11.md.

Name change from tip/110.tip to tip/110.md.

Name change from tip/111.tip to tip/111.md.

Name change from tip/112.tip to tip/112.md.

Name change from tip/113.tip to tip/113.md.

Name change from tip/114.tip to tip/114.md.

Name change from tip/115.tip to tip/115.md.

Name change from tip/116.tip to tip/116.md.

Name change from tip/117.tip to tip/117.md.

Name change from tip/118.tip to tip/118.md.

Name change from tip/119.tip to tip/119.md.

Name change from tip/12.tip to tip/12.md.

Name change from tip/120.tip to tip/120.md.

Name change from tip/121.tip to tip/121.md.

Name change from tip/122.tip to tip/122.md.

Name change from tip/123.tip to tip/123.md.

Name change from tip/124.tip to tip/124.md.

Name change from tip/125.tip to tip/125.md.

Name change from tip/126.tip to tip/126.md.

Name change from tip/127.tip to tip/127.md.

Name change from tip/128.tip to tip/128.md.

Name change from tip/129.tip to tip/129.md.

Name change from tip/13.tip to tip/13.md.

Name change from tip/130.tip to tip/130.md.

Name change from tip/131.tip to tip/131.md.

Name change from tip/132.tip to tip/132.md.

Name change from tip/133.tip to tip/133.md.

Name change from tip/134.tip to tip/134.md.

Name change from tip/135.tip to tip/135.md.

Name change from tip/136.tip to tip/136.md.

Name change from tip/137.tip to tip/137.md.

Name change from tip/138.tip to tip/138.md.

Name change from tip/139.tip to tip/139.md.

Name change from tip/14.tip to tip/14.md.

Name change from tip/140.tip to tip/140.md.

Name change from tip/141.tip to tip/141.md.

Name change from tip/142.tip to tip/142.md.

Name change from tip/143.tip to tip/143.md.

Name change from tip/144.tip to tip/144.md.

Name change from tip/145.tip to tip/145.md.

Name change from tip/146.tip to tip/146.md.

Name change from tip/147.tip to tip/147.md.

Name change from tip/148.tip to tip/148.md.

Name change from tip/149.tip to tip/149.md.

Name change from tip/15.tip to tip/15.md.

Name change from tip/150.tip to tip/150.md.

Name change from tip/151.tip to tip/151.md.

Name change from tip/152.tip to tip/152.md.

Name change from tip/153.tip to tip/153.md.

Name change from tip/154.tip to tip/154.md.

Name change from tip/155.tip to tip/155.md.

Name change from tip/156.tip to tip/156.md.

Name change from tip/157.tip to tip/157.md.

Name change from tip/158.tip to tip/158.md.

Name change from tip/159.tip to tip/159.md.

Name change from tip/16.tip to tip/16.md.

Name change from tip/160.tip to tip/160.md.

Name change from tip/161.tip to tip/161.md.

Name change from tip/162.tip to tip/162.md.

Name change from tip/163.tip to tip/163.md.

Name change from tip/164.tip to tip/164.md.

Name change from tip/165.tip to tip/165.md.

Name change from tip/166.tip to tip/166.md.

Name change from tip/167.tip to tip/167.md.

Name change from tip/168.tip to tip/168.md.

Name change from tip/169.tip to tip/169.md.

Name change from tip/17.tip to tip/17.md.

Name change from tip/170.tip to tip/170.md.

Name change from tip/171.tip to tip/171.md.

Name change from tip/172.tip to tip/172.md.

Name change from tip/173.tip to tip/173.md.

Name change from tip/174.tip to tip/174.md.

Name change from tip/175.tip to tip/175.md.

Name change from tip/176.tip to tip/176.md.

Name change from tip/177.tip to tip/177.md.

Name change from tip/178.tip to tip/178.md.

Name change from tip/179.tip to tip/179.md.

Name change from tip/18.tip to tip/18.md.

Name change from tip/180.tip to tip/180.md.

Name change from tip/181.tip to tip/181.md.

Name change from tip/182.tip to tip/182.md.

Name change from tip/183.tip to tip/183.md.

Name change from tip/184.tip to tip/184.md.

Name change from tip/185.tip to tip/185.md.

Name change from tip/186.tip to tip/186.md.

Name change from tip/187.tip to tip/187.md.

Name change from tip/188.tip to tip/188.md.

Name change from tip/189.tip to tip/189.md.

Name change from tip/19.tip to tip/19.md.

Name change from tip/190.tip to tip/190.md.

Name change from tip/191.tip to tip/191.md.

Name change from tip/192.tip to tip/192.md.

Name change from tip/193.tip to tip/193.md.

Name change from tip/194.tip to tip/194.md.

Name change from tip/195.tip to tip/195.md.

Name change from tip/196.tip to tip/196.md.

Name change from tip/197.tip to tip/197.md.

Name change from tip/198.tip to tip/198.md.

Name change from tip/199.tip to tip/199.md.

Name change from tip/2.tip to tip/2.md.

Name change from tip/20.tip to tip/20.md.

Name change from tip/200.tip to tip/200.md.

Name change from tip/201.tip to tip/201.md.

Name change from tip/202.tip to tip/202.md.

Name change from tip/203.tip to tip/203.md.

Name change from tip/204.tip to tip/204.md.

Name change from tip/205.tip to tip/205.md.

Name change from tip/206.tip to tip/206.md.

Name change from tip/207.tip to tip/207.md.

Name change from tip/208.tip to tip/208.md.

Name change from tip/209.tip to tip/209.md.

Name change from tip/21.tip to tip/21.md.

Name change from tip/210.tip to tip/210.md.

Name change from tip/211.tip to tip/211.md.

Name change from tip/212.tip to tip/212.md.

Name change from tip/213.tip to tip/213.md.

Name change from tip/214.tip to tip/214.md.

Name change from tip/215.tip to tip/215.md.

Name change from tip/216.tip to tip/216.md.

Name change from tip/217.tip to tip/217.md.

Name change from tip/218.tip to tip/218.md.

Name change from tip/219.tip to tip/219.md.

Name change from tip/22.tip to tip/22.md.

Name change from tip/220.tip to tip/220.md.

Name change from tip/221.tip to tip/221.md.

Name change from tip/222.tip to tip/222.md.

Name change from tip/223.tip to tip/223.md.

Name change from tip/224.tip to tip/224.md.

Name change from tip/225.tip to tip/225.md.

Name change from tip/226.tip to tip/226.md.

Name change from tip/227.tip to tip/227.md.

Name change from tip/228.tip to tip/228.md.

Name change from tip/229.tip to tip/229.md.

Name change from tip/23.tip to tip/23.md.

Name change from tip/230.tip to tip/230.md.

Name change from tip/231.tip to tip/231.md.

Name change from tip/232.tip to tip/232.md.

Name change from tip/233.tip to tip/233.md.

Name change from tip/234.tip to tip/234.md.

Name change from tip/235.tip to tip/235.md.

Name change from tip/236.tip to tip/236.md.

Name change from tip/237.tip to tip/237.md.

Name change from tip/238.tip to tip/238.md.

Name change from tip/239.tip to tip/239.md.

Name change from tip/24.tip to tip/24.md.

Name change from tip/240.tip to tip/240.md.

Name change from tip/241.tip to tip/241.md.

Name change from tip/242.tip to tip/242.md.

Name change from tip/243.tip to tip/243.md.

Name change from tip/244.tip to tip/244.md.

Name change from tip/245.tip to tip/245.md.

Name change from tip/246.tip to tip/246.md.

Name change from tip/247.tip to tip/247.md.

Name change from tip/248.tip to tip/248.md.

Name change from tip/249.tip to tip/249.md.

Name change from tip/25.tip to tip/25.md.

Name change from tip/250.tip to tip/250.md.

Name change from tip/251.tip to tip/251.md.

Name change from tip/252.tip to tip/252.md.

Name change from tip/253.tip to tip/253.md.

Name change from tip/254.tip to tip/254.md.

Name change from tip/255.tip to tip/255.md.

Name change from tip/256.tip to tip/256.md.

Name change from tip/257.tip to tip/257.md.

Name change from tip/258.tip to tip/258.md.

Name change from tip/259.tip to tip/259.md.

Name change from tip/26.tip to tip/26.md.

Name change from tip/260.tip to tip/260.md.

Name change from tip/261.tip to tip/261.md.

Name change from tip/262.tip to tip/262.md.

Name change from tip/263.tip to tip/263.md.

Name change from tip/264.tip to tip/264.md.

Name change from tip/265.tip to tip/265.md.

Name change from tip/266.tip to tip/266.md.

Name change from tip/267.tip to tip/267.md.

Name change from tip/268.tip to tip/268.md.

Name change from tip/269.tip to tip/269.md.

Name change from tip/27.tip to tip/27.md.

Name change from tip/270.tip to tip/270.md.

Name change from tip/271.tip to tip/271.md.

Name change from tip/272.tip to tip/272.md.

Name change from tip/273.tip to tip/273.md.

Name change from tip/274.tip to tip/274.md.

Name change from tip/275.tip to tip/275.md.

Name change from tip/276.tip to tip/276.md.

Name change from tip/277.tip to tip/277.md.

Name change from tip/278.tip to tip/278.md.

Name change from tip/279.tip to tip/279.md.

Name change from tip/28.tip to tip/28.md.

Name change from tip/280.tip to tip/280.md.

Name change from tip/281.tip to tip/281.md.

Name change from tip/282.tip to tip/282.md.

Name change from tip/283.tip to tip/283.md.

Name change from tip/284.tip to tip/284.md.

Name change from tip/285.tip to tip/285.md.

Name change from tip/286.tip to tip/286.md.

Name change from tip/287.tip to tip/287.md.

Name change from tip/288.tip to tip/288.md.

Name change from tip/289.tip to tip/289.md.

Name change from tip/29.tip to tip/29.md.

Name change from tip/290.tip to tip/290.md.

Name change from tip/291.tip to tip/291.md.

Name change from tip/292.tip to tip/292.md.

Name change from tip/293.tip to tip/293.md.

Name change from tip/294.tip to tip/294.md.

Name change from tip/295.tip to tip/295.md.

Name change from tip/296.tip to tip/296.md.

Name change from tip/297.tip to tip/297.md.

Name change from tip/298.tip to tip/298.md.

Name change from tip/299.tip to tip/299.md.

Name change from tip/3.tip to tip/3.md.

Name change from tip/30.tip to tip/30.md.

Name change from tip/300.tip to tip/300.md.

Name change from tip/301.tip to tip/301.md.

Name change from tip/302.tip to tip/302.md.

Name change from tip/303.tip to tip/303.md.

Name change from tip/304.tip to tip/304.md.

Name change from tip/305.tip to tip/305.md.

Name change from tip/306.tip to tip/306.md.

Name change from tip/307.tip to tip/307.md.

Name change from tip/308.tip to tip/308.md.

Name change from tip/309.tip to tip/309.md.

Name change from tip/31.tip to tip/31.md.

Name change from tip/310.tip to tip/310.md.

Name change from tip/311.tip to tip/311.md.

Name change from tip/312.tip to tip/312.md.

Name change from tip/313.tip to tip/313.md.

Name change from tip/314.tip to tip/314.md.

Name change from tip/315.tip to tip/315.md.

Name change from tip/316.tip to tip/316.md.

Name change from tip/317.tip to tip/317.md.

Name change from tip/318.tip to tip/318.md.

Name change from tip/319.tip to tip/319.md.

Name change from tip/32.tip to tip/32.md.

Name change from tip/320.tip to tip/320.md.

Name change from tip/321.tip to tip/321.md.

Name change from tip/322.tip to tip/322.md.

Name change from tip/323.tip to tip/323.md.

Name change from tip/324.tip to tip/324.md.

Name change from tip/325.tip to tip/325.md.

Name change from tip/326.tip to tip/326.md.

Name change from tip/327.tip to tip/327.md.

Name change from tip/328.tip to tip/328.md.

Name change from tip/329.tip to tip/329.md.

Name change from tip/33.tip to tip/33.md.

Name change from tip/330.tip to tip/330.md.

Name change from tip/331.tip to tip/331.md.

Name change from tip/332.tip to tip/332.md.

Name change from tip/333.tip to tip/333.md.

Name change from tip/334.tip to tip/334.md.

Name change from tip/335.tip to tip/335.md.

Name change from tip/336.tip to tip/336.md.

Name change from tip/337.tip to tip/337.md.

Name change from tip/338.tip to tip/338.md.

Name change from tip/339.tip to tip/339.md.

Name change from tip/34.tip to tip/34.md.

Name change from tip/340.tip to tip/340.md.

Name change from tip/341.tip to tip/341.md.

Name change from tip/342.tip to tip/342.md.

Name change from tip/343.tip to tip/343.md.

Name change from tip/344.tip to tip/344.md.

Name change from tip/345.tip to tip/345.md.

Name change from tip/346.tip to tip/346.md.

Name change from tip/347.tip to tip/347.md.

Name change from tip/348.tip to tip/348.md.

Name change from tip/349.tip to tip/349.md.

Name change from tip/35.tip to tip/35.md.

Name change from tip/350.tip to tip/350.md.

Name change from tip/351.tip to tip/351.md.

Name change from tip/352.tip to tip/352.md.

Name change from tip/353.tip to tip/353.md.

Name change from tip/354.tip to tip/354.md.

Name change from tip/355.tip to tip/355.md.

Name change from tip/356.tip to tip/356.md.

Name change from tip/357.tip to tip/357.md.

Name change from tip/358.tip to tip/358.md.

Name change from tip/359.tip to tip/359.md.

Name change from tip/36.tip to tip/36.md.

Name change from tip/360.tip to tip/360.md.

Name change from tip/361.tip to tip/361.md.

Name change from tip/362.tip to tip/362.md.

Name change from tip/363.tip to tip/363.md.

Name change from tip/364.tip to tip/364.md.

Name change from tip/365.tip to tip/365.md.

Name change from tip/366.tip to tip/366.md.

Name change from tip/367.tip to tip/367.md.

Name change from tip/368.tip to tip/368.md.

Name change from tip/369.tip to tip/369.md.

Name change from tip/37.tip to tip/37.md.

Name change from tip/370.tip to tip/370.md.

Name change from tip/371.tip to tip/371.md.

Name change from tip/372.tip to tip/372.md.

Name change from tip/373.tip to tip/373.md.

Name change from tip/374.tip to tip/374.md.

Name change from tip/375.tip to tip/375.md.

Name change from tip/376.tip to tip/376.md.

Name change from tip/377.tip to tip/377.md.

Name change from tip/378.tip to tip/378.md.

Name change from tip/379.tip to tip/379.md.

Name change from tip/38.tip to tip/38.md.

Name change from tip/380.tip to tip/380.md.

Name change from tip/381.tip to tip/381.md.

Name change from tip/382.tip to tip/382.md.

Name change from tip/383.tip to tip/383.md.

Name change from tip/384.tip to tip/384.md.

Name change from tip/385.tip to tip/385.md.

Name change from tip/386.tip to tip/386.md.

Name change from tip/387.tip to tip/387.md.

Name change from tip/388.tip to tip/388.md.

Name change from tip/389.tip to tip/389.md.

Name change from tip/39.tip to tip/39.md.

Name change from tip/390.tip to tip/390.md.

Name change from tip/391.tip to tip/391.md.

Name change from tip/392.tip to tip/392.md.

Name change from tip/393.tip to tip/393.md.

Name change from tip/394.tip to tip/394.md.

Name change from tip/395.tip to tip/395.md.

Name change from tip/396.tip to tip/396.md.

Name change from tip/397.tip to tip/397.md.

Name change from tip/398.tip to tip/398.md.

Name change from tip/399.tip to tip/399.md.

Name change from tip/4.tip to tip/4.md.

Name change from tip/40.tip to tip/40.md.

Name change from tip/400.tip to tip/400.md.

Name change from tip/401.tip to tip/401.md.

Name change from tip/402.tip to tip/402.md.

Name change from tip/403.tip to tip/403.md.

Name change from tip/404.tip to tip/404.md.

Name change from tip/405.tip to tip/405.md.

Name change from tip/406.tip to tip/406.md.

Name change from tip/407.tip to tip/407.md.

Name change from tip/408.tip to tip/408.md.

Name change from tip/409.tip to tip/409.md.

Name change from

tip/41.tip to tip/41.md. splitdiff" data-lefthash="8e230abe3be81ecf5e3310c7e48233559d80abb21ae123776a22051b675a6223"> class="diffln difflnl">

 difftxtl">  Paned Window Tk Widget $Revision: 1.13 $ Eric Melski <[email protected]> a C-based paned window widget for inclusion in the window consists of one or more vertical or each pair separated by a movable "sash" and each called a "slave".  Paned windows are common in user interfaces and should therefore be provided Examples of the widget can be found in Netscape Messenger; many email clients; and graphical World Wide Web browser. Rationale other graphical toolkits in terms of the selection In order to keep Tk vibrant, it is imperative that the widget set be enhanced have become commonplace in modern graphical user such widget is the paned window widget.  A widget to create robust paned windows should be included widget could be implemented in C or in Tcl; in fact, paned window widgets already exist.  However, these mostly caused by the inability to completely manage of Tk windows from Tcl (i.e. there is no way to make like ''Tk_MaintainGeometry'' or 39;Tk_ManageGeometry'').  This issue could possibly be addressed by the a proper megawidget system for Tk, but that goal seems If we wait for that system before it may be too late.  In addition, megawidget suffer from "widget bloat" - each paned window widget typically two widgets, plus two or more widgets for For a Motif-style paned window with two means five widgets are created (one frame for the paned widget; one frame for each pane; one frame for the for the sash handle).  Even assuming the existence of system, we may not be able to address the widget window implementation will be able to address both of should be more robust, reliable, and lightweight.  A will be able to access Tk's geometry management it will require only one widget for each paned window, difflnr"> difftxtr"> class="edit">IP 41: Paned Window Tk Widget Eric Melski <[email protected]> widget,tk,panedwindow a C-based paned window widget for inclusion in the window consists of one or more vertical or each pair separated by a movable "sash" and each called a "slave".  Paned windows are common in user interfaces and should therefore be provided Examples of the widget can be found in Netscape Messenger; many email clients; and graphical World Wide Web browser. Rationale other graphical toolkits in terms of the selection In order to keep Tk vibrant, it is imperative that the widget set be enhanced have become commonplace in modern graphical user such widget is the paned window widget.  A widget to create robust paned windows should be included widget could be implemented in C or in Tcl; in fact, paned window widgets already exist.  However, these mostly caused by the inability to completely manage of Tk windows from Tcl \(i.e. there is no way to make like _Tk\_MaintainGeometry_ or ns>_ManageGeometry_\).  This issue could possibly be addressed by the a proper megawidget system for Tk, but that goal seems If we wait for that system before it may be too late.  In addition, megawidget suffer from "widget bloat" - each paned window widget typically two widgets, plus two or more widgets for For a Motif-style paned window with two means five widgets are created \(one frame for the paned widget; one frame for each pane; one frame for the for the sash handle\).  Even assuming the existence of system, we may not be able to address the widget window implementation will be able to address both of should be more robust, reliable, and lightweight.  A will be able to access Tk's geometry management it will require only one widget for each paned window, data-startln="56" data-endln="75" id="skip188h37i14"> difflnl difflne">︙︙ class="diffln difflnl"> difftxtl"> a proper "Batteries Included" distribution, but like system, this seems like a goal far from reality is to distribute the widget with the core, but have it placed in a separate package and namespace.  This provides the same level of availability as direct inclusion in the core, but does not actually make the widget part of Tk directly.  There are two possible arguments in favor of this approach.  First, since this widget will be in its own namespace, future panedwindow widgets could be included without name conflicts.  However, if each widget is put in its own namespace, the name conflict has not actually been resolved.  The point of contention has simply been moved from the global command space to the global namespace space.  Namespaces make sense when grouping blocks of related functions and data, but widgets have only one command.  It's just as easy to pick a unique command name as a unique namespace name.  The second possible advantage is that the widget could be loaded on demand, rather than automatically being pulled in with Tk.  However, most machines that Tk runs on use a virtual memory system.  Thus, that are actually used will be resident in memory.  The benefit of incorporating this widget into the Tk distribution in this manner seem marginal. mage:41example Example Panedwindow Widget Specification for the paned window widget is included here: panedwindow - Create and manipulate panedwindow widgets panedwindow pathName ?options? OPTIONS -background           -height              -width -borderwidth          -orient -cursor               -relief See  the  options manual entry for details on the standard options. WIDGET-SPECIFIC OPTIONS Command-Line Name:-handlepad Database Name:  handlePad Database Class: HandlePad When sash handles are drawn, specifies the distance from  the top or left end of the sash (depending on the orientation of the widget) at which to draw the handle.  May be any value accepted by Tk_GetPixels. Command-Line Name:-handlesize Database Name:  handleSize Database Class: HandleSize Specifies the side length of a sash  handle.   Han- dles are always drawn as squares.  May be any value accepted by Tk_GetPixels. Command-Line Name:-opaqueresize Database Name:  opaqueResize Database Class: OpaqueResize Specifies whether panes should be resized as a sash is  moved (true), or if resizing should be deferred until the sash is placed (false). Command-Line Name:-sashcursor Database Name:  sashCursor Database Class: SashCursor Mouse cursor to use when over  a  sash.   If  null, sb_h_double_arrow   will  be  used  for  horizontal panedwindows, and sb_v_double_arrow  will  be  used for vertical panedwindows. Command-Line Name:-sashpad Database Name:  sashPad Database Class: SashPad Specifies  the  amount  of padding to leave of each side of a sash.   May  be  any  value  accepted  by Tk_GetPixels. Command-Line Name:-sashrelief Database Name:  sashRelief Database Class: SashRelief Relief  to  use when drawing a sash.  May be any of the standard Tk relief values. Command-Line Name:-sashwidth Database Name:  sashWidth Database Class: SashWidth Specifies the width of each sash.  May be any value accepted by Tk_GetPixels. Command-Line Name:-showhandle Database Name:  showHandle Database Class: ShowHandle Specifies whether or not sash handles should be shown. May be any valid Tcl boolean value. The panedwindow command creates a new window (given by the pathName argument) and makes it into a panedwindow widget. Additional  options,  described above, may be specified on the command line or in the option  database  to  configure aspects  of the panedwindow such as its default background color and relief.  The  panedwindow  command  returns  the path name of the new window. A   panedwindow  widget  contains  any  number  of  panes, arranged horizontally  or  vertically,  according  to  the value  of the -orient option.  Each pane contains one wid- get, and each pair of panes is  separated  by  a  moveable (via mouse movements) sash.  Moving a sash causes the wid- gets on either side of the sash to be resized. The panedwindow command creates a new  Tcl  command  whose name  is  the  same  as the path name of the panedwindow's window.  This command may be used to invoke various opera- tions on the widget.  It has the following general form: pathName option ?arg arg ...? PathName  is the name of the command, which is the same as the panedwindow widget's path name.  Option and  the  args determine  the exact behavior of the command.  The follow- ing commands are possible for panedwindow widgets: pathName add slave ?slave ...? ?option value ...? Add one or more slaves to the panedwindow, each  in a  separate  pane.   The  arguments  consist of the names of one or  more  slave  windows  followed  by pairs  of  arguments that specify how to manage the slaves.  Option may have any of the values accepted by the configure subcommand. pathName cget option Returns  the  current  value  of  the configuration option given by option.  Option may have any of the values accepted by the panedwindow command. pathName configure ?option? ?value option value ...? Query  or  modify  the configuration options of the widget.  If no option is specified, returns a  list describing  all  of the available options for path- Name (see Tk_ConfigureInfo for information  on  the format  of this list).  If option is specified with no value, then the command returns a list  describ- ing the one named option (this list will be identi- cal to  the  corresponding  sublist  of  the  value returned  if  no  option  is specified).  If one or more option-value pairs  are  specified,  then  the command modifies the given widget option(s) to have the given  value(s);   in  this  case  the  command returns an empty string. Option may have any of the values accepted by the panedwindow command. pathName forget slave ?slave ...? Remove the pane containing slave from the panedwin- dow.   All  geometry  management  options for slave will be forgotten. pathName identify x y Identify the panedwindow component  underneath  the point  given by x and y, in window coordinates.  If the point is over a sash  or  a  sash  handle,  the result  is  a two element list containing the index of the  sash  or  handle,  and  a  word  indicating whether  it  is over a sash or a handle, such as {0 sash} or {2 handle}.  If  the  point  is  over  any other  part  of  the  panedwindow, the result is an empty list. pathName proxy ?args? This command is used to query and change the  posi- tion  of  the sash proxy, used for rubberband-style pane resizing. It can take  any  of  the  following forms: pathName proxy coord Return a list containing the x and y coordi- nates of the most recent proxy location. pathname proxy forget Remove the proxy from the display. pathName proxy place x y Place  the  proxy  at  the  given  x  and  y coordinates. pathName sash ?args? This  command is used to query and change the posi- tion of sashes in the panedwindow.  It can take any of the following forms: pathName sash coord index Return  the  current x and y coordinate pair for the sash given by index.  Index must  be an  integer  between  0  and 1 less than the number of slaves in  the  panedwindow.   The coordinates  given are those of the top left corner of the region  containing  the  sash. pathName  sash dragto index x y This command computes the difference  between  the  given coordinates and the coordinates given to the last sash coord command for the given  sash. It then moves that sash the computed differ- ence.  The return value is the empty string. pathName sash mark index x y Records x and y for the sash given by index; used in conjunction with later  dragto  com- mands to move the sash. pathName sash place index x y Place  the  sash given by index at the given coordinates. pathName slavecget slave option Query a management option for slave.  Option may be any value allowed by the slaveconfigure subcommand. pathName slaveconfigure slave ?option? ?value option value Query  or  modify the management options for slave. If no option is specified, returns a list  describ- ing  all of the available options for pathName (see Tk_ConfigureInfo for information on the  format  of this  list).  If option is specified with no value, then the command returns a list describing the  one named  option  (this  list will be identical to the corresponding sublist of the value returned  if  no option  is specified).  If one or more option-value pairs are specified, then the command modifies  the given  widget option(s) to have the given value(s); in this case the command returns an  empty  string. The following options are supported: -after slave Insert  the slave after the slave specified. slave should be the name of a window already managed by pathName. -before slave Insert the slave before the slave specified. slave should be the name of a window already managed by pathName. -height size Specify  a height for the slave.  The height will be the outer  dimension  of  the  slave including its border, if any.  If size is an empty string, or if -height  is  not  speci- fied,  then  the height requested internally by the slave will  be  used  initially;  the height may later be adjusted by the movement of sashes in the panedwindow.  Size  may  be any value accepted by Tk_GetPixels. -minsize n Specifies  that the size of the slave cannot be made less than n.  This  constraint  only affects  the size of the widget in the paned dimension -- the x dimension for  horizontal panedwindows,  the  y dimension for vertical panedwindows.  May be any value accepted  by Tk_GetPixels. -padx n Specifies  a  non-negative  value indicating how much extra space to leave on  each  side of  the slave in the X-direction.  The value may  have  any  of  the  forms  accepted  by Tk_GetPixels. -pady n Specifies  a  non-negative  value indicating how much extra space to leave on  each  side of  the slave in the Y-direction.  The value may  have  any  of  the  forms  accepted  by Tk_GetPixels. -sticky style If   a  slave's  pane  is  larger  than  the requested  dimensions  of  the  slave,  this option  may be used to position (or stretch) the slave within  its  pane.   Style   is  a string  that  contains  zero  or more of the characters n, s, e or  w.   The  string  can optionally  contains  spaces  or commas, but they are ignored.  Each letter refers  to  a side  (north, south, east, or west) that the slave will "stick" to.  If both n and s  (or e  and  w)  are specified, the slave will be stretched to  fill  the  entire  height  (or width) of its cavity. -width size Specify  a  width  for the slave.  The width will be the outer  dimension  of  the  slave including its border, if any.  If size is an empty string, or if -width is not specified, then  the  width requested internally by the slave will be used initially; the width  may later  be adjusted by the movement of sashes in the panedwindow.  Size may be  any  value accepted by Tk_GetPixels. pathName slaves Returns  an  ordered list of the widgets managed by pathName. A pane is resized by grabbing the sash (or sash handle  if present)  and  dragging  with  the  mouse.  This is accom- plished via mouse motion bindings on the widget.   When  a sash  is moved, the sizes of the panes on each side of the sash, and thus the widgets in those panes, are adjusted. When a pane is resized from outside (eg, it is  packed  to expand  and fill, and the containing toplevel is resized), space is added to the final (rightmost or bottommost) pane in the window. Reference Implementation here has already been implemented, with The widget is included with the part of the ''tktable'' SourceForge project at possible future enhancements: of a weight for each pane, similar to the -weight option supported by when allocating space from a resize to panes in the image to be placed on the window sash, a la or Java Swing, to allow one-click expand and the -setgrid option such that if a pane contains the sash can only be moved in grid size steps. prohibited by the current design, and could be later date as enhancements to the widget. Copyright been placed in the public domain. difflnr"> class="difftxt difftxtr"> a proper "Batteries Included" distribution, but like system, this seems like a goal far from reality is to distribute the widget with the core, but have it placed in a separate package and namespace.  This provides the same level of availability as direct inclusion in the core, but does not actually make the widget part of Tk directly.  There are two possible arguments in favor of this approach.  First, since this widget will be in its own namespace, future panedwindow widgets could be included without name conflicts.  However, if each widget is put in its own namespace, the name conflict has not actually been resolved.  The point of contention has simply been moved from the global command space to the global namespace space.  Namespaces make sense when grouping blocks of related functions and data, but widgets have only one command.  It's just as easy to pick a unique command name as a unique namespace name.  The second possible advantage is that the widget could be loaded on demand, rather than automatically being pulled in with Tk.  However, most machines that Tk runs on use a virtual memory system.  Thus, that are actually used will be resident in memory.  The benefit of incorporating this widget into the Tk distribution in this manner seem marginal. Panedwindow Widget](../assets/41example.png) Specification for the paned window widget is included here: panedwindow - Create and manipulate panedwindow widgets panedwindow pathName ?options? OPTIONS -background           -height              -width -borderwidth          -orient -cursor               -relief See  the  options manual entry for details on the standard options. WIDGET-SPECIFIC OPTIONS Command-Line Name:-handlepad Database Name:  handlePad Database Class: HandlePad When sash handles are drawn, specifies the distance from  the top or left end of the sash (depending on the orientation of the widget) at which to draw the handle.  May be any value accepted by Tk_GetPixels. Command-Line Name:-handlesize Database Name:  handleSize Database Class: HandleSize Specifies the side length of a sash  handle.   Han- dles are always drawn as squares.  May be any value accepted by Tk_GetPixels. Command-Line Name:-opaqueresize Database Name:  opaqueResize Database Class: OpaqueResize Specifies whether panes should be resized as a sash is  moved (true), or if resizing should be deferred until the sash is placed (false). Command-Line Name:-sashcursor Database Name:  sashCursor Database Class: SashCursor Mouse cursor to use when over  a  sash.   If  null, sb_h_double_arrow   will  be  used  for  horizontal panedwindows, and sb_v_double_arrow  will  be  used for vertical panedwindows. Command-Line Name:-sashpad Database Name:  sashPad Database Class: SashPad Specifies  the  amount  of padding to leave of each side of a sash.   May  be  any  value  accepted  by Tk_GetPixels. Command-Line Name:-sashrelief Database Name:  sashRelief Database Class: SashRelief Relief  to  use when drawing a sash.  May be any of the standard Tk relief values. Command-Line Name:-sashwidth Database Name:  sashWidth Database Class: SashWidth Specifies the width of each sash.  May be any value accepted by Tk_GetPixels. Command-Line Name:-showhandle Database Name:  showHandle Database Class: ShowHandle Specifies whether or not sash handles should be shown. May be any valid Tcl boolean value. The panedwindow command creates a new window (given by the pathName argument) and makes it into a panedwindow widget. Additional  options,  described above, may be specified on the command line or in the option  database  to  configure aspects  of the panedwindow such as its default background color and relief.  The  panedwindow  command  returns  the path name of the new window. A   panedwindow  widget  contains  any  number  of  panes, arranged horizontally  or  vertically,  according  to  the value  of the -orient option.  Each pane contains one wid- get, and each pair of panes is  separated  by  a  moveable (via mouse movements) sash.  Moving a sash causes the wid- gets on either side of the sash to be resized. The panedwindow command creates a new  Tcl  command  whose name  is  the  same  as the path name of the panedwindow's window.  This command may be used to invoke various opera- tions on the widget.  It has the following general form: pathName option ?arg arg ...? PathName  is the name of the command, which is the same as the panedwindow widget's path name.  Option and  the  args determine  the exact behavior of the command.  The follow- ing commands are possible for panedwindow widgets: pathName add slave ?slave ...? ?option value ...? Add one or more slaves to the panedwindow, each  in a  separate  pane.   The  arguments  consist of the names of one or  more  slave  windows  followed  by pairs  of  arguments that specify how to manage the slaves.  Option may have any of the values accepted by the configure subcommand. pathName cget option Returns  the  current  value  of  the configuration option given by option.  Option may have any of the values accepted by the panedwindow command. pathName configure ?option? ?value option value ...? Query  or  modify  the configuration options of the widget.  If no option is specified, returns a  list describing  all  of the available options for path- Name (see Tk_ConfigureInfo for information  on  the format  of this list).  If option is specified with no value, then the command returns a list  describ- ing the one named option (this list will be identi- cal to  the  corresponding  sublist  of  the  value returned  if  no  option  is specified).  If one or more option-value pairs  are  specified,  then  the command modifies the given widget option(s) to have the given  value(s);   in  this  case  the  command returns an empty string. Option may have any of the values accepted by the panedwindow command. pathName forget slave ?slave ...? Remove the pane containing slave from the panedwin- dow.   All  geometry  management  options for slave will be forgotten. pathName identify x y Identify the panedwindow component  underneath  the point  given by x and y, in window coordinates.  If the point is over a sash  or  a  sash  handle,  the result  is  a two element list containing the index of the  sash  or  handle,  and  a  word  indicating whether  it  is over a sash or a handle, such as {0 sash} or {2 handle}.  If  the  point  is  over  any other  part  of  the  panedwindow, the result is an empty list. pathName proxy ?args? This command is used to query and change the  posi- tion  of  the sash proxy, used for rubberband-style pane resizing. It can take  any  of  the  following forms: pathName proxy coord Return a list containing the x and y coordi- nates of the most recent proxy location. pathname proxy forget Remove the proxy from the display. pathName proxy place x y Place  the  proxy  at  the  given  x  and  y coordinates. pathName sash ?args? This  command is used to query and change the posi- tion of sashes in the panedwindow.  It can take any of the following forms: pathName sash coord index Return  the  current x and y coordinate pair for the sash given by index.  Index must  be an  integer  between  0  and 1 less than the number of slaves in  the  panedwindow.   The coordinates  given are those of the top left corner of the region  containing  the  sash. pathName  sash dragto index x y This command computes the difference  between  the  given coordinates and the coordinates given to the last sash coord command for the given  sash. It then moves that sash the computed differ- ence.  The return value is the empty string. pathName sash mark index x y Records x and y for the sash given by index; used in conjunction with later  dragto  com- mands to move the sash. pathName sash place index x y Place  the  sash given by index at the given coordinates. pathName slavecget slave option Query a management option for slave.  Option may be any value allowed by the slaveconfigure subcommand. pathName slaveconfigure slave ?option? ?value option value Query  or  modify the management options for slave. If no option is specified, returns a list  describ- ing  all of the available options for pathName (see Tk_ConfigureInfo for information on the  format  of this  list).  If option is specified with no value, then the command returns a list describing the  one named  option  (this  list will be identical to the corresponding sublist of the value returned  if  no option  is specified).  If one or more option-value pairs are specified, then the command modifies  the given  widget option(s) to have the given value(s); in this case the command returns an  empty  string. The following options are supported: -after slave Insert  the slave after the slave specified. slave should be the name of a window already managed by pathName. -before slave Insert the slave before the slave specified. slave should be the name of a window already managed by pathName. -height size Specify  a height for the slave.  The height will be the outer  dimension  of  the  slave including its border, if any.  If size is an empty string, or if -height  is  not  speci- fied,  then  the height requested internally by the slave will  be  used  initially;  the height may later be adjusted by the movement of sashes in the panedwindow.  Size  may  be any value accepted by Tk_GetPixels. -minsize n Specifies  that the size of the slave cannot be made less than n.  This  constraint  only affects  the size of the widget in the paned dimension -- the x dimension for  horizontal panedwindows,  the  y dimension for vertical panedwindows.  May be any value accepted  by Tk_GetPixels. -padx n Specifies  a  non-negative  value indicating how much extra space to leave on  each  side of  the slave in the X-direction.  The value may  have  any  of  the  forms  accepted  by Tk_GetPixels. -pady n Specifies  a  non-negative  value indicating how much extra space to leave on  each  side of  the slave in the Y-direction.  The value may  have  any  of  the  forms  accepted  by Tk_GetPixels. -sticky style If   a  slave's  pane  is  larger  than  the requested  dimensions  of  the  slave,  this option  may be used to position (or stretch) the slave within  its  pane.   Style   is  a string  that  contains  zero  or more of the characters n, s, e or  w.   The  string  can optionally  contains  spaces  or commas, but they are ignored.  Each letter refers  to  a side  (north, south, east, or west) that the slave will "stick" to.  If both n and s  (or e  and  w)  are specified, the slave will be stretched to  fill  the  entire  height  (or width) of its cavity. -width size Specify  a  width  for the slave.  The width will be the outer  dimension  of  the  slave including its border, if any.  If size is an empty string, or if -width is not specified, then  the  width requested internally by the slave will be used initially; the width  may later  be adjusted by the movement of sashes in the panedwindow.  Size may be  any  value accepted by Tk_GetPixels. pathName slaves Returns  an  ordered list of the widgets managed by pathName. A pane is resized by grabbing the sash (or sash handle  if present)  and  dragging  with  the  mouse.  This is accom- plished via mouse motion bindings on the widget.   When  a sash  is moved, the sizes of the panes on each side of the sash, and thus the widgets in those panes, are adjusted. When a pane is resized from outside (eg, it is  packed  to expand  and fill, and the containing toplevel is resized), space is added to the final (rightmost or bottommost) pane in the window. Reference Implementation here has already been implemented, with The widget is included with the part of the _tktable_ SourceForge project at able.sourceforge.net> possible future enhancements: of a weight for each pane, similar to the -weight option supported by when allocating space from a resize to panes in the image to be placed on the window sash, a la or Java Swing, to allow one-click expand and the -setgrid option such that if a pane contains the sash can only be moved in grid size steps. prohibited by the current design, and could be later date as enhancements to the widget. Copyright been placed in the public domain.

Name change from tip/410.tip to tip/410.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134

TIP:            410
Title:          Three Features of scan Adapted for binary scan/format
Version:        $Revision: 1.2 $
Author:         Andreas Leitgeb <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        26-Aug-2012
Post-History:   
Tcl-Version:    8.7

~ Abstract

This proposal specifies three new features for '''binary scan''' and '''binary
format''' that already exist similarly for '''scan''', namely: '''#''' for
consuming a count-value from the parameter list (like "'''scan %*'''"),
'''p''' for writing current position to a consumed parameter variable (like
"'''scan %n'''") and returning a single parsed value if no parameter is left.

~ Rationale

Experience with '''binary format''' and '''binary scan''' indicates that there
are some features of '''scan''' which it would be highly desirable to have. In
particular, the ability to take an item length as a separate parameter, to
store the current location, and to return a '''single matched value''' when
last variable is not supplied would all be highly desirable.

Different symbols for some of the operations have had to be chosen, as both
"*" and "n" already exist and have a different meaning for '''binary scan'''
and '''binary format'''. Also, unlike with ''scan''. no list of values shall
be returned (except for a single counted conversion), but instead only one
extra conversion character allowed. Experience with scan shows that people
tend to forget about the list-layer and use ''[scan "08" %d]'' directly as
a number, which, while safe for integers, is just the wrong thing to do.

The TIP-Author believes that these are all rather
"low hanging fruit". If this turns out not to be the case, then any
controversial one of these features shall be moved to its own TIP.

~ Proposal

A "'''#'''" (number sign) at a place in the format-string where a number or a
"'''*'''" is currently allowed, shall consume one item from the parameter list
and interpret it as a number. It shall only occur after a conversion specifier
that accepts trailing numbers. The parameter consumed for "'''#'''" is the one
''after'' the parameter used for the conversion specifier itself, as the
"'''#'''" follows that specifier.

A new conversion specifier "'''p'''" shall not accept a trailing count and
consume one item from parameter list and interpret it as the name of a local
variable into which to store the current cursor-position. No data is consumed
for '''binary scan''' and no data produced for '''binary format'''.

A '''binary scan''' with a format-string that contains '''one''' data
conversion specifier '''more''' than variable parameters shall return
the remaining converted value (or an empty string if the last conversion
wasn't successful).

~~ Details

A "'''p'''"-conversion is not counted. In classic usage with variable
parameters, the return value of '''binary scan''' gives only the number of
real data conversions, thus not counting "'''@'''", "'''x'''", "'''X'''" or
"'''p'''".

A "'''#'''" given as count will always imply a list of values written to the
variable, even if the value is "1" and the list is of length 1. A negative
value could change the direction for relative movements "'''x'''" and
"'''X'''", and is treated as 0 in all other cases. A non-numeric value
(including the empty string!) given for a "'''#'''" causes the '''binary'''
command to return an error, just like garbage in the format string would. It
is explicitly not intended to get single-value behaviour with "'''#'''" and
empty string, nor have the separate count-value contain an asterisk or further
conversion characters.

~ Further Ideas

Eventually, as a special case for '''binary scan''', the following idiom shall
be allowed outside of the basic specification:

|  binary scan "\0\0\0\x2A" "I p" pos

returns 42 and then writes 4 to variable pos.

While the idiom would be quite practical, there is a risk of reader's
confusion about which value would be written and which returned, despite
unambiguous definition. Also, this one might turn out to be less than trivial
to implement, as it would require some lookahead to reserve the remaining
parameter for "p", not for "I" that is currently at hand.

~ Examples

|   # pad out 12 nuls, then set cursor to 0, write an 
|   #   int, record position, then write another int.
|   set data [binary format "x# @0 I p I" 12 1 pos 42]
|   --> data: "\0\0\0\1\0\0\0\x2A\0\0\0\0"  pos: 4
|
|   # set cursor position from value of first param, scan 
|   # three items from the data and write them to the
|   # next three parameter variables, then write new cursor
|   # position to next parameter variable.
|   binary scan $data "@# Iss p" $pos beI leS1 leS2 pos
|   --> beI: 42  leS1:0 leS2:0 pos:12
|
|   # 4 is value for "#", no further param, thus return 
|   # result for "I"
|   set val [binary scan $data "@# I" 4]
|   --> val: 42
|
|   # error case: more than one conversion to return
|   set val [binary scan $data "@# II" 4]
|   --> error: "not enough arguments for all format specifiers"
|
|   # extra "further ideas" feature:
|   set val [binary scan $data "@# I p" 4 pos]
|   --> val: 42 pos: 8

~ Rejected Alternatives

For a format string "'''a#'''", one could have argued to use first parameter
for the count and second for the conversion target, as that would be the order
of relevance (count is needed before the resulting value is even generated). I
think, this is a bikeshedding issue, and any choice is a good choice, so I
went with order of occurrence, thus "'''a#'''" expects the target variable
first, and the count second.

It would be possible to use "'''@'''" without a count instead of "'''p'''",
but I consider it dangerous, when a previous error turns into overwriting of
variables. I consider a typo "'''@ 42 ...'''" to be common enough to not want
to give it a new unexpected meaning and side-effect.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|
|
|
|
|

|

|
|

|

|
|
|

|

|

|
|

|
|
|

|

|

|
|
|
|

|

|
|
|
|

|

|
|
|

|

|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134

# TIP 410: Three Features of scan Adapted for binary scan/format

	Author:         Andreas Leitgeb <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        26-Aug-2012
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This proposal specifies three new features for **binary scan** and **binary
format** that already exist similarly for **scan**, namely: **\#** for
consuming a count-value from the parameter list \(like "**scan %\***"\),
**p** for writing current position to a consumed parameter variable \(like
"**scan %n**"\) and returning a single parsed value if no parameter is left.

# Rationale

Experience with **binary format** and **binary scan** indicates that there
are some features of **scan** which it would be highly desirable to have. In
particular, the ability to take an item length as a separate parameter, to
store the current location, and to return a **single matched value** when
last variable is not supplied would all be highly desirable.

Different symbols for some of the operations have had to be chosen, as both
"\*" and "n" already exist and have a different meaning for **binary scan**
and **binary format**. Also, unlike with _scan_. no list of values shall
be returned \(except for a single counted conversion\), but instead only one
extra conversion character allowed. Experience with scan shows that people
tend to forget about the list-layer and use _[scan "08" %d]_ directly as
a number, which, while safe for integers, is just the wrong thing to do.

The TIP-Author believes that these are all rather
"low hanging fruit". If this turns out not to be the case, then any
controversial one of these features shall be moved to its own TIP.

# Proposal

A "**\#**" \(number sign\) at a place in the format-string where a number or a
"**\***" is currently allowed, shall consume one item from the parameter list
and interpret it as a number. It shall only occur after a conversion specifier
that accepts trailing numbers. The parameter consumed for "**\#**" is the one
_after_ the parameter used for the conversion specifier itself, as the
"**\#**" follows that specifier.

A new conversion specifier "**p**" shall not accept a trailing count and
consume one item from parameter list and interpret it as the name of a local
variable into which to store the current cursor-position. No data is consumed
for **binary scan** and no data produced for **binary format**.

A **binary scan** with a format-string that contains **one** data
conversion specifier **more** than variable parameters shall return
the remaining converted value \(or an empty string if the last conversion
wasn't successful\).

## Details

A "**p**"-conversion is not counted. In classic usage with variable
parameters, the return value of **binary scan** gives only the number of
real data conversions, thus not counting "**@**", "**x**", "**X**" or
"**p**".

A "**\#**" given as count will always imply a list of values written to the
variable, even if the value is "1" and the list is of length 1. A negative
value could change the direction for relative movements "**x**" and
"**X**", and is treated as 0 in all other cases. A non-numeric value
\(including the empty string!\) given for a "**\#**" causes the **binary**
command to return an error, just like garbage in the format string would. It
is explicitly not intended to get single-value behaviour with "**\#**" and
empty string, nor have the separate count-value contain an asterisk or further
conversion characters.

# Further Ideas

Eventually, as a special case for **binary scan**, the following idiom shall
be allowed outside of the basic specification:

	  binary scan "\0\0\0\x2A" "I p" pos

returns 42 and then writes 4 to variable pos.

While the idiom would be quite practical, there is a risk of reader's
confusion about which value would be written and which returned, despite
unambiguous definition. Also, this one might turn out to be less than trivial
to implement, as it would require some lookahead to reserve the remaining
parameter for "p", not for "I" that is currently at hand.

# Examples

	   # pad out 12 nuls, then set cursor to 0, write an 
	   #   int, record position, then write another int.
	   set data [binary format "x# @0 I p I" 12 1 pos 42]
	   --> data: "\0\0\0\1\0\0\0\x2A\0\0\0\0"  pos: 4

	   # set cursor position from value of first param, scan 
	   # three items from the data and write them to the
	   # next three parameter variables, then write new cursor
	   # position to next parameter variable.
	   binary scan $data "@# Iss p" $pos beI leS1 leS2 pos
	   --> beI: 42  leS1:0 leS2:0 pos:12

	   # 4 is value for "#", no further param, thus return 
	   # result for "I"
	   set val [binary scan $data "@# I" 4]
	   --> val: 42

	   # error case: more than one conversion to return
	   set val [binary scan $data "@# II" 4]
	   --> error: "not enough arguments for all format specifiers"

	   # extra "further ideas" feature:
	   set val [binary scan $data "@# I p" 4 pos]
	   --> val: 42 pos: 8

# Rejected Alternatives

For a format string "**a\#**", one could have argued to use first parameter
for the count and second for the conversion target, as that would be the order
of relevance \(count is needed before the resulting value is even generated\). I
think, this is a bikeshedding issue, and any choice is a good choice, so I
went with order of occurrence, thus "**a\#**" expects the target variable
first, and the count second.

It would be possible to use "**@**" without a count instead of "**p**",
but I consider it dangerous, when a previous error turns into overwriting of
variables. I consider a typo "**@ 42 ...**" to be common enough to not want
to give it a new unexpected meaning and side-effect.

# Copyright

This document has been placed in the public domain.

Name change from tip/411.tip to tip/411.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119

120
121
122
123
124
125
126

127
128
129
130

131
132
133
134
135
136
137

138
139
140
141
142

143
144
145
146
147

148
149
150
151
152

153
154
155
156
157
158

159
160
161
162
163
164
165
166
167
168
169
170
171
172

TIP:            411
Title:          Improved Channel Introspection via "chan info"
Version:        $Revision: 1.4 $
Author:         Pawel Salawa <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        31-Aug-2012
Post-History:   
Tcl-Version:    8.7

~ Abstract

This document describes new subcommand for '''chan''', '''chan info''', that
provides a unified interface to deeper introspection of information about a
particular channel.

~ Rationale

When working with Tcl channels sometimes it happens that we got the channel,
but we don't know if it's a file channel, socket, or reflected channel. This
information can be very useful. Also some additional information, depending of
the channel type, like file path for file channel, host and port for sockets
(it's already available, but could get unified within new '''chan'''
subcommand), or any metadata provided by reflected channels.

An example where it could be used is the package with an API that accepts just
a channel on input call and the inside routines need to do something with the
file (in file system), so they have to learn the name of the file related to
given channel.

~ Specification

A new subcommand for '''chan''' is introduced:

 > '''chan info''' ''channelId''

Also a new optional command is introduced for reflected channels API:

 > ''cmdPrefix'' '''chaninfo''' ''channelId''

~~ The info Subcommand of chan

The '''chan info''' command will take a single mandatory argument,
''channelId'', which will be the name of a channel to retrieve information
about. This operation will always fail in a safe interpreter. The result of
the new '''chan info''' command would be a dictionary with following keys
always present:

 type: indicating a type of channel. Possible values are "'''file'''",
   "'''socket'''", "'''process'''" (result of [['''open''' "|..."]]), empty
   string (in case of channel that doesn't support this information), or any
   custom type, depending of refchan implementations. This is a mandatory key.

The remainder of the keys are optional and depend on the type.

For '''file''' channels, the dictionary shall include these:

 path: full, normalized path to the file, including the file name.

 new: boolean value indicating whether file already existed while opening, or
   it was created.

For '''socket''' channels, the dictionary shall include these:

 host: peer hostname, or local hostname for listening socket. This is
   partially equivalent to getting the first value returned by [['''chan
   configure''' ''channelId'' '''-peername''']] for connected sockets.

 port: peer port, or listening port (for listening socket). This is partially
   equivalent to getting the third value returned by [['''chan configure'''
   ''channelId'' '''-peername''']] for connected sockets.

 side: one of the
 following: "'''client'''", "'''accepted'''", or "'''listening'''".

For '''process''' channels, the dictionary shall include these:

 cmdline: copy of the command passed to '''open'''.

 pid: PID of a spawned process, as produced by '''pid'''.

Any key could be produced by other channel types, notably including reflected channels.

~~ The chaninfo Operation of Reflected Channel Implementations

The '''chaninfo''' subcommand of a reflected channel implementation command
returns a dict that is provided in response to a '''chan info''' request. If
the dictionary does not include the mandatory '''type''' member, the reflected
channel baseline implementation will add it and set it to '''refchan'''. It is
an error to return a non-dictionary.

Since reflected channels are free to set the type to anything, they can
simulate standard channels, like "'''file'''", as well as create completely
new types.

If the operation is not supported, the baseline implementation will treat it
the same as if the operation returned an empty dictionary.

~ Internals

Channel structure in Tcl core would require another API level indicating
channels that have a function returning an "info" dict. All core channels are
expected to migrate to this level, although it's possible to stay at current
API version - it will just cause the '''type''' in '''chan info''' dict to be
the ''typeName'' field of the channel's ''Tcl_ChannelType'' structure, with no
additional keys in the dict.

~ Examples

This is a a pure Tcl implementation of file type channel, so it supports new
information in '''chan info''':

|oo::class create filechan {
|    variable path fd created filemode
|    constructor {fpath mode} {
|        set filemode $mode
|        set path $fpath
|    }

|
|    method initialize {ch mode} {
|        set exists [file exists $path]
|        set fd [open $path $filemode]
|        set created [expr { [file exists $path] && !$exists}]
|        return "initialize finalize watch read seek chaninfo"
|    }

|    method finalize {ch} {
|        ::close $fd
|        my destroy
|    }

|    method watch {ch events} {
|        foreach event [list read write] method [list readable writable] {
|            if {$event in $events} {
|                fileevent $fd $method [list chan postevent $ch $event]
|            }
|        }
|    }

|
|    # Must be present on a readable channel
|    method read {ch count} {
|        ::read $fd $count
|    }

|
|    # This method is optional, but useful for the example below
|    method seek {ch offset base} {
|        ::seek $fd $offset $base
|    }

|
|    method chaninfo {ch} {
|        dict create type file path $path new $created
|    }
|}

|
|proc openfile {file mode} {
|    # lets not bother of what modes should be passed to [chan create],
|    # it's just an example...
|    chan create [list read write] [filechan new $file $mode]
|}

|
|set fd [openfile "myfile.txt" r]
|puts [chan info $fd]
|close $fd

~ Reference Implementation

http://sqlitestudio.pl/tcl/patches/tip-411-chan_info.patch

Patch made against 8.6.0 (just before final release).

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|
|

|

|

|

|

|

|

|
|

|

|
|
|

|

|

|
|

|
|
|

|

|

|

|

|

|
|
|
|

|

|

|
|

|

|

|
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
|
|
<
>
|
|
|
|
<
<
<
>
>
>
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
<
<
>
>
|
|
|
|
|
<
>
|
|
|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117

118
119
120
121
122
123
124

125
126
127
128

129
130
131
132
133

134
135
136
137
138
139
140

141
142
143
144
145

146
147
148
149

150
151
152
153
154
155
156

157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172

# TIP 411: Improved Channel Introspection via "chan info"

	Author:         Pawel Salawa <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        31-Aug-2012
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This document describes new subcommand for **chan**, **chan info**, that
provides a unified interface to deeper introspection of information about a
particular channel.

# Rationale

When working with Tcl channels sometimes it happens that we got the channel,
but we don't know if it's a file channel, socket, or reflected channel. This
information can be very useful. Also some additional information, depending of
the channel type, like file path for file channel, host and port for sockets
\(it's already available, but could get unified within new **chan**
subcommand\), or any metadata provided by reflected channels.

An example where it could be used is the package with an API that accepts just
a channel on input call and the inside routines need to do something with the
file \(in file system\), so they have to learn the name of the file related to
given channel.

# Specification

A new subcommand for **chan** is introduced:

 > **chan info** _channelId_

Also a new optional command is introduced for reflected channels API:

 > _cmdPrefix_ **chaninfo** _channelId_

## The info Subcommand of chan

The **chan info** command will take a single mandatory argument,
_channelId_, which will be the name of a channel to retrieve information
about. This operation will always fail in a safe interpreter. The result of
the new **chan info** command would be a dictionary with following keys
always present:

 type: indicating a type of channel. Possible values are "**file**",
   "**socket**", "**process**" \(result of [**open** "\|..."]\), empty
   string \(in case of channel that doesn't support this information\), or any
   custom type, depending of refchan implementations. This is a mandatory key.

The remainder of the keys are optional and depend on the type.

For **file** channels, the dictionary shall include these:

 path: full, normalized path to the file, including the file name.

 new: boolean value indicating whether file already existed while opening, or
   it was created.

For **socket** channels, the dictionary shall include these:

 host: peer hostname, or local hostname for listening socket. This is
   partially equivalent to getting the first value returned by [**chan
   configure** _channelId_ **-peername**] for connected sockets.

 port: peer port, or listening port \(for listening socket\). This is partially
   equivalent to getting the third value returned by [**chan configure**
   _channelId_ **-peername**] for connected sockets.

 side: one of the
 following: "**client**", "**accepted**", or "**listening**".

For **process** channels, the dictionary shall include these:

 cmdline: copy of the command passed to **open**.

 pid: PID of a spawned process, as produced by **pid**.

Any key could be produced by other channel types, notably including reflected channels.

## The chaninfo Operation of Reflected Channel Implementations

The **chaninfo** subcommand of a reflected channel implementation command
returns a dict that is provided in response to a **chan info** request. If
the dictionary does not include the mandatory **type** member, the reflected
channel baseline implementation will add it and set it to **refchan**. It is
an error to return a non-dictionary.

Since reflected channels are free to set the type to anything, they can
simulate standard channels, like "**file**", as well as create completely
new types.

If the operation is not supported, the baseline implementation will treat it
the same as if the operation returned an empty dictionary.

# Internals

Channel structure in Tcl core would require another API level indicating
channels that have a function returning an "info" dict. All core channels are
expected to migrate to this level, although it's possible to stay at current
API version - it will just cause the **type** in **chan info** dict to be
the _typeName_ field of the channel's _Tcl\_ChannelType_ structure, with no
additional keys in the dict.

# Examples

This is a a pure Tcl implementation of file type channel, so it supports new
information in **chan info**:

	oo::class create filechan {
	    variable path fd created filemode
	    constructor {fpath mode} {
	        set filemode $mode
	        set path $fpath

	    }

	    method initialize {ch mode} {
	        set exists [file exists $path]
	        set fd [open $path $filemode]
	        set created [expr { [file exists $path] && !$exists}]
	        return "initialize finalize watch read seek chaninfo"

	    }
	    method finalize {ch} {
	        ::close $fd
	        my destroy

	    }
	    method watch {ch events} {
	        foreach event [list read write] method [list readable writable] {
	            if {$event in $events} {
	                fileevent $fd $method [list chan postevent $ch $event]

	            }
	        }
	    }

	    # Must be present on a readable channel
	    method read {ch count} {
	        ::read $fd $count

	    }

	    # This method is optional, but useful for the example below
	    method seek {ch offset base} {
	        ::seek $fd $offset $base

	    }

	    method chaninfo {ch} {
	        dict create type file path $path new $created

	    }
	}

	proc openfile {file mode} {
	    # lets not bother of what modes should be passed to [chan create],
	    # it's just an example...
	    chan create [list read write] [filechan new $file $mode]

	}

	set fd [openfile "myfile.txt" r]
	puts [chan info $fd]
	close $fd

# Reference Implementation

<http://sqlitestudio.pl/tcl/patches/tip-411-chan\_info.patch>

Patch made against 8.6.0 \(just before final release\).

# Copyright

This document has been placed in the public domain.

Name change from tip/412.tip to tip/412.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354

355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377

378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403

404
405
406
407
408

409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475

TIP:            412
Title:          Dynamic Locale Changing for msgcat with On-Demand File Load
Version:        $Revision: 1.9 $
Author:         Harald Oehlmann <[email protected]>
Author:         Harald Oehlmann <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        27-Mar-2012
Post-History:   
Keywords:       Tcl,localization,msgcat
Obsoletes:      399
Tcl-Version:    8.6

~ Abstract

This TIP adds dynamic locale switching capabilities to the '''msgcat'''
package.

~ Rationale

~~ Dynamic Locale Switching

Within a multi-language application like a web-server, one may change the locale quite frequently, for example if users with different locales are requesting pages. Unfortunately, this does not fit well with the model adopted by the msgcat package, which assumes that all code follows this sequence:

 1.
 Set locale list: '''mclocale''' ''locale''

 2.
 Load language files with other package load: '''mcload''' ''msg-folder''

 3.
 Translate strings: '''mc''' ''key args...''

Note that if the locale should be changed after other packages are loaded, one must restart at step 2. This requires reloading all packages which is mostly not practical.

The aim of this TIP is to extend the package by dynamic locale change capabilities.

msgcat will reload any missing message catalog files of all currently loaded packages on a locale change. In addition, any package may register to get informed to a locale change. Other packages may do changes to reflect the locale change like rebuilding the GUI.

This TIP compares to [399] that the package is able to load message catalog files on demand, e.g. specially on a locale change.

~~ package locale

If the clock command gets called with the argument "-locale", the locale is changed using msgcat::mclocale. After processing, the initial value is restored. The package keeps track, which locales where already used and calls msgcat::mcload for any new locale. The locale is restored after processing.

This is an implementation of dynamic locales but conflicts with the new features described above. Other packages may be informed to change the locale and may trigger expensive operations like a rebuild of the GUI.

In consequence, each package may define a package locale which is independent of the default locale.

~ Overview of the proposed solution

Proposed changes in brief:

~~ Dynamically load message catalog files

if the locale is changed by mclocale locale, the message file load process is executed for every present package.

~~ Package locale

A package may install a package local locale which is independent to the global locale.

~~ Locale change callback

A callback may be registered to get informed about the change of locale. A use case is to refresh a GUI if the locale changed.

~~ Non message file operation

A program may use message files to issue mcset commands or may issue them by other means, if the message catalogs are, for example, stored in a data base.

Each package may register a callback to get informed that a certain locale should be loaded and may issue the corresponding mcset commands.

~~ Package mcunknown

A package may have a certain way to provide translations for message keys not included in the message catalog. Thus, it may register an own package message unknown callback to provide a translation.

~ Specification

~~ Package Equals Client Namespace

A client package is a package which uses msgcat. A unique namespace is required for each client package. Within msgcat, namespace and package is always connected.

Up to now, the msgcat package used this namespace as an identifier to store the catalog data of a certain package. This is now extended to additional properties which are stored for a package.

~~ Package locale

A package locale may be used by a package instead the default locale set by msgcat::mclocale. A package may choose to use a package locale or the default locale.

~~ Default and Package State

Some state values (like the locale) are available as default (global) values. In addition, each package may choose to use a package locale state.

The used naming is:

    default

     state: valid for all packages which do not set a package state.

    package

     state: only valid for one package if it has set a package state.

The following state values are present as default state and may be set individually per package:

    The locale like "de_ch".
    The preferences property is a list of locales in their preference order and is automatically computed from locale. Example locale = "de_ch" -> preferences = "de_ch de {}".
    The loadedlocales state value is the list of currently loaded locales.

~~ Default State

The following standard methods exist to get or set the default state:

~~~ msgcat::mclocale

The default locale. It may be read using msgcat::mclocale.

It may be set using msgcat::mclocale locale. This command is extended, that the message catalogs of all missing locales for all packages not having set a package state are loaded.

~~~ msgcat::mcpreferences

Get the default preferences (derived from the default locale).

~~~ msgcat::mcloadedlocales

The following new command may be used to deal with the default state:

 > '''msgcat::mcloadedlocales''' ''subcommand'' ''?locale?''

The parameter locale is mandatory for the subcommand present.

The following subcommands are available:

~~~ Subcommand "get"

Get the list of current loaded locales

~~~ Subcommand "present"

Returns true, if the given locale is loaded

~~~ Subcommand "clear"

The list of currently loaded locales is set to mcpreferences and all message catalog keys of packages without a package locale set and with locales not in mcpreferences are unset.

~~ Package Configuration

The package configuration of the calling package may be changed using the following new command:

 > '''msgcat::mcpackagelocale''' ''subcommand'' ?''locale''?

The parameter locale is mandatory for the subcommands set and present.

Available subcommands are:

~~~ Subcommand "set"

Set or change the package locale.

The global state values are copied, if there were no package locale set before.

The package locale is changed to the optional given new package locale.

~~~ Subcommand "get"

Return the package locale or the default locale, if no package locale set.

~~~ Subcommand "preferences"

Return the package preferences or the default preferences, if no package locale set.

~~~ Subcommand "loaded"

The list of locales loaded for this package is returned.

~~~ Subcommand "isset"

Returns true, if a package locale is set.

~~~ Subcommand "unset"

Unset the package locale and use the default state for the package. Load all message catalog files of the package for locales, which were not present in the package loadedlocales list and are present in the default list.

~~~ Subcommand "present"

Returns true, if the given locale is loaded

~~~ Subcommand "clear"

Set the current loaded locales list of the package to preferences and unset all message catalog keys of the package with locales not included in the package preferences.

~~ Package Configuration Options

Each package may have a set of configuration options set to invoke certain actions. They may be retrieved or changed with the following new command:

 > '''msgcat::mcpackageconfig''' ''subcommand option'' ?''value''?

Available subcommands are:

 get: Get the current value of the option or an error if not set.

 isset: Returns true if option is set.

 set: Set the given value to the option. May have additional consequences and
   return values as described in the option section.

 unset: Unset the option.

Available options are:

~~~ Package Option "mcfolder"

This is the message folder of the package. This option is set by mcload and by the subcommand set. Both are identical and both return the number of loaded message catalog files.

Setting or changing this value will load all locales contained in the preferences valid for the package. This implies also to invoke any set loadcmd (see below).

Unsetting this value will disable message file load for the package.

If the locale valid for this package changes, this value is used to eventually load message catalog files.

Message catalog files are always sourced in the namespace of the package registering the value.

~~~ Package Option "loadcmd"

This callback is invoked before a set of message catalog files are loaded for the package which has this property set.

This callback may be used to do any preparation work for message file load or to get the message data from another source like a data base. In this case, no message files are used (mcfolder is unset).

See chapter callback invocation below. The parameter list appended to this callback is the list of locales to load.

If this callback is changed, it is called with the preferences valid for the package.

~~~ Package Option "changecmd"

This callback is invoked when a default local change was performed. Its purpose is to allow a package to update any dependency on the default locale like showing the GUI in another language.

Tk may be extended to register to this callback and to invoke a virtual event.

See the callback invocation section below. The parameter list appended to this callback is mcpreferences. All registered packages are invoked in no particular order.

~~~ Package Option "unknowncmd"

Use a package locale mcunknown procedure instead of the standard version supplied by the msgcat package (msgcat::mcunknown).

The called procedure must return the formatted message which will finally be returned by msgcat::mc.

A generic unknown handler is used if set to the empty string. This consists in returning the key if no arguments are given. With given arguments, format is used to process the arguments.

See chapter callback invocation below. The appended arguments are identical to mcunknown.

~~~ Callback Invocation

Callbacks are invoked under the following conditions:

    the callback command is set,
    the command is not the empty string,
    the registration namespace exists.

Any error within the callback stops the operation which invoked the callback. This might be surprising, as the error might be in another package.

~~ Test if Message Key is Set

Message catalog keys may be expensive to calculate and thus may be set on demand.

The following new procedure returns false, if mc would call mcunknown for a key:

 > '''msgcat::mcexists''' ''src''

There are two options, to limit the key search to just the current namespace (don't search in parent namespaces) and just the current locale (don't search the preferences but the first item):

 > '''msgcat::mcexists''' ?'''-exactnamespace'''? ?''-exactlocale''? ''src''

~~ forget package

A package may clear all its keys and state using the new command:

 > '''msgcat::mcforgetpackage'''

~~ Locale and Preferences Format

Locales set by mcset may eventually not correspond to the current preferences, as the preferences are treated as follows:

    put to lower case,
    remove any multiple "_" and any "_" at the beginning or at the end of the

     locale.

It is proposed, that:

    the locale and the first preferences element is always identical to the

      lowercase passed locale,

    any multiple "_" are seen as one separator.

Example: preferences of locale "sy__cyrl_win"

    current preferences: "sy_cyrl_win sy_cyrl sy"
    proposed preferences: "sy__cyrl_win sy__cyrl sy".

Alternatively, all locales may normalized using the upper algorithm, which felt heavy in computation with little gain.

~ Example Usage

~~ Example from TIP #399

Imagine an application which supports the current user language and French, German and English. An external package tp is used. The package uses msgcat and installs itself during the package require tp call:

|package require msgcat
|msgcat::mcload [file join [file dirname [info script]] msgs]

An implementation of the application with the current msgcat 1.5.0 would require the following initialization sequence:

|package require msgcat
|package require np

and the following code to change the locale to French:

|package forget np
|msgcat::mclocale fr
|package require np

Using the extension of this TIP, one may load as usual:

|package require msgcat
|package require np

and to change to french locale:

|msgcat::mclocale fr

The first time, a locale is required, all corresponding message files of all packages which use msgcat get loaded. This might be a heavy operation.

If a locale is reactivated (and the message catalog data was not cleared), it is a quick operation.

Without this TIP, it is computational expensive (if possible, as many packages are not reloadable or a reload may disturb current processing, e.g., by forcing the closing of sockets, etc.).

~~ Change with No Need to Come Back

If it is certain that a locale is changed and the then obsolete data is of no use, one may clear unused message catalog items:

|msgcat::mclocale fr
|msgcat::mcloadedlocale clear

~~ Use a Callback to be Notified About a Locale Change

Packages which display a GUI may update their widgets when the locale changes. To register to a callback, use:

|namespace eval gui {
| msgcat::mcpackageconfig changecmd updateGUI
|
| proc updateGui args {
| puts "New locale is '[lindex $args 0]'."
| }
|}

|
|% msgcat::mclocale fr
|fr
|% New locale is 'fr'.

~~ To Use Another Locale Source than Message Catalog Files

If locales (or additional locales) are contained in another source like a data base, a package may use the load callback and not mcload:

|namespace eval db {
| msgcat::mcpackageconfig loadcmd loadMessages
| msgcat::mcconfig loadedpackages\
| [concat [msgcat::mcconfig loadedpackages] namespace current]
|
| proc loadMessages args {
| foreach locale $args {
| if {[LocaleInDB $locale]} {
| msgcat::mcmset $locale [GetLocaleList $locale]
| }
| }
| }
|}

~~ Use a package locale

The reference implementation also contains a changed clock command which uses a package locale. Here are some sketches from the implementation.

First, a package locale is initialized and the generic unknown function is activated:

|msgcat::mcpackagelocale set
|msgcat::mcpackageconfig unknowncmd ""

If the user requires the week day in a certain locale, it is changed:

|clock format clock seconds -format %A -locale fr

and the code:

|msgcat::mcpackagelocale set $locale
|return [lindex [msgcat::mc DAYS_OF_WEEK_FULL] $day]
|### Returns "mercredi"

Some message-catalog items are heavy in computation and thus are dynamically cached using:

|proc ::tcl::clock::LocalizeFormat { locale format } {
| set key FORMAT_$format
| if { [::msgcat::mcexists -exactlocale -exactnamespace $key] } {
| return [mc $key]
| }

| #...expensive computation of format clipped...
| mcset $locale $key $format
| return $format
|}

~ Reference Implementation

See Tcl fossil tag msgcat_dyn_locale [1].

~ Compatibility

Imagined incompatibilities:

    If packages call mcload multiple times with different folders, the

     data was currently appended. This is still the case, but only the last
     folder is used for any reload. The property '''mcfolder''' may be
     transformed to a list to cover this case.

    The return value of mcload (file count) may be much higher as there

     may be loaded much more files. I suppose, this value is only used by the
     test suite to verify functionality and is not for big general use.

    Message files may not be aware, that they may be loaded at any moment and

     not only after their own '''mcload'''. I suppose, this is the biggest
     issue but I think, there is no alternative.

    Message files do not get reloaded any more, if a second mcload is

     issued with the same path argument.

    Package which temporary change the default locale trigger any callback

     and may lead to user visible side effects.

~ Issues

Known issues:

    Packages might not be aware of a locale change and may buffer

     translations outside of '''msgcat'''. Packages should not buffer msgcat
     messages if they are used in a dynamic locale application (like tklib
     tooltip does for example).

    The clock command currently has a small dynamic patch for msgcat

     implemented. This must be removed in favor to new msgcat features due to
     the temporarily change of the default locale.

~ Extensions

    Expose the function to calculate the preference list from a given locale.
    Load a message catalog file for a given locale without changing the

     default/package locale.

    Methods isloaded to check if a locale is currently loaded.
    Access message catalog with specified namespace, locale and search

     behavior.

~ Alternatives

The alternative is the former [399], but that is problematic because the list of locales must be known before any package load. The additional complexity of this TIP is a justifiable trade-off against the greatly improved flexibility in the loading and locale selection order.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|
|

|
|

|
|
|

|
|

|

|

|

|

|
|

|

|
|
|
|
|
<
<
>
>
|
|
|
|

|

|

|
|
|
|
|
|
|
|
|
<
<
<
<
|
>
>
>
>
|

|
|

|

|
|
|

|
|
|
|
<
>
|
|
|
<
|
>
|

|

|

|

|

|

|

|
|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351

352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371

372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401

402
403
404
405

406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475

# TIP 412: Dynamic Locale Changing for msgcat with On-Demand File Load

	Author:         Harald Oehlmann <[email protected]>
	Author:         Harald Oehlmann <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        27-Mar-2012
	Post-History:   
	Keywords:       Tcl,localization,msgcat
	Obsoletes:      399
	Tcl-Version:    8.6
-----

# Abstract

This TIP adds dynamic locale switching capabilities to the **msgcat**
package.

# Rationale

## Dynamic Locale Switching

Within a multi-language application like a web-server, one may change the locale quite frequently, for example if users with different locales are requesting pages. Unfortunately, this does not fit well with the model adopted by the msgcat package, which assumes that all code follows this sequence:

 1.
 Set locale list: **mclocale** _locale_

 2.
 Load language files with other package load: **mcload** _msg-folder_

 3.
 Translate strings: **mc** _key args..._

Note that if the locale should be changed after other packages are loaded, one must restart at step 2. This requires reloading all packages which is mostly not practical.

The aim of this TIP is to extend the package by dynamic locale change capabilities.

msgcat will reload any missing message catalog files of all currently loaded packages on a locale change. In addition, any package may register to get informed to a locale change. Other packages may do changes to reflect the locale change like rebuilding the GUI.

This TIP compares to [[399]](399.md) that the package is able to load message catalog files on demand, e.g. specially on a locale change.

## package locale

If the clock command gets called with the argument "-locale", the locale is changed using msgcat::mclocale. After processing, the initial value is restored. The package keeps track, which locales where already used and calls msgcat::mcload for any new locale. The locale is restored after processing.

This is an implementation of dynamic locales but conflicts with the new features described above. Other packages may be informed to change the locale and may trigger expensive operations like a rebuild of the GUI.

In consequence, each package may define a package locale which is independent of the default locale.

# Overview of the proposed solution

Proposed changes in brief:

## Dynamically load message catalog files

if the locale is changed by mclocale locale, the message file load process is executed for every present package.

## Package locale

A package may install a package local locale which is independent to the global locale.

## Locale change callback

A callback may be registered to get informed about the change of locale. A use case is to refresh a GUI if the locale changed.

## Non message file operation

A program may use message files to issue mcset commands or may issue them by other means, if the message catalogs are, for example, stored in a data base.

Each package may register a callback to get informed that a certain locale should be loaded and may issue the corresponding mcset commands.

## Package mcunknown

A package may have a certain way to provide translations for message keys not included in the message catalog. Thus, it may register an own package message unknown callback to provide a translation.

# Specification

## Package Equals Client Namespace

A client package is a package which uses msgcat. A unique namespace is required for each client package. Within msgcat, namespace and package is always connected.

Up to now, the msgcat package used this namespace as an identifier to store the catalog data of a certain package. This is now extended to additional properties which are stored for a package.

## Package locale

A package locale may be used by a package instead the default locale set by msgcat::mclocale. A package may choose to use a package locale or the default locale.

## Default and Package State

Some state values \(like the locale\) are available as default \(global\) values. In addition, each package may choose to use a package locale state.

The used naming is:

    default

     state: valid for all packages which do not set a package state.

    package

     state: only valid for one package if it has set a package state.

The following state values are present as default state and may be set individually per package:

    The locale like "de\_ch".
    The preferences property is a list of locales in their preference order and is automatically computed from locale. Example locale = "de\_ch" -> preferences = "de\_ch de \{\}".
    The loadedlocales state value is the list of currently loaded locales.

## Default State

The following standard methods exist to get or set the default state:

### msgcat::mclocale

The default locale. It may be read using msgcat::mclocale.

It may be set using msgcat::mclocale locale. This command is extended, that the message catalogs of all missing locales for all packages not having set a package state are loaded.

### msgcat::mcpreferences

Get the default preferences \(derived from the default locale\).

### msgcat::mcloadedlocales

The following new command may be used to deal with the default state:

 > **msgcat::mcloadedlocales** _subcommand_ _?locale?_

The parameter locale is mandatory for the subcommand present.

The following subcommands are available:

### Subcommand "get"

Get the list of current loaded locales

### Subcommand "present"

Returns true, if the given locale is loaded

### Subcommand "clear"

The list of currently loaded locales is set to mcpreferences and all message catalog keys of packages without a package locale set and with locales not in mcpreferences are unset.

## Package Configuration

The package configuration of the calling package may be changed using the following new command:

 > **msgcat::mcpackagelocale** _subcommand_ ?_locale_?

The parameter locale is mandatory for the subcommands set and present.

Available subcommands are:

### Subcommand "set"

Set or change the package locale.

The global state values are copied, if there were no package locale set before.

The package locale is changed to the optional given new package locale.

### Subcommand "get"

Return the package locale or the default locale, if no package locale set.

### Subcommand "preferences"

Return the package preferences or the default preferences, if no package locale set.

### Subcommand "loaded"

The list of locales loaded for this package is returned.

### Subcommand "isset"

Returns true, if a package locale is set.

### Subcommand "unset"

Unset the package locale and use the default state for the package. Load all message catalog files of the package for locales, which were not present in the package loadedlocales list and are present in the default list.

### Subcommand "present"

Returns true, if the given locale is loaded

### Subcommand "clear"

Set the current loaded locales list of the package to preferences and unset all message catalog keys of the package with locales not included in the package preferences.

## Package Configuration Options

Each package may have a set of configuration options set to invoke certain actions. They may be retrieved or changed with the following new command:

 > **msgcat::mcpackageconfig** _subcommand option_ ?_value_?

Available subcommands are:

 get: Get the current value of the option or an error if not set.

 isset: Returns true if option is set.

 set: Set the given value to the option. May have additional consequences and
   return values as described in the option section.

 unset: Unset the option.

Available options are:

### Package Option "mcfolder"

This is the message folder of the package. This option is set by mcload and by the subcommand set. Both are identical and both return the number of loaded message catalog files.

Setting or changing this value will load all locales contained in the preferences valid for the package. This implies also to invoke any set loadcmd \(see below\).

Unsetting this value will disable message file load for the package.

If the locale valid for this package changes, this value is used to eventually load message catalog files.

Message catalog files are always sourced in the namespace of the package registering the value.

### Package Option "loadcmd"

This callback is invoked before a set of message catalog files are loaded for the package which has this property set.

This callback may be used to do any preparation work for message file load or to get the message data from another source like a data base. In this case, no message files are used \(mcfolder is unset\).

See chapter callback invocation below. The parameter list appended to this callback is the list of locales to load.

If this callback is changed, it is called with the preferences valid for the package.

### Package Option "changecmd"

This callback is invoked when a default local change was performed. Its purpose is to allow a package to update any dependency on the default locale like showing the GUI in another language.

Tk may be extended to register to this callback and to invoke a virtual event.

See the callback invocation section below. The parameter list appended to this callback is mcpreferences. All registered packages are invoked in no particular order.

### Package Option "unknowncmd"

Use a package locale mcunknown procedure instead of the standard version supplied by the msgcat package \(msgcat::mcunknown\).

The called procedure must return the formatted message which will finally be returned by msgcat::mc.

A generic unknown handler is used if set to the empty string. This consists in returning the key if no arguments are given. With given arguments, format is used to process the arguments.

See chapter callback invocation below. The appended arguments are identical to mcunknown.

### Callback Invocation

Callbacks are invoked under the following conditions:

    the callback command is set,
    the command is not the empty string,
    the registration namespace exists.

Any error within the callback stops the operation which invoked the callback. This might be surprising, as the error might be in another package.

## Test if Message Key is Set

Message catalog keys may be expensive to calculate and thus may be set on demand.

The following new procedure returns false, if mc would call mcunknown for a key:

 > **msgcat::mcexists** _src_

There are two options, to limit the key search to just the current namespace \(don't search in parent namespaces\) and just the current locale \(don't search the preferences but the first item\):

 > **msgcat::mcexists** ?**-exactnamespace**? ?_-exactlocale_? _src_

## forget package

A package may clear all its keys and state using the new command:

 > **msgcat::mcforgetpackage**

## Locale and Preferences Format

Locales set by mcset may eventually not correspond to the current preferences, as the preferences are treated as follows:

    put to lower case,
    remove any multiple "\_" and any "\_" at the beginning or at the end of the

     locale.

It is proposed, that:

    the locale and the first preferences element is always identical to the

      lowercase passed locale,

    any multiple "\_" are seen as one separator.

Example: preferences of locale "sy\_\_cyrl\_win"

    current preferences: "sy\_cyrl\_win sy\_cyrl sy"
    proposed preferences: "sy\_\_cyrl\_win sy\_\_cyrl sy".

Alternatively, all locales may normalized using the upper algorithm, which felt heavy in computation with little gain.

# Example Usage

## Example from TIP \#399

Imagine an application which supports the current user language and French, German and English. An external package tp is used. The package uses msgcat and installs itself during the package require tp call:

	package require msgcat
	msgcat::mcload [file join [file dirname [info script]] msgs]

An implementation of the application with the current msgcat 1.5.0 would require the following initialization sequence:

	package require msgcat
	package require np

and the following code to change the locale to French:

	package forget np
	msgcat::mclocale fr
	package require np

Using the extension of this TIP, one may load as usual:

	package require msgcat
	package require np

and to change to french locale:

	msgcat::mclocale fr

The first time, a locale is required, all corresponding message files of all packages which use msgcat get loaded. This might be a heavy operation.

If a locale is reactivated \(and the message catalog data was not cleared\), it is a quick operation.

Without this TIP, it is computational expensive \(if possible, as many packages are not reloadable or a reload may disturb current processing, e.g., by forcing the closing of sockets, etc.\).

## Change with No Need to Come Back

If it is certain that a locale is changed and the then obsolete data is of no use, one may clear unused message catalog items:

	msgcat::mclocale fr
	msgcat::mcloadedlocale clear

## Use a Callback to be Notified About a Locale Change

Packages which display a GUI may update their widgets when the locale changes. To register to a callback, use:

	namespace eval gui {
	 msgcat::mcpackageconfig changecmd updateGUI

	 proc updateGui args {
	 puts "New locale is '[lindex $args 0]'."

	 }
	}

	% msgcat::mclocale fr
	fr
	% New locale is 'fr'.

## To Use Another Locale Source than Message Catalog Files

If locales \(or additional locales\) are contained in another source like a data base, a package may use the load callback and not mcload:

	namespace eval db {
	 msgcat::mcpackageconfig loadcmd loadMessages
	 msgcat::mcconfig loadedpackages\
	 [concat [msgcat::mcconfig loadedpackages] namespace current]

	 proc loadMessages args {
	 foreach locale $args {
	 if {[LocaleInDB $locale]} {
	 msgcat::mcmset $locale [GetLocaleList $locale]

	 }
	 }
	 }
	}

## Use a package locale

The reference implementation also contains a changed clock command which uses a package locale. Here are some sketches from the implementation.

First, a package locale is initialized and the generic unknown function is activated:

	msgcat::mcpackagelocale set
	msgcat::mcpackageconfig unknowncmd ""

If the user requires the week day in a certain locale, it is changed:

	clock format clock seconds -format %A -locale fr

and the code:

	msgcat::mcpackagelocale set $locale
	return [lindex [msgcat::mc DAYS_OF_WEEK_FULL] $day]
	### Returns "mercredi"

Some message-catalog items are heavy in computation and thus are dynamically cached using:

	proc ::tcl::clock::LocalizeFormat { locale format } {
	 set key FORMAT_$format
	 if { [::msgcat::mcexists -exactlocale -exactnamespace $key] } {
	 return [mc $key]

	 }
	 #...expensive computation of format clipped...
	 mcset $locale $key $format
	 return $format

	}

# Reference Implementation

See Tcl fossil tag msgcat\_dyn\_locale [[1]](1.md).

# Compatibility

Imagined incompatibilities:

    If packages call mcload multiple times with different folders, the

     data was currently appended. This is still the case, but only the last
     folder is used for any reload. The property **mcfolder** may be
     transformed to a list to cover this case.

    The return value of mcload \(file count\) may be much higher as there

     may be loaded much more files. I suppose, this value is only used by the
     test suite to verify functionality and is not for big general use.

    Message files may not be aware, that they may be loaded at any moment and

     not only after their own **mcload**. I suppose, this is the biggest
     issue but I think, there is no alternative.

    Message files do not get reloaded any more, if a second mcload is

     issued with the same path argument.

    Package which temporary change the default locale trigger any callback

     and may lead to user visible side effects.

# Issues

Known issues:

    Packages might not be aware of a locale change and may buffer

     translations outside of **msgcat**. Packages should not buffer msgcat
     messages if they are used in a dynamic locale application \(like tklib
     tooltip does for example\).

    The clock command currently has a small dynamic patch for msgcat

     implemented. This must be removed in favor to new msgcat features due to
     the temporarily change of the default locale.

# Extensions

    Expose the function to calculate the preference list from a given locale.
    Load a message catalog file for a given locale without changing the

     default/package locale.

    Methods isloaded to check if a locale is currently loaded.
    Access message catalog with specified namespace, locale and search

     behavior.

# Alternatives

The alternative is the former [[399]](399.md), but that is problematic because the list of locales must be known before any package load. The additional complexity of this TIP is a justifiable trade-off against the greatly improved flexibility in the loading and locale selection order.

# Copyright

This document has been placed in the public domain.

Name change from tip/413.tip to tip/413.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123

TIP:            413
Title:          Unicode Support for 'string is space' and 'string trim'
Version:        $Revision: 1.5 $
Author:         Jan Nijtmans <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        08-Oct-2012
Post-History:   
Discussions-To: Tcl Core list
Keywords:       Tcl
Tcl-Version:    8.6

~ Abstract

This TIP is in fact a re-consideration of [318], in that it attempts to
define, once and for all, for which characters '''string is space''' should
return 1 and which characters '''string trim''' should trim.

~ Rationale

Intuitively, '''string is space''' and '''string trim''' should treat the same
characters as space, but currently that's not the case, even after the
implementation of [318].  The unicode standard advanced to version 6.2 now (at
the time of this writing), but also Java and .NET have their own views on what
whitespace should be. Let's try to learn from them.

~~ Defining the Tcl Space Set

The NUL character has the function as string separator, which could be
considered in the same group as LINE SEPARATOR (U+2028) and PARAGRAPH
SEPARATOR (U+2029). It's a very useful character to be stripped. It even had
the Whitespace property in Unicode 2.0. The problem with considering this
character as space is that its visible representation is not specified, it
even should not occur in normal text. Therefore, it is not in the "Tcl space
set", but it is very useful to let it be stripped by '''string trim'''.

The Unicode standard changed in time, which resulted in whitespace characters
being removed (deprecated) and added.  The ''String.Trim()'' method in .NET
3.5 stripped zero width space (U+200B) and zero width no-break space (U+FEFF)
from strings, but later .NET versions don't do that any more.  The "Tcl space
set" should not depend on that: If characters are deprecated in future Unicode
versions, and because of that Whitespace properties are changed, they will not
be removed from the "Tcl space set". But if new whitespace characters are
added in future Unicode standards, they will be added to the "Tcl space set"
as well, influencing both '''string is space''' and '''string trim'''.

The 3 characters that are in the "Tcl space set" but not in the current
Unicode whitespace set are discussed now.

Most obvious is zero width no-break space (U+FEFF), which is a very useful
character to be stripped, as it is used now as Byte Order Mark (BOM). It has
no visible representation, and - in fact - no meaning at all within Tcl, as
Tcl is UTF-8 internally already. It should not occur anywhere else in the
string, but in the past it could as being a zero-width no-break space. It had
the ''White_Space'' property in Unicode 2.0, but later versions of Unicode do
not; the use of the BOM as a space was deprecated.

When the use as space was deprecated for (U+FEFF), another character was put
forward as replacement for it: word joiner (U+2060). As this character has no
visible representation, and has no meaning at all when at the start or the end
of a string, it makes sense to include it in the "Tcl space set" as well, the
more because its predecessor had the ''White_Space'' property.

Finally, zero width space (U+200B), had the ''White_Space'' property in
Unicode 3.0. In the current Unicode Charts it is still listed as being a
space, even though the White_Space property was removed later. Therefore it
should be in the "Tcl space set" as well.

~ Specification

This document proposes:

 * For the ASCII set, '''string is space''' stays as is.  '''string trim'''
   will be modified to trim all characters for which '''string is space'''
   returns 1, augmented with the NUL character. This means that NUL, VT and FF
   will be added to the set. This is a '''potential incompatibility'''.

 * For characters outside ASCII, the Unicode '''White_Space'''
   [http://www.unicode.org/Public/6.2.0/ucd/PropList.txt] property forms the
   basis of what '''string is space''' and '''string trim''' consider being
   space. But 3 characters are added to the set: zero with space (U+200B),
   word joiner (U+2060) and zero width no-break space (U+FEFF) (i.e., the
   BOM).

The '''string trimleft''' and '''string trimright''' commands will also be
modified, as they track '''string trim'''.

~ Compatibility

For the ASCII set, the only change is the addition of 3 characters to
'''string trim'''. For Unicode there are more changes, but all added
characters are either rarely used, either intuitively expected to be trimmed
by '''string trim'''.  I don't think that any code will be adversely affected
by this change, it will probably fix more bugs than that it breaks any
existing code.

~ Alternatives

 1. NUL could be added to '''string is space''', but that would
    be in conflict with what POSIX ''isspace()'' function does.

 2. NUL could be left out of the '''string trim''' set.

 3. Additional characters I considered being part of the set:

|    break permitted here (U+0082)
|    no break here (U+0083)
|    zero width joiner (U+200C)
|    zero with non-joiner (U+200D)

  > Those are clearly useful characters to be stripped, as they have no
    meaning and no visible appearance at the beginning or end of a string. But
    they are not spaces, so it would diverge the two commands.

~ Reference Implementation

A reference implementation is available in the Tcl fossil repository on the
''tip-318-update'' branch [https://core.tcl.tk/tcl/timeline?r=tip-318-update].

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|
|
|

|

|

|
|

|

|
|

|

|
|

|

|
|

|

|
|

|

|

|

|

|
|

|

|
|
|
|
|
|

|
|

|

|

|

|

|
|

|

|
|
|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123

# TIP 413: Unicode Support for 'string is space' and 'string trim'

	Author:         Jan Nijtmans <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        08-Oct-2012
	Post-History:   
	Discussions-To: Tcl Core list
	Keywords:       Tcl
	Tcl-Version:    8.6
-----

# Abstract

This TIP is in fact a re-consideration of [[318]](318.md), in that it attempts to
define, once and for all, for which characters **string is space** should
return 1 and which characters **string trim** should trim.

# Rationale

Intuitively, **string is space** and **string trim** should treat the same
characters as space, but currently that's not the case, even after the
implementation of [[318]](318.md).  The unicode standard advanced to version 6.2 now \(at
the time of this writing\), but also Java and .NET have their own views on what
whitespace should be. Let's try to learn from them.

## Defining the Tcl Space Set

The NUL character has the function as string separator, which could be
considered in the same group as LINE SEPARATOR \(U\+2028\) and PARAGRAPH
SEPARATOR \(U\+2029\). It's a very useful character to be stripped. It even had
the Whitespace property in Unicode 2.0. The problem with considering this
character as space is that its visible representation is not specified, it
even should not occur in normal text. Therefore, it is not in the "Tcl space
set", but it is very useful to let it be stripped by **string trim**.

The Unicode standard changed in time, which resulted in whitespace characters
being removed \(deprecated\) and added.  The _String.Trim\(\)_ method in .NET
3.5 stripped zero width space \(U\+200B\) and zero width no-break space \(U\+FEFF\)
from strings, but later .NET versions don't do that any more.  The "Tcl space
set" should not depend on that: If characters are deprecated in future Unicode
versions, and because of that Whitespace properties are changed, they will not
be removed from the "Tcl space set". But if new whitespace characters are
added in future Unicode standards, they will be added to the "Tcl space set"
as well, influencing both **string is space** and **string trim**.

The 3 characters that are in the "Tcl space set" but not in the current
Unicode whitespace set are discussed now.

Most obvious is zero width no-break space \(U\+FEFF\), which is a very useful
character to be stripped, as it is used now as Byte Order Mark \(BOM\). It has
no visible representation, and - in fact - no meaning at all within Tcl, as
Tcl is UTF-8 internally already. It should not occur anywhere else in the
string, but in the past it could as being a zero-width no-break space. It had
the _White\_Space_ property in Unicode 2.0, but later versions of Unicode do
not; the use of the BOM as a space was deprecated.

When the use as space was deprecated for \(U\+FEFF\), another character was put
forward as replacement for it: word joiner \(U\+2060\). As this character has no
visible representation, and has no meaning at all when at the start or the end
of a string, it makes sense to include it in the "Tcl space set" as well, the
more because its predecessor had the _White\_Space_ property.

Finally, zero width space \(U\+200B\), had the _White\_Space_ property in
Unicode 3.0. In the current Unicode Charts it is still listed as being a
space, even though the White\_Space property was removed later. Therefore it
should be in the "Tcl space set" as well.

# Specification

This document proposes:

 * For the ASCII set, **string is space** stays as is.  **string trim**
   will be modified to trim all characters for which **string is space**
   returns 1, augmented with the NUL character. This means that NUL, VT and FF
   will be added to the set. This is a **potential incompatibility**.

 * For characters outside ASCII, the Unicode **White\_Space**
   <http://www.unicode.org/Public/6.2.0/ucd/PropList.txt>  property forms the
   basis of what **string is space** and **string trim** consider being
   space. But 3 characters are added to the set: zero with space \(U\+200B\),
   word joiner \(U\+2060\) and zero width no-break space \(U\+FEFF\) \(i.e., the
   BOM\).

The **string trimleft** and **string trimright** commands will also be
modified, as they track **string trim**.

# Compatibility

For the ASCII set, the only change is the addition of 3 characters to
**string trim**. For Unicode there are more changes, but all added
characters are either rarely used, either intuitively expected to be trimmed
by **string trim**.  I don't think that any code will be adversely affected
by this change, it will probably fix more bugs than that it breaks any
existing code.

# Alternatives

 1. NUL could be added to **string is space**, but that would
    be in conflict with what POSIX _isspace\(\)_ function does.

 2. NUL could be left out of the **string trim** set.

 3. Additional characters I considered being part of the set:

		    break permitted here (U+0082)
		    no break here (U+0083)
		    zero width joiner (U+200C)
		    zero with non-joiner (U+200D)

	  > Those are clearly useful characters to be stripped, as they have no
    meaning and no visible appearance at the beginning or end of a string. But
    they are not spaces, so it would diverge the two commands.

# Reference Implementation

A reference implementation is available in the Tcl fossil repository on the
_tip-318-update_ branch <https://core.tcl.tk/tcl/timeline?r=tip-318-update> .

# Copyright

This document has been placed in the public domain.

Name change from tip/414.tip to tip/414.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51

TIP:            414
Title:          Add (back) Tcl_InitSubsystems as Public API
Version:        $Revision: 1.25 $
Author:         Brian Griffin <[email protected]>
Author:         Jan Nijtmans <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        15-Oct-2012
Post-History:   
Tcl-Version:    8.7

~ Abstract

The ability to initialize just the lower level Tcl subsystems used to be part
of the public API, now it is no longer exposed. This TIP proposes that it be
re-exposed.

~ Rationale

Some parts of Tcl's API are useful in portable applications even without
creating a Tcl interpreter; examples of this include Tcl_Alloc and (most of)
the Tcl_DString-related functions. In order to use these functions correctly,
the Tcl library ''must'' be initialized, yet the function for doing so -
Tcl_InitSubsystems (currently TclInitSubsystems) - was removed from Tcl's API;
using Tcl_FindExecutable instead feels incorrect as we're not seeking to make
the name of the executable available to Tcl scripts.

~ Proposed Change

A new function Tcl_InitSubsystems, similar to the internal TclInitSubsystems,
should be exposed as alternative to Tcl_FindExecutable in Tcl's C API. This
will ''not'' be a part of the Stub API; it is not intended to ever be used
from an initialized stubbed environment, as it is meant to be used prior to
the stub table being available. It has a single argument, ''panicProc''.
When NULL, the default panic function is used. The full signature is:

 > EXTERN const char *
   '''Tcl_InitSubsystems'''(
       Tcl_PanicProc *''panicProc'');

The return value of ''Tcl_InitSubsystems'' is the Tcl version.

~ Reference Implementation

A reference implementation is available in the '''initsubsystems''' branch.
[http://core.tcl.tk/tcl/info/3c9828933f]

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|
|
|
|
|

|

|
|
|

|

|
|
|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51

# TIP 414: Add (back) Tcl_InitSubsystems as Public API

	Author:         Brian Griffin <[email protected]>
	Author:         Jan Nijtmans <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        15-Oct-2012
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

The ability to initialize just the lower level Tcl subsystems used to be part
of the public API, now it is no longer exposed. This TIP proposes that it be
re-exposed.

# Rationale

Some parts of Tcl's API are useful in portable applications even without
creating a Tcl interpreter; examples of this include Tcl\_Alloc and \(most of\)
the Tcl\_DString-related functions. In order to use these functions correctly,
the Tcl library _must_ be initialized, yet the function for doing so -
Tcl\_InitSubsystems \(currently TclInitSubsystems\) - was removed from Tcl's API;
using Tcl\_FindExecutable instead feels incorrect as we're not seeking to make
the name of the executable available to Tcl scripts.

# Proposed Change

A new function Tcl\_InitSubsystems, similar to the internal TclInitSubsystems,
should be exposed as alternative to Tcl\_FindExecutable in Tcl's C API. This
will _not_ be a part of the Stub API; it is not intended to ever be used
from an initialized stubbed environment, as it is meant to be used prior to
the stub table being available. It has a single argument, _panicProc_.
When NULL, the default panic function is used. The full signature is:

 > EXTERN const char \*
   **Tcl\_InitSubsystems**\(
       Tcl\_PanicProc \*_panicProc_\);

The return value of _Tcl\_InitSubsystems_ is the Tcl version.

# Reference Implementation

A reference implementation is available in the **initsubsystems** branch.
<http://core.tcl.tk/tcl/info/3c9828933f> 

# Copyright

This document has been placed in the public domain.

Name change from tip/415.tip to tip/415.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68

69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91

92
93
94
95
96
97
98
99
100
101
102

103
104
105
106
107
108
109
110
111
112
113
114

TIP:            415
Title:          Enable Easy Creation of Circular Arc Segments
Version:        $Revision: 1.11 $
Author:         Simon Geard <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        16-Oct-2012
Post-History:   
Keywords:       Tk
Tcl-Version:    8.7

~ Abstract

Creating a segment of a circular arc is unnecessarily difficult using the
'''canvas''' arc. This TIP proposes a simple extension of the syntax to
support the creation of circular arc segments in a natural way. A similar
extension to support the more general elliptical arc segments is outside the
scope of this TIP.

~ Rationale

There is scope to enhance arc creation to make it much more useful as was
shown by a recent discussion on news:comp.lang.tcl. The proposal here is the
simplest enhancement to enable creation of circular arc segments from a single
parameter.

~ Proposal

Enhance arc creation to support a new '''-height''' option

 > ''canvas'' '''create arc''' ''x1 y1 x2 y2'' '''-height''' ''h'' ?''options''?

The new option '''-height''' ''h'' causes the specified coordinates ''x1 y1'' and '' x2 y2'' to be interpreted as the
start and end points of the arc's chord. The value of ''h'' is the (canvas) distance of the arc's
mid-point from the chord with the sign of ''h'' determining the direction of the arc:

''h'' > 0 => clockwise
''h'' < 0 => anticlockwise

If ''h'' != 0 then the options ''-start'' and ''-extent'' are ignored (because they are calculated internally for a given ''h'').

Any non-zero value of ''h'' defines a unique arc.

If ''h'' = 0 (exactly) the option is ignored and the command is processed as if it wasn't present. In addition

 > ''canvas'' '''itemcget''' ''tagOrId'' ''--height'''

will always return 0. This behaviour enables introspection without complications. A consequence is that

 > ''canvas'' '''itemconfigure''' ''tagOrId'' ''--height''' ''0''

 is a no-op.

~ Example

The following code shows the creation of arcs using the new method, copying them onto another canvas
and using a '''scale''' widget to dynamically control the arcs

|# Callback for modifying the arcs' h value
|proc deltaHeight {h} {
|	global c
|	global arcList
|	foreach {i hp hm} $arcList {
|		$c itemconfigure a_$i -height [expr {$h*$hp}]
|		$c itemconfigure b_$i -height [expr {$h*$hm}]
|	}
|}

|
|# Create the canvas and its duplicate
|set c [canvas .c -width 700 -height 700 -bg grey]
|set cc [canvas .cc -width 700 -height 700 -bg grey]
|pack $c $cc -fill both -expand 1 -side left
|
|# Pretty colours
|array set colours {0 red 1 yellow 2 green 3 cyan 4 blue 5 magenta}
|
|# A slider with which to adjust h
|set lh 1; # Initial setting for scale
|set s [scale .s -from 0.1 -to 15 -resolution 0.1 -variable lh -orient vertical -length 700 -command deltaHeight]
|pack $s -side right -fill y
|
|# Create the arcs
|for {set i 1} {$i <= 24} {incr i} {
|	set col [expr {$i % 6}]
|	set hp [expr {$i*10}]
|	set hm [expr {-$i*10}]
|	lappend arcList $i $hp $hm
|	$c create arc 300 200 400 400 -height [expr {$i*10}] -outline $colours($col) -style arc -tags [list aa a_$i]
|	$c create arc 300 200 400 400 -height [expr {-$i*10}] -outline $colours($col) -style arc -tags [list aa b_$i]
|}

|
|# Serialize
|set fh [open "ccopy.tcl" w]
|foreach id [$c find withtag aa] {
|    puts $fh "\$cc create arc [$c coords $id] \
|		-height [$c itemcget $id -height]\
|		-start [$c itemcget $id -start] \
|		-extent [$c itemcget $id -extent] \
|		-outline [$c itemcget $id -outline] \
|		-style [$c itemcget $id -style]"
|}

|close $fh
|
|# Create copy from serialization
|source "ccopy.tcl"

~ Reference Implementation

A reference implementation for the functionality is available.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|
|
|

|
|

|

|

|

|

|

|

|

|
|
|
|
|
|
|
<
<
>
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65

66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89

90
91
92
93
94
95
96
97
98
99
100

101
102
103
104
105
106
107
108
109
110
111
112
113
114

# TIP 415: Enable Easy Creation of Circular Arc Segments

	Author:         Simon Geard <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        16-Oct-2012
	Post-History:   
	Keywords:       Tk
	Tcl-Version:    8.7
-----

# Abstract

Creating a segment of a circular arc is unnecessarily difficult using the
**canvas** arc. This TIP proposes a simple extension of the syntax to
support the creation of circular arc segments in a natural way. A similar
extension to support the more general elliptical arc segments is outside the
scope of this TIP.

# Rationale

There is scope to enhance arc creation to make it much more useful as was
shown by a recent discussion on news:comp.lang.tcl. The proposal here is the
simplest enhancement to enable creation of circular arc segments from a single
parameter.

# Proposal

Enhance arc creation to support a new **-height** option

 > _canvas_ **create arc** _x1 y1 x2 y2_ **-height** _h_ ?_options_?

The new option **-height** _h_ causes the specified coordinates _x1 y1_ and _ x2 y2_ to be interpreted as the
start and end points of the arc's chord. The value of _h_ is the \(canvas\) distance of the arc's
mid-point from the chord with the sign of _h_ determining the direction of the arc:

_h_ > 0 => clockwise
_h_ < 0 => anticlockwise

If _h_ != 0 then the options _-start_ and _-extent_ are ignored \(because they are calculated internally for a given _h_\).

Any non-zero value of _h_ defines a unique arc.

If _h_ = 0 \(exactly\) the option is ignored and the command is processed as if it wasn't present. In addition

 > _canvas_ **itemcget** _tagOrId_ _--height**

will always return 0. This behaviour enables introspection without complications. A consequence is that

 > _canvas_ **itemconfigure** _tagOrId_ _--height** _0_

 is a no-op.

# Example

The following code shows the creation of arcs using the new method, copying them onto another canvas
and using a **scale** widget to dynamically control the arcs

	# Callback for modifying the arcs' h value
	proc deltaHeight {h} {
		global c
		global arcList
		foreach {i hp hm} $arcList {
			$c itemconfigure a_$i -height [expr {$h*$hp}]
			$c itemconfigure b_$i -height [expr {$h*$hm}]

		}
	}

	# Create the canvas and its duplicate
	set c [canvas .c -width 700 -height 700 -bg grey]
	set cc [canvas .cc -width 700 -height 700 -bg grey]
	pack $c $cc -fill both -expand 1 -side left

	# Pretty colours
	array set colours {0 red 1 yellow 2 green 3 cyan 4 blue 5 magenta}

	# A slider with which to adjust h
	set lh 1; # Initial setting for scale
	set s [scale .s -from 0.1 -to 15 -resolution 0.1 -variable lh -orient vertical -length 700 -command deltaHeight]
	pack $s -side right -fill y

	# Create the arcs
	for {set i 1} {$i <= 24} {incr i} {
		set col [expr {$i % 6}]
		set hp [expr {$i*10}]
		set hm [expr {-$i*10}]
		lappend arcList $i $hp $hm
		$c create arc 300 200 400 400 -height [expr {$i*10}] -outline $colours($col) -style arc -tags [list aa a_$i]
		$c create arc 300 200 400 400 -height [expr {-$i*10}] -outline $colours($col) -style arc -tags [list aa b_$i]

	}

	# Serialize
	set fh [open "ccopy.tcl" w]
	foreach id [$c find withtag aa] {
	    puts $fh "\$cc create arc [$c coords $id] \
			-height [$c itemcget $id -height]\
			-start [$c itemcget $id -start] \
			-extent [$c itemcget $id -extent] \
			-outline [$c itemcget $id -outline] \
			-style [$c itemcget $id -style]"

	}
	close $fh

	# Create copy from serialization
	source "ccopy.tcl"

# Reference Implementation

A reference implementation for the functionality is available.

# Copyright

This document has been placed in the public domain.

Name change from tip/416.tip to tip/416.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136

TIP:            416
Title:          New Options for 'load': -global and -lazy
Version:        $Revision: 1.5 $
Author:         Christian Delbaere <[email protected]>
Author:         Jan Nijtmans <[email protected]>
Author:         Jan Nijtmans <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        31-Oct-2012
Post-History:   
Tcl-Version:    8.6

~ Abstract

This TIP proposes enhancing the Tcl '''load''' command with the additional
options '''-global''' and '''-lazy'''. It is implemented on top of [357], by
defining a meaning to the '''flags''' parameter already defined there.

~ Rationale

Platforms that use the ''dlopen()'' function to Tcl '''load''' shared modules
at runtime provide options to control how the library is loaded:

 * global vs. local symbol scoping

 * lazy vs. now symbol resolution

Currently, Tcl's '''load''' command has hard coded defaults for these options
and they cannot be overridden within a Tcl script.  This imposes constraints on
the internal implementation of the modules intended to be loaded into the
interpreter.  This is especially problematic for modules that provide Tcl
scripting bindings for existing C++ APIs.  Often, the C++ APIs make
assumptions about the availability and scoping of their symbols.

Tcl binding packages for C++ APIs are often created by a different development
group than the one that created the original C++ API.  Because the two groups
are independent, the C++ API maintainers will not always be open or able to
change their code to fit the requirements to be loaded into a scripting
language.

A common problem occurs when the same static variable is present in two
different Tcl modules.  For some applications, the variable is meant to be
shared across modules (global scoping), while in other applications, the
variable must have its own value within each module (local scoping).  If the
wrong scoping is chosen, the underlying code will not work correctly; rather
it will yield strange bugs and / or crashes.

Also in the domain of Tcl bindings for C++ APIs: it's convenient for the
binding package maintainers to have binary compatibility between one version
of the Tcl API and several versions of the C++ API.  The '''-lazy''' flag for
Tcl's load command will provides the feature necessary for this flexibility,
since it can be used to defer missing symbol errors.  So, users can often
continue to run their scripts as long as they restrict themselves to calling
only commands where the symbols are available.

Of course, some applications work best when '''load''' is called with
'''-global''' and some work best without it.  The same can be said for
'''-lazy'''.  By providing these options, Tcl will allow programmers to choose
the best fit for their application.

~ Specification

In [357], the ''Tcl_LoadFile'' is given as:

 > EXTERN int
   '''Tcl_LoadFile'''(
       Tcl_Interp *''interp'',
       Tcl_Obj *''pathPtr'',
       const char *''symbols''[],
       int ''flags'',
       void *''procPtrs'',
       Tcl_LoadHandle *''handlePtr'');

The meaning of the ''flags'' parameter is not defined in TIP #357, except
that the current value should be 0. This TIP defines the meaning of the first
two bits of this parameter:

|#define TCL_LOAD_GLOBAL 1
|#define TCL_LOAD_LAZY 2

Any combination (logical or) of those two bits can be given to the ''flags''
parameter. The remaining bits are meant for future extension and are
currently ignored, but should be set to 0.

The '''load''' command will get two new options:

Current specification:

 > '''load''' ''fileName'' ?''packageName'' ?''interp''??

Recommended specification:

 > '''load''' ?'''-global'''? ?'''-lazy'''? ?'''--'''? ''fileName''
   ?''packageName'' ?''interp''??

~ Discussion

Not all platforms may support library loading to a degree required for this
TIP functionality.  In that case, the additional options just act as if they
were not there. The reference implementation works on most modern UNIX systems
and MacOSX, which use ''dlopen()'' or ''NSLinkModule()''. Windows does not allow
lazy symbol resolution or global scoping, so the options have no effect on Windows.

The '''load''' command will determine the use of the new form by checking if
more than one argument is given and the first argument starts with a '''-'''.
This should not affect any existing extensions, as dynamic library filenames
beginning with '''-''' are rare.

Note that use of the '''-global''' or '''-lazy''' option may lead to crashes in your
application later (in case of symbol conflicts resp. missing symbols), which cannot
be detected during the '''load'''. So, only use this when you know what you are doing,
you will not get a nice error message when something is wrong with the loaded library.

~ Examples

Load a module with the defaults (local scoping, "now" resolution)

| load module.so

Load the module with global scoping and "now" resolution

| load -global module.so

Load the module with global scoping and lazy resolution

| load -global -lazy module.so

~ Reference Implementation

A reference implementation is available in the '''frq-3579001''' branch; see
https://core.tcl.tk/tcl/timeline?r=frq-3579001

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|
|
|

|

|

|

|

|
|
|

|
|

|

|

|
|
|

|

|

|
|
|
|
|
|
|

|

|
|

|

|

|

|
|

|

|

|
|

|

|
|
|

|

|

|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136

# TIP 416: New Options for 'load': -global and -lazy

	Author:         Christian Delbaere <[email protected]>
	Author:         Jan Nijtmans <[email protected]>
	Author:         Jan Nijtmans <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        31-Oct-2012
	Post-History:   
	Tcl-Version:    8.6
-----

# Abstract

This TIP proposes enhancing the Tcl **load** command with the additional
options **-global** and **-lazy**. It is implemented on top of [[357]](357.md), by
defining a meaning to the **flags** parameter already defined there.

# Rationale

Platforms that use the _dlopen\(\)_ function to Tcl **load** shared modules
at runtime provide options to control how the library is loaded:

 * global vs. local symbol scoping

 * lazy vs. now symbol resolution

Currently, Tcl's **load** command has hard coded defaults for these options
and they cannot be overridden within a Tcl script.  This imposes constraints on
the internal implementation of the modules intended to be loaded into the
interpreter.  This is especially problematic for modules that provide Tcl
scripting bindings for existing C\+\+ APIs.  Often, the C\+\+ APIs make
assumptions about the availability and scoping of their symbols.

Tcl binding packages for C\+\+ APIs are often created by a different development
group than the one that created the original C\+\+ API.  Because the two groups
are independent, the C\+\+ API maintainers will not always be open or able to
change their code to fit the requirements to be loaded into a scripting
language.

A common problem occurs when the same static variable is present in two
different Tcl modules.  For some applications, the variable is meant to be
shared across modules \(global scoping\), while in other applications, the
variable must have its own value within each module \(local scoping\).  If the
wrong scoping is chosen, the underlying code will not work correctly; rather
it will yield strange bugs and / or crashes.

Also in the domain of Tcl bindings for C\+\+ APIs: it's convenient for the
binding package maintainers to have binary compatibility between one version
of the Tcl API and several versions of the C\+\+ API.  The **-lazy** flag for
Tcl's load command will provides the feature necessary for this flexibility,
since it can be used to defer missing symbol errors.  So, users can often
continue to run their scripts as long as they restrict themselves to calling
only commands where the symbols are available.

Of course, some applications work best when **load** is called with
**-global** and some work best without it.  The same can be said for
**-lazy**.  By providing these options, Tcl will allow programmers to choose
the best fit for their application.

# Specification

In [[357]](357.md), the _Tcl\_LoadFile_ is given as:

 > EXTERN int
   **Tcl\_LoadFile**\(
       Tcl\_Interp \*_interp_,
       Tcl\_Obj \*_pathPtr_,
       const char \*_symbols_[],
       int _flags_,
       void \*_procPtrs_,
       Tcl\_LoadHandle \*_handlePtr_\);

The meaning of the _flags_ parameter is not defined in TIP \#357, except
that the current value should be 0. This TIP defines the meaning of the first
two bits of this parameter:

	#define TCL_LOAD_GLOBAL 1
	#define TCL_LOAD_LAZY 2

Any combination \(logical or\) of those two bits can be given to the _flags_
parameter. The remaining bits are meant for future extension and are
currently ignored, but should be set to 0.

The **load** command will get two new options:

Current specification:

 > **load** _fileName_ ?_packageName_ ?_interp_??

Recommended specification:

 > **load** ?**-global**? ?**-lazy**? ?**--**? _fileName_
   ?_packageName_ ?_interp_??

# Discussion

Not all platforms may support library loading to a degree required for this
TIP functionality.  In that case, the additional options just act as if they
were not there. The reference implementation works on most modern UNIX systems
and MacOSX, which use _dlopen\(\)_ or _NSLinkModule\(\)_. Windows does not allow
lazy symbol resolution or global scoping, so the options have no effect on Windows.

The **load** command will determine the use of the new form by checking if
more than one argument is given and the first argument starts with a **-**.
This should not affect any existing extensions, as dynamic library filenames
beginning with **-** are rare.

Note that use of the **-global** or **-lazy** option may lead to crashes in your
application later \(in case of symbol conflicts resp. missing symbols\), which cannot
be detected during the **load**. So, only use this when you know what you are doing,
you will not get a nice error message when something is wrong with the loaded library.

# Examples

Load a module with the defaults \(local scoping, "now" resolution\)

	 load module.so

Load the module with global scoping and "now" resolution

	 load -global module.so

Load the module with global scoping and lazy resolution

	 load -global -lazy module.so

# Reference Implementation

A reference implementation is available in the **frq-3579001** branch; see
<https://core.tcl.tk/tcl/timeline?r=frq-3579001>

# Copyright

This document has been placed in the public domain.

Name change from tip/417.tip to tip/417.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64

TIP:		417
Title:		Use Explicit Option Names for "file tempfile"
Version:	$Revision: 1.1 $
Author:		Christophe Curis <[email protected]>
State:		Draft
Type:		Project
Tcl-Version:	8.7
Vote:		Pending
Created:	16-Nov-2012
Post-History:
Keywords:	Tcl, future expansion, extensibility

~ Abstract

This TIP proposes altering the way in which optional arguments are specified
to '''file tempfile''' (see [210]) to make them easier to understand and
extend in the future.

~ Rationale

The current documentation for '''file tempfile''' states that there are two
optional arguments using a fixed order. This has some limits:

 * it is not possible to use the second argument without the first

 * being an infrequently-used function, having a fixed order implies that a
   look to the manual page will be obligatory to make sure of the order

 * this inhibits potential for any future expansion of the command.

Switching to option/value format will make the optional arguments easier to
understand.

~ Proposal

The syntax of the command would be changed to:

 > '''file tempfile''' ?''options...''?

with supported ''options'':

 * '''-namevar'''
   ''variable'': Specifies a variable for receiving the file name.

 * '''-template'''
   ''template'': Defines a template for the file name.

This syntax would allow:

 * easy extension in the future, as any option name can be added,

 * ability to use any of the options in any order,

 * an explicit syntax making the code easier to read and understand.

~ Reference Implementation

No implementation is available now, but the change is probably not complex;
the priority have been placed on raising the subject before the release of the
final 8.6 version of Tcl.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|
|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64

# TIP 417: Use Explicit Option Names for "file tempfile"

	Author:		Christophe Curis <[email protected]>
	State:		Draft
	Type:		Project
	Tcl-Version:	8.7
	Vote:		Pending
	Created:	16-Nov-2012
	Post-History:
	Keywords:	Tcl, future expansion, extensibility
-----

# Abstract

This TIP proposes altering the way in which optional arguments are specified
to **file tempfile** \(see [[210]](210.md)\) to make them easier to understand and
extend in the future.

# Rationale

The current documentation for **file tempfile** states that there are two
optional arguments using a fixed order. This has some limits:

 * it is not possible to use the second argument without the first

 * being an infrequently-used function, having a fixed order implies that a
   look to the manual page will be obligatory to make sure of the order

 * this inhibits potential for any future expansion of the command.

Switching to option/value format will make the optional arguments easier to
understand.

# Proposal

The syntax of the command would be changed to:

 > **file tempfile** ?_options..._?

with supported _options_:

 * **-namevar**
   _variable_: Specifies a variable for receiving the file name.

 * **-template**
   _template_: Defines a template for the file name.

This syntax would allow:

 * easy extension in the future, as any option name can be added,

 * ability to use any of the options in any order,

 * an explicit syntax making the code easier to read and understand.

# Reference Implementation

No implementation is available now, but the change is probably not complex;
the priority have been placed on raising the subject before the release of the
final 8.6 version of Tcl.

# Copyright

This document has been placed in the public domain.

Name change from tip/418.tip to tip/418.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146

TIP:            418
Title:          Add [binary] Subcommands for In-Place Modification
Version:        $Revision: 1.3 $
Author:         Jeff Rogers <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        27-Aug-2012
Post-History:   
Keywords:       Tcl,binary data
Tcl-Version:    8.7

~ Abstract

This TIP proposes adding new subcommands to the '''binary''' to better enable
parsing and manipulation of binary values.

~ Rationale

The '''binary''' command efficiently deals with creating new objects or
completely parsing existing ones, but it does not handle modifying existing
binary objects or parsing them a little bit at a time.  A few new subcommands
would greatly improve the performance of these operations on large objects.

~~ Variable vs. Value

While it will be possible to implement these modification operations as
standard copy-on-write operations taking a value instead of a variable name, I
believe this would result in copying unless the well-known but still clumsy
technique of unsetting the variable after reading it (i.e., [[K $x [[set x {}]]]]) 
is used. This TIP is intended to fix this by providing a more
convenient and simpler-to-use mechanism which also admits more efficient
implementation.

~ Specification

Two new core subcommands are proposed: '''binary edit''' modifies an existing
byte array "in place"; and '''binary scanshift''' parse data from a byte array
and removes the data that was parsed.  The intent is that additional commands
can be built on top of these in library code.

The existing '''binary''' commands already make use of an internal cursor;
that notion is extensively used by these new commands.

~~ Binary Edit

 > '''binary edit''' ''varName formatStr'' ?''value value ...''?

This is similar to '''binary format''' except that the initial value of the
new byte array is an existing object stored in a variable rather than an array
of nulls.

Format specifiers in ''formatStr'' are as in '''binary format''' except:

 * fixed-width format specifiers (e.g., '''c''', '''s''', '''i''') that do not
   have enough values in their corresponding argument (importantly, if they
   have 0 values) move the cursor by the appropriate width.

 * New '''z''' and '''Z''' format specifiers are introduced that move the
   cursor forward or backward in the binary string without writing anything
   and consume no arguments. '''Z''' is a synonym for "X", provided for
   symmetry.  A count of "'''*'''" for the "'''z'''" format moves the cursor
   to the end of the existing object.

After the format string and all arguments have been processed, the length of
the string is adjusted to end at the current cursor position.

Thus, a format string that starts with "z*" will append to the existing value,
and one that ends with "z*" will keep the length the same.

~~ Binary Scanshift

 > '''binary scanshift''' ''varName formatStr'' ?''var var var ...''?

This works like '''binary scan''' except that after the format string has been
processed and all variables assigned to, all data in the string before the
ending location of the cursor is discarded.

Thus,

|binary scanshift bvar c byte1
|binary scanshift bvar c byte2
|binary scanshift bvar c byte3

Will put the first 3 bytes of the binary ''$bvar'' into ''byte1'', ''byte2'',
and ''byte3'', with ''bvar'' being subsequently three bytes shorter (the
missing bytes being the first three).

This is useful to avoid keeping a separate external cursor variable that must
be incremented and re-used on each iteration.

~~ Additional Library Commands

Suggested additional library commands are '''poke''' and '''append'''. The
arguments to '''binary poke''' will be:

 > '''binary poke''' ''varName index formatStr'' ?''var ...''?

This moves the cursor to a specified index, then overwrite with the specified
format string.  Implemented as

|      binary edit varName "@${index} $formatStr z*" var ...

The arguments to '''binary append''' will be:

 > '''binary append''' ''varName formatStr'' "?''var ...''?

This appends the given formatted data to an existing var.  Implemented as

|      binary edit varName "z* $formatStr" ?var ...?"

~ Implementation Notes

Efficient implementation of the "'''scanshift'''" subcommand requires a new
"offset" field in the ByteArray structure and any operations that read the
object (particularly duplicating it and updating the string representation)
need to be aware of this field.  All external interfaces should be unaffected,
as the ByteArray structure type is private to tclBinary.c, And since it's
internal, EIAS is not violated.

When extending an existing byte array with the "'''edit'''" subcommand, care
should be taken with memory allocation to avoid repeated ''realloc()'' and
''memcpy()'' operations.  It is a reasonable assumption that a given byte
array will be extended repeatedly or not at all beyond the initial creation.
So a memory allocation strategy is to allocate the exact length initially
(i.e., when adjusting the size from 0 to non-zero) and allocating double the
requested length subsequently a typical allocation-doubling strategy should
work well.  A double allocation should not be needed for an initial extension
(i.e., extending from 0 to some length) as that is typically the case when a
binary object is first created, and most binary objects will probably not be
extended; but once extended it is reasonable to prepare for more of the same.

After numerous "'''scanshift'''" operations there will be wasted space at the
beginning of the memory allocated for the data.  One strategy for keeping this
under control would be to move the live data to the beginning of the allocated
space when the offset is larger than the live data, so that the memory could
be copied without worrying about overlap; and this would also leave the
allocation size at roughly double the live data size.

~ Reference Implementation

Forthcoming.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|
|
|

|

|
|

|
|

|

|

|

|
|
|

|
|
|

|

|
|

|

|

|

|

|

|

|

|

|
|
|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146

# TIP 418: Add [binary] Subcommands for In-Place Modification

	Author:         Jeff Rogers <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        27-Aug-2012
	Post-History:   
	Keywords:       Tcl,binary data
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes adding new subcommands to the **binary** to better enable
parsing and manipulation of binary values.

# Rationale

The **binary** command efficiently deals with creating new objects or
completely parsing existing ones, but it does not handle modifying existing
binary objects or parsing them a little bit at a time.  A few new subcommands
would greatly improve the performance of these operations on large objects.

## Variable vs. Value

While it will be possible to implement these modification operations as
standard copy-on-write operations taking a value instead of a variable name, I
believe this would result in copying unless the well-known but still clumsy
technique of unsetting the variable after reading it \(i.e., [K $x [set x {}]]\) 
is used. This TIP is intended to fix this by providing a more
convenient and simpler-to-use mechanism which also admits more efficient
implementation.

# Specification

Two new core subcommands are proposed: **binary edit** modifies an existing
byte array "in place"; and **binary scanshift** parse data from a byte array
and removes the data that was parsed.  The intent is that additional commands
can be built on top of these in library code.

The existing **binary** commands already make use of an internal cursor;
that notion is extensively used by these new commands.

## Binary Edit

 > **binary edit** _varName formatStr_ ?_value value ..._?

This is similar to **binary format** except that the initial value of the
new byte array is an existing object stored in a variable rather than an array
of nulls.

Format specifiers in _formatStr_ are as in **binary format** except:

 * fixed-width format specifiers \(e.g., **c**, **s**, **i**\) that do not
   have enough values in their corresponding argument \(importantly, if they
   have 0 values\) move the cursor by the appropriate width.

 * New **z** and **Z** format specifiers are introduced that move the
   cursor forward or backward in the binary string without writing anything
   and consume no arguments. **Z** is a synonym for "X", provided for
   symmetry.  A count of "**\***" for the "**z**" format moves the cursor
   to the end of the existing object.

After the format string and all arguments have been processed, the length of
the string is adjusted to end at the current cursor position.

Thus, a format string that starts with "z\*" will append to the existing value,
and one that ends with "z\*" will keep the length the same.

## Binary Scanshift

 > **binary scanshift** _varName formatStr_ ?_var var var ..._?

This works like **binary scan** except that after the format string has been
processed and all variables assigned to, all data in the string before the
ending location of the cursor is discarded.

Thus,

	binary scanshift bvar c byte1
	binary scanshift bvar c byte2
	binary scanshift bvar c byte3

Will put the first 3 bytes of the binary _$bvar_ into _byte1_, _byte2_,
and _byte3_, with _bvar_ being subsequently three bytes shorter \(the
missing bytes being the first three\).

This is useful to avoid keeping a separate external cursor variable that must
be incremented and re-used on each iteration.

## Additional Library Commands

Suggested additional library commands are **poke** and **append**. The
arguments to **binary poke** will be:

 > **binary poke** _varName index formatStr_ ?_var ..._?

This moves the cursor to a specified index, then overwrite with the specified
format string.  Implemented as

	      binary edit varName "@${index} $formatStr z*" var ...

The arguments to **binary append** will be:

 > **binary append** _varName formatStr_ "?_var ..._?

This appends the given formatted data to an existing var.  Implemented as

	      binary edit varName "z* $formatStr" ?var ...?"

# Implementation Notes

Efficient implementation of the "**scanshift**" subcommand requires a new
"offset" field in the ByteArray structure and any operations that read the
object \(particularly duplicating it and updating the string representation\)
need to be aware of this field.  All external interfaces should be unaffected,
as the ByteArray structure type is private to tclBinary.c, And since it's
internal, EIAS is not violated.

When extending an existing byte array with the "**edit**" subcommand, care
should be taken with memory allocation to avoid repeated _realloc\(\)_ and
_memcpy\(\)_ operations.  It is a reasonable assumption that a given byte
array will be extended repeatedly or not at all beyond the initial creation.
So a memory allocation strategy is to allocate the exact length initially
\(i.e., when adjusting the size from 0 to non-zero\) and allocating double the
requested length subsequently a typical allocation-doubling strategy should
work well.  A double allocation should not be needed for an initial extension
\(i.e., extending from 0 to some length\) as that is typically the case when a
binary object is first created, and most binary objects will probably not be
extended; but once extended it is reasonable to prepare for more of the same.

After numerous "**scanshift**" operations there will be wasted space at the
beginning of the memory allocated for the data.  One strategy for keeping this
under control would be to move the live data to the beginning of the allocated
space when the offset is larger than the live data, so that the memory could
be copied without worrying about overlap; and this would also leave the
allocation size at roughly double the live data size.

# Reference Implementation

Forthcoming.

# Copyright

This document has been placed in the public domain.

Name change from tip/419.tip to tip/419.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76

TIP:            419
Title:          A New Command for Binding to Tk Events
Version:        $Revision: 1.2 $
Author:         Jeff Rogers <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        28-Aug-2012
Post-History:   
Tcl-Version:    8.7

~ Abstract

This TIP proposes a more modern mechanism for binding callbacks to Tk's
events.

~ Rationale

The Tk '''bind'''' command passes details about an event to a callback script
by doing a textual substitution of percent markers.  This has worked well for
years, however most recent code prefers to use command prefixes to which set
arguments are appended rather than scripts which are evaluated.  This TIP
proposes such an approach for tk event binding.

~ Specification

A new command, "tkevent" is introduced with the following syntax:

 > '''tkevent''' ''tag'' ''sequence'' ''cmd''

The ''tag'' and ''sequence'' arguments are the same as used in the '''bind'''
command. The ''cmd'' is evaluated by appending a single argument which is a
dictionary containing the event details.  The implementation of ''cmd'' can
retrieve details of the event using that dictionary.

Bindings created by '''tkevent''' are compatible with those created by
'''bind'''.  When a sequence is bound to a tag using '''tkevent''', it
replaces any previous binding, and vice versa.  Appending to a binding with
"bind tag sequence +script" may not work as expected.

The possible keys in the dict passed to the handler are:

|        serial        above         button
|        count         detail        focus
|        height        window        keycode
|        mode          override_redirect
|        place         state         time
|        width         x             y
|        character     border_width  delta
|        send_event    keysym        keysym_num
|        property      root          subwindow
|        type          window        xroot
|        yroot

These keys are intended to be be the same as the options to '''event
generate''' where applicable.

Not all values are legal for all event types; where a key is not legal for an
event type, it will not be present in the dictionary when the ''cmd'' bound to
that event is evaluated.

~ Reference Implementation

A sample implementation of these commands in pure tcl is available at
http://wiki.tcl.tk/tkevent

~ Cross-Compatibility with Bind

Except for the "+script" feature, '''bind''' and '''tkevent''' support
identical functionality, and either could be implemented in terms of the
other. If '''bind''' as a core command was dropped in favor of '''tkevent''',
it could be provided as a library implementation.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|
|
|

|
|

|

|
|
|
|
|
|
|
|
|
|
|

|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76

# TIP 419: A New Command for Binding to Tk Events

	Author:         Jeff Rogers <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        28-Aug-2012
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes a more modern mechanism for binding callbacks to Tk's
events.

# Rationale

The Tk **bind**' command passes details about an event to a callback script
by doing a textual substitution of percent markers.  This has worked well for
years, however most recent code prefers to use command prefixes to which set
arguments are appended rather than scripts which are evaluated.  This TIP
proposes such an approach for tk event binding.

# Specification

A new command, "tkevent" is introduced with the following syntax:

 > **tkevent** _tag_ _sequence_ _cmd_

The _tag_ and _sequence_ arguments are the same as used in the **bind**
command. The _cmd_ is evaluated by appending a single argument which is a
dictionary containing the event details.  The implementation of _cmd_ can
retrieve details of the event using that dictionary.

Bindings created by **tkevent** are compatible with those created by
**bind**.  When a sequence is bound to a tag using **tkevent**, it
replaces any previous binding, and vice versa.  Appending to a binding with
"bind tag sequence \+script" may not work as expected.

The possible keys in the dict passed to the handler are:

	        serial        above         button
	        count         detail        focus
	        height        window        keycode
	        mode          override_redirect
	        place         state         time
	        width         x             y
	        character     border_width  delta
	        send_event    keysym        keysym_num
	        property      root          subwindow
	        type          window        xroot
	        yroot

These keys are intended to be be the same as the options to **event
generate** where applicable.

Not all values are legal for all event types; where a key is not legal for an
event type, it will not be present in the dictionary when the _cmd_ bound to
that event is evaluated.

# Reference Implementation

A sample implementation of these commands in pure tcl is available at
<http://wiki.tcl.tk/tkevent>

# Cross-Compatibility with Bind

Except for the "\+script" feature, **bind** and **tkevent** support
identical functionality, and either could be implemented in terms of the
other. If **bind** as a core command was dropped in favor of **tkevent**,
it could be provided as a library implementation.

# Copyright

This document has been placed in the public domain.

Name change from tip/42.tip to tip/42.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
TIP:		42
Title:          Add New Standard Tk Option: -clientdata
Version:	$Revision: 1.4 $
Author:         Bryan Oakley <[email protected]>
State:		Withdrawn
Type:           Project
Vote:		Pending
Created:	05-Jul-2001
Post-History:	
Tcl-Version:    8.5

~ Abstract

This TIP proposes to add a new standard option, -clientdata, for all
Tk widgets.

~ Rationale

Many modern and not so modern widget toolkits provide a way to attach
programmer defined data to a widget.  Tk lacks such a feature.  The
only way to accomplish a similar feat today is by storing data in a
global or namespace variable keyed by widget name.  This doesn't lend
itself very well to general purpose library routines.

One example of how this could be used is in prototyping additional
widget functionality.  For example, [39] requests a new option for
each widget that enables a widget to declare that it is part of a
larger compound widget.  One potential use of this new flag is in the
Tk library code that handles keyboard traversal.

With the option proposed in this TIP, it would have been quite simple
to prototype the necessary changes at the script level, making it
easier to validate the utility of the requested change in TIP #39, and
to provide a reference implementation of the affected library
procedures.

Another example use of this flag would be in the development of a
graphical interface builder such as SpecTcl or Visual Tcl.  With
applications that let you create widgets interactively, it is often
convenient to attach metadata directly to the widget.  For example,
<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39

# TIP 42: Add New Standard Tk Option: -clientdata

	Author:         Bryan Oakley <[email protected]>
	State:		Withdrawn
	Type:           Project
	Vote:		Pending
	Created:	05-Jul-2001
	Post-History:	
	Tcl-Version:    8.5
-----

# Abstract

This TIP proposes to add a new standard option, -clientdata, for all
Tk widgets.

# Rationale

Many modern and not so modern widget toolkits provide a way to attach
programmer defined data to a widget.  Tk lacks such a feature.  The
only way to accomplish a similar feat today is by storing data in a
global or namespace variable keyed by widget name.  This doesn't lend
itself very well to general purpose library routines.

One example of how this could be used is in prototyping additional
widget functionality.  For example, [[39]](39.md) requests a new option for
each widget that enables a widget to declare that it is part of a
larger compound widget.  One potential use of this new flag is in the
Tk library code that handles keyboard traversal.

With the option proposed in this TIP, it would have been quite simple
to prototype the necessary changes at the script level, making it
easier to validate the utility of the requested change in TIP \#39, and
to provide a reference implementation of the affected library
procedures.

Another example use of this flag would be in the development of a
graphical interface builder such as SpecTcl or Visual Tcl.  With
applications that let you create widgets interactively, it is often
convenient to attach metadata directly to the widget.  For example,

︙ ︙ 
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99

... and the list goes on.  The bottom line is, it adds flexibility
that can be leveraged in many ways.  Impact on the core is minimal
since it merely requires the storage and retrieval of information.
And the mechanism is already in place; we merely need to define a slot
in the widget data structure to store the information.

~ Specification

Suggested wording for the ''options'' man page (which, I suspect, can
be greatly improved upon):

| Command-Line Name: -clientdata
| Database Name: clientData
| Database Class: ClientData

 > ''Specifies programmer defined data to be associated with the
   widget.  The Tk libraries do not use this information or require
   the information to be in any particular format.  It is purely for
   use by the application.''

~ Lame Joke

Did you hear the one about the three legged dog that went into a
saloon, jumped up on the nearest stool, banged his good foot on the
bar, and with a steely-eyed glare said "I'm lookin' for the man that
shot my Paw!"?

~ Notice of Withdrawal

This TIP was Withdrawn by the TIP Editor following discussion on the
tcl-core mailing list.  The following is a summary of reasons for
withdrawal:

 > Perhaps some of the ideas behind these TIPs should be incorporated
   into some new TIP on making megawidget support better, but none of
   these TIPs really stand on their own.  (38 isn't a good idea, since
   alteration of the bindtags for all widgets of a class at once is a
   bad idea, and it is better when rolling your own megawidget classes
   to put the setting up of the bindtags in there.  39 and 42 just
   clash with each other as soon as you have two different codebases
   trying to use a single widget.)

~ Copyright

This document has been placed in the public domain accompanied with
only a small and very personal amount of fanfare.

|

|
|

|
|
|

|

|

|

|

|

|

|

>
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99

... and the list goes on.  The bottom line is, it adds flexibility
that can be leveraged in many ways.  Impact on the core is minimal
since it merely requires the storage and retrieval of information.
And the mechanism is already in place; we merely need to define a slot
in the widget data structure to store the information.

# Specification

Suggested wording for the _options_ man page \(which, I suspect, can
be greatly improved upon\):

	 Command-Line Name: -clientdata
	 Database Name: clientData
	 Database Class: ClientData

 > _Specifies programmer defined data to be associated with the
   widget.  The Tk libraries do not use this information or require
   the information to be in any particular format.  It is purely for
   use by the application._

# Lame Joke

Did you hear the one about the three legged dog that went into a
saloon, jumped up on the nearest stool, banged his good foot on the
bar, and with a steely-eyed glare said "I'm lookin' for the man that
shot my Paw!"?

# Notice of Withdrawal

This TIP was Withdrawn by the TIP Editor following discussion on the
tcl-core mailing list.  The following is a summary of reasons for
withdrawal:

 > Perhaps some of the ideas behind these TIPs should be incorporated
   into some new TIP on making megawidget support better, but none of
   these TIPs really stand on their own.  \(38 isn't a good idea, since
   alteration of the bindtags for all widgets of a class at once is a
   bad idea, and it is better when rolling your own megawidget classes
   to put the setting up of the bindtags in there.  39 and 42 just
   clash with each other as soon as you have two different codebases
   trying to use a single widget.\)

# Copyright

This document has been placed in the public domain accompanied with
only a small and very personal amount of fanfare.

Name change from tip/420.tip to tip/420.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65

66
67

68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304

TIP:            420
Title:          'vexpr', a Vector Expression Command
Version:        $Revision: 1.6 $
Author:         Sean Woods <[email protected]>
Author:         Andreas Kupries <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        15-Nov-2012
Post-History:   
Tcl-Version:    8.7

~ Abstract

This TIP proposes to add a new command to Tcl for manipulating vectors and
related mathematical objects. The command, '''vexpr''', will provide
C-optimized implementations of generally useful scalar, 2D, 3D and affine transforms. '''vexpr''' is a complement to '''expr''', and expects to take in vector arguments and return vector results.

~ Rationale

With the interest expressed in the community by [363], I am concerned about
the introduction of non-scalar results from '''expr''' (and parts of the
language the use '''expr'''). As the goal of that TIP is to introduce vector
math operations, a less ambitious, but arguable equally effective technique
could be to introduce a dedicated command. In particular, one designed from
the ground up to handle the intricacies of vector operations.

'''vexpr''' is a vector expression parser. It operates using reverse-polish
notation (like an HP calculator.) Each argument is pushed onto the stack, and
when a command is detected, they are popped off the stack. The result of the
command is pushed onto the stack in their place.

Why? Well mostly for ease of implementation. Partly because there is no PEMDAS
equivalent order of operation for matrices and vectors.  Once I go through an
example or two, it should be a little clearer.

~ Examples

To add {1 1 1} and {2 2 2} I run the following command:

|vexpr {2 2 2} {1 1 1} +
|> 3.0 3.0 3.0

Remember though, we are working with a stack. Items are popped on the stack in
a first-in first-out fashion. While for addition it doesn't matter what order
we do things, subtraction does care.

|vexpr {1 1 1} {2 2 2} -
|> 1.0 1.0 1.0
|vexpr {2 2 2} {1 1 1} -
|> -1.0 -1.0 -1.0

While with 2 arguments and an opcode this seems silly, imagine a complex
operation with several steps. Here we are going to model a robot arm with 3
joints. Each "arm" is one unit long, and when one joint bends, the rest follow
suit.

''unbent''

|(A) - (B) - (C)

''bent''

|      (C)
|        |

|      (B)
|     /

|(A)

Code:

|# Positions of the joints
|set A_pos {0.0 0.0 0.0}
|set B_pos {1.0 0.0 0.0}
|set C_pos {2.0 0.0 0.0}
|
|# Rotations of the joints 
|set A_rot {0 0 45}
|set B_rot {0 0 45}
|
|set b_transform [vexpr \
|    $A_pos $B_pos - \
|    affine_translate \
|    $A_rot radians \
|    affine_rotate \
|    affine_multiply]
|> { 0.707  0.707 0.0  -0.707} 
|  {-0.707  0.707 0.0   0.707}
|  { 0.0    0.0   1.0   0.0}
|  { 0.0    0.0   0.0   1.0}
|
|set b_real [vexpr $B_pos $b_transform vector_transform]
|
|> 0.707106 0.707106 0.0
|
|set c_transform [vexpr \
|    $C_pos $B_real - \
|    affine_translate \
|    load affine_multiply \
|    $B_rot radians \
|    affine_rotate \
|    affine_multiply]
|> { 0.0 1.0 0.0 0.707}
|  {-1.0 0.0 0.0 2.293}
|  {0.0  0.0 1.0 0.0}
|  {0.0  0.0 0.0 1.0}
|
|set c_real [vexpr $C_pos $c_transform vector_transform]
|> 0.0 2.0 0.0

If you aren't familiar with 3D math and affine transformations, that may look
overly complicated, but as you can see each '''vexpr''' call is packed with
commands. You can plainly see that after 2 45 degree bends, our "C" point
comes to rest at 0.0,2,0 after completing a 90 degree bend.

~ Operations

Note that all arguments that are not one of these operation words are instead treated as values to push onto the evaluation stack.

~~affine_multiply

 > AFFINE AFFINE -> AFFINE

Multiplies 2 4x4 matrices. Used to combine 2 affine transformations. Note:
Some affine transformations need to be performed in a particular order to make
sense.

~~affine_rotate

 > VECTOR -> AFFINE

Converts a "vector" of 3 angles (Xrotation Yrotation Zrotation) into an affine
transformation. NOTE: the angles should be in radians.

~~affine_scale

 > VECTOR -> AFFINE

Converts a scale vector (Xscale Yscale Zscale) into an affine transformation.
Note: 1.0 1.0 1.0 = No scaling. 2.0 2.0 2.0 = Double the size. 0.5 0.5 0.5 =
Half the size.

~~affine_translate

 > VECTOR -> AFFINE

Converts a displacement vector (X Y Z) into an affine transformation	

~~cart_to_cyl

 > VECTOR -> VECTOR

Converts a cartesian vector to cylindrical coordinates	

~~cart_to_sphere

 > VECTOR -> VECTOR

Converts a cartesian vector to spherical coordinates

~~cross

 > VECTOR VECTOR -> VECTOR

Performs the cross product of two vectors

~~copy

 > ANY -> ANY ANY

Copies the top of the stack, pushing it onto the stack.

~~cyl_to_cart

 > VECTOR -> VECTOR

Converts a vector in cylindrical coordinates to cartesian coordinates	

~~cyl_to_degrees

 > VECTOR -> VECTOR

Converts a cylindrical vector in radians to degrees.

~~cyl_to_radians

 > VECTOR -> VECTOR

Converts a cylindrical vector in degrees to radians.

~~degrees

 > VECTOR -> VECTOR

Converts a vector or scalar in radians to degrees.

~~dot

 > VECTOR VECTOR -> SCALAR

Produces the dot product of two vectors.

~~dT

 > (None) -> SCALAR

Pushes the value of dT into the stack.

~~identity

 > (None) -> AFFINE

Pushes the identity matrix onto the stack.

~~load

 > (None) -> ANY

Pushes the last value stored by STORE onto the stack.	

~~pi

 > (None) -> SCALER

Pushes the value of PI onto the stack.

~~radians

 > VECTOR -> VECTOR

Converts a vector or scalar in degrees to radians.

~~setDT

 > SCALAR -> (None)

Pops the current stack value and stores it in the dT variable.

~~sphere_to_cart

 > VECTOR -> VECTOR

Converts a vector in spherical coordinates to cartesian coordinates.

~~sphere_to_degrees

 > VECTOR -> VECTOR

Converts a spherical vector in radians to a spherical vector in degrees.

~~sphere_to_radians

 > VECTOR -> VECTOR

Converts a spherical vector in degrees to a spherical vector in radians.

~~store

 > ANY -> ANY

Stores the top of the stack internally for later use. The value stored remains at the top of the stack.

~~vector_add

 > VECTOR VECTOR -> VECTOR

Adds 2 vectors, which must be of the same length.

~~vector_length

 > VECTOR -> SCALAR

Produces the length of a vector.

~~vector_scale

 > SCALAR VECTOR -> VECTOR

Scales a vector by a scalar

~~vector_subtract

 > VECTOR VECTOR -> VECTOR

Subtracts one vector from another.

~~vector_transform

 > AFFINE VECTOR -> VECTOR

Transforms a vector using an affine matrix.

~Implementation

A test implementation for '''vexpr''' is available as an TEA extension, and can be downloaded [http://www.etoyoc.com/tclmatrix3d].  At this point in time, the goal is adding '''vexpr''' as a standalone command.

~~ Limits

'''vexpr''' converts all arguments to an array of 16 double precision
elements; only the item left on the top of the stack is converted back into a Tcl list. The "stack" itself has a hard-coded limit of 32 elements. (It is implemented as an array.) Exceeding the stack size will cause the command to throw a Tcl error.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|
|
|

|
|

|

|

|
|

|
|
|
|

|

|

|

|
<
>
|
<
>
|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63

64
65

66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304

# TIP 420: 'vexpr', a Vector Expression Command

	Author:         Sean Woods <[email protected]>
	Author:         Andreas Kupries <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        15-Nov-2012
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes to add a new command to Tcl for manipulating vectors and
related mathematical objects. The command, **vexpr**, will provide
C-optimized implementations of generally useful scalar, 2D, 3D and affine transforms. **vexpr** is a complement to **expr**, and expects to take in vector arguments and return vector results.

# Rationale

With the interest expressed in the community by [[363]](363.md), I am concerned about
the introduction of non-scalar results from **expr** \(and parts of the
language the use **expr**\). As the goal of that TIP is to introduce vector
math operations, a less ambitious, but arguable equally effective technique
could be to introduce a dedicated command. In particular, one designed from
the ground up to handle the intricacies of vector operations.

**vexpr** is a vector expression parser. It operates using reverse-polish
notation \(like an HP calculator.\) Each argument is pushed onto the stack, and
when a command is detected, they are popped off the stack. The result of the
command is pushed onto the stack in their place.

Why? Well mostly for ease of implementation. Partly because there is no PEMDAS
equivalent order of operation for matrices and vectors.  Once I go through an
example or two, it should be a little clearer.

# Examples

To add \{1 1 1\} and \{2 2 2\} I run the following command:

	vexpr {2 2 2} {1 1 1} +
	> 3.0 3.0 3.0

Remember though, we are working with a stack. Items are popped on the stack in
a first-in first-out fashion. While for addition it doesn't matter what order
we do things, subtraction does care.

	vexpr {1 1 1} {2 2 2} -
	> 1.0 1.0 1.0
	vexpr {2 2 2} {1 1 1} -
	> -1.0 -1.0 -1.0

While with 2 arguments and an opcode this seems silly, imagine a complex
operation with several steps. Here we are going to model a robot arm with 3
joints. Each "arm" is one unit long, and when one joint bends, the rest follow
suit.

_unbent_

	(A) - (B) - (C)

_bent_

	      (C)

	        |
	      (B)

	     /
	(A)

Code:

	# Positions of the joints
	set A_pos {0.0 0.0 0.0}
	set B_pos {1.0 0.0 0.0}
	set C_pos {2.0 0.0 0.0}

	# Rotations of the joints 
	set A_rot {0 0 45}
	set B_rot {0 0 45}

	set b_transform [vexpr \
	    $A_pos $B_pos - \
	    affine_translate \
	    $A_rot radians \
	    affine_rotate \
	    affine_multiply]
	> { 0.707  0.707 0.0  -0.707} 
	  {-0.707  0.707 0.0   0.707}
	  { 0.0    0.0   1.0   0.0}
	  { 0.0    0.0   0.0   1.0}

	set b_real [vexpr $B_pos $b_transform vector_transform]

	> 0.707106 0.707106 0.0

	set c_transform [vexpr \
	    $C_pos $B_real - \
	    affine_translate \
	    load affine_multiply \
	    $B_rot radians \
	    affine_rotate \
	    affine_multiply]
	> { 0.0 1.0 0.0 0.707}
	  {-1.0 0.0 0.0 2.293}
	  {0.0  0.0 1.0 0.0}
	  {0.0  0.0 0.0 1.0}

	set c_real [vexpr $C_pos $c_transform vector_transform]
	> 0.0 2.0 0.0

If you aren't familiar with 3D math and affine transformations, that may look
overly complicated, but as you can see each **vexpr** call is packed with
commands. You can plainly see that after 2 45 degree bends, our "C" point
comes to rest at 0.0,2,0 after completing a 90 degree bend.

# Operations

Note that all arguments that are not one of these operation words are instead treated as values to push onto the evaluation stack.

## affine\_multiply

 > AFFINE AFFINE -> AFFINE

Multiplies 2 4x4 matrices. Used to combine 2 affine transformations. Note:
Some affine transformations need to be performed in a particular order to make
sense.

## affine\_rotate

 > VECTOR -> AFFINE

Converts a "vector" of 3 angles \(Xrotation Yrotation Zrotation\) into an affine
transformation. NOTE: the angles should be in radians.

## affine\_scale

 > VECTOR -> AFFINE

Converts a scale vector \(Xscale Yscale Zscale\) into an affine transformation.
Note: 1.0 1.0 1.0 = No scaling. 2.0 2.0 2.0 = Double the size. 0.5 0.5 0.5 =
Half the size.

## affine\_translate

 > VECTOR -> AFFINE

Converts a displacement vector \(X Y Z\) into an affine transformation	

## cart\_to\_cyl

 > VECTOR -> VECTOR

Converts a cartesian vector to cylindrical coordinates	

## cart\_to\_sphere

 > VECTOR -> VECTOR

Converts a cartesian vector to spherical coordinates

## cross

 > VECTOR VECTOR -> VECTOR

Performs the cross product of two vectors

## copy

 > ANY -> ANY ANY

Copies the top of the stack, pushing it onto the stack.

## cyl\_to\_cart

 > VECTOR -> VECTOR

Converts a vector in cylindrical coordinates to cartesian coordinates	

## cyl\_to\_degrees

 > VECTOR -> VECTOR

Converts a cylindrical vector in radians to degrees.

## cyl\_to\_radians

 > VECTOR -> VECTOR

Converts a cylindrical vector in degrees to radians.

## degrees

 > VECTOR -> VECTOR

Converts a vector or scalar in radians to degrees.

## dot

 > VECTOR VECTOR -> SCALAR

Produces the dot product of two vectors.

## dT

 > \(None\) -> SCALAR

Pushes the value of dT into the stack.

## identity

 > \(None\) -> AFFINE

Pushes the identity matrix onto the stack.

## load

 > \(None\) -> ANY

Pushes the last value stored by STORE onto the stack.	

## pi

 > \(None\) -> SCALER

Pushes the value of PI onto the stack.

## radians

 > VECTOR -> VECTOR

Converts a vector or scalar in degrees to radians.

## setDT

 > SCALAR -> \(None\)

Pops the current stack value and stores it in the dT variable.

## sphere\_to\_cart

 > VECTOR -> VECTOR

Converts a vector in spherical coordinates to cartesian coordinates.

## sphere\_to\_degrees

 > VECTOR -> VECTOR

Converts a spherical vector in radians to a spherical vector in degrees.

## sphere\_to\_radians

 > VECTOR -> VECTOR

Converts a spherical vector in degrees to a spherical vector in radians.

## store

 > ANY -> ANY

Stores the top of the stack internally for later use. The value stored remains at the top of the stack.

## vector\_add

 > VECTOR VECTOR -> VECTOR

Adds 2 vectors, which must be of the same length.

## vector\_length

 > VECTOR -> SCALAR

Produces the length of a vector.

## vector\_scale

 > SCALAR VECTOR -> VECTOR

Scales a vector by a scalar

## vector\_subtract

 > VECTOR VECTOR -> VECTOR

Subtracts one vector from another.

## vector\_transform

 > AFFINE VECTOR -> VECTOR

Transforms a vector using an affine matrix.

# Implementation

A test implementation for **vexpr** is available as an TEA extension, and can be downloaded <http://www.etoyoc.com/tclmatrix3d> .  At this point in time, the goal is adding **vexpr** as a standalone command.

## Limits

**vexpr** converts all arguments to an array of 16 double precision
elements; only the item left on the top of the stack is converted back into a Tcl list. The "stack" itself has a hard-coded limit of 32 elements. \(It is implemented as an array.\) Exceeding the stack size will cause the command to throw a Tcl error.

# Copyright

This document has been placed in the public domain.

Name change from tip/421.tip to tip/421.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62

TIP:		421
Title:		A Command for Iterating Over Arrays
State:		Draft
Type:		Project
Tcl-Version:	8.7
Vote:		Pending
Post-History:	
Version:	$Revision: 1.1 $
Author:		Karl Lehenbauer <[email protected]>
Author:		Donal K. Fellows <[email protected]>
Created:	28-Nov-2012

~ Abstract

This TIP proposes an efficient mechanism for iterating over the contents of a
large array.

~ Rationale

Tcl currently provides three main mechanisms for iterating over the contents
of an array, but none are quite perfect when dealing with a large array.

 * '''array get''' is simple to use (especially with a two-variable
   '''foreach''') but requires the contents of the array to be effectively
   duplicated; even with the use of the Tcl_Obj system for value reference
   management, this is an expensive operation.

 * '''array names''' (with a simple '''foreach''') is also relatively simple
   to use, but requires producing a list whose size is the same as the number
   of elements of the array. (This is half the size that would be required
   with '''array get''', but can still be large.)

 * '''array startsearch''' et al. provide a memory-efficient general iteration
   mechanism, but in a way that is rather difficult to use. It is also subject
   to significant hazards if the array is modified during iteration (a
   particular problem for the global '''env''' array, as that is regenerated
   on almost any read).

The authors propose that there be a new subcommand of '''array''' which allows
for efficient iteration over an array's elements.

~ Proposed Change

There should be a new command, '''array foreach''', that has this syntax:

 > '''array''' '''foreach''' ''arrayName'' {''keyVar'' ''valueVar''} ''body''

This will iterate internally over the elements of the array called
''arrayName'' in array-iteration order (i.e., the same as that used by the
other '''array''' subcommands), setting the variable ''keyVar'' to the name of
the element and the variable ''valueVar'' to the content of the element before
evaluating the script ''body''. The result will be the empty string (excepting
errors, '''return''', etc.) and any contained '''break''' and '''continue'''
will have their normal interpretation as loop control operations.

~ Implementation

Not yet...

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
|
>

|

|

|
|
|

|

|
|

|

|
|
|

|

|

|

|

|
|
|
|
|

|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62

# TIP 421: A Command for Iterating Over Arrays
	State:		Draft
	Type:		Project
	Tcl-Version:	8.7
	Vote:		Pending
	Post-History:	

	Author:		Karl Lehenbauer <[email protected]>
	Author:		Donal K. Fellows <[email protected]>
	Created:	28-Nov-2012
-----

# Abstract

This TIP proposes an efficient mechanism for iterating over the contents of a
large array.

# Rationale

Tcl currently provides three main mechanisms for iterating over the contents
of an array, but none are quite perfect when dealing with a large array.

 * **array get** is simple to use \(especially with a two-variable
   **foreach**\) but requires the contents of the array to be effectively
   duplicated; even with the use of the Tcl\_Obj system for value reference
   management, this is an expensive operation.

 * **array names** \(with a simple **foreach**\) is also relatively simple
   to use, but requires producing a list whose size is the same as the number
   of elements of the array. \(This is half the size that would be required
   with **array get**, but can still be large.\)

 * **array startsearch** et al. provide a memory-efficient general iteration
   mechanism, but in a way that is rather difficult to use. It is also subject
   to significant hazards if the array is modified during iteration \(a
   particular problem for the global **env** array, as that is regenerated
   on almost any read\).

The authors propose that there be a new subcommand of **array** which allows
for efficient iteration over an array's elements.

# Proposed Change

There should be a new command, **array foreach**, that has this syntax:

 > **array** **foreach** _arrayName_ \{_keyVar_ _valueVar_\} _body_

This will iterate internally over the elements of the array called
_arrayName_ in array-iteration order \(i.e., the same as that used by the
other **array** subcommands\), setting the variable _keyVar_ to the name of
the element and the variable _valueVar_ to the content of the element before
evaluating the script _body_. The result will be the empty string \(excepting
errors, **return**, etc.\) and any contained **break** and **continue**
will have their normal interpretation as loop control operations.

# Implementation

Not yet...

# Copyright

This document has been placed in the public domain.

Name change from tip/422.tip to tip/422.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58

59
60
61
62
63
64
65
66
67
68
69
70
71
72

73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92

TIP:            422
Title:          Don't Use stdarg.h/va_list in Public API
Version:        $Revision: 1.1 $
Author:         Jan Nijtmans <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        02-Jan-2013
Post-History:   
Tcl-Version:    9.0
Keywords:	Tcl, API removal, varargs

~ Abstract

This TIP proposes to remove all functions which use the ''va_list'' type from
the public API, and it describes what extensions using this should do to make
their extension portable on the mingw-w64 gcc compiler on the AMD64 platform.

~ Rationale

The use of ''va_list'' in public API has the problem that different compilers
have a different implementation of the ''va_list'' structure. The implication
of this is that extensions which are compiled with mingw-w64 for the AMD64
platform, and call any of those functions will fail with a MSVC-compiled Tcl
core. The reverse fails as well. For a brief description about this problem,
see: http://www.bailopan.net/blog/?p=30.  See also an earlier discusion in the
Tcl Core mailing list: http://code.activestate.com/lists/tcl-core/10807/

~ Specification

This TIP proposes to remove the following 4 functions from the public API

 * Tcl_AppendResultVA

 * Tcl_AppendStringsToObjVA

 * Tcl_SetErrorCodeVA

 * Tcl_PanicVA

In addition, the inclusion of <stdarg.h> should move from tcl.h to tclInt.h,
as no public Tcl header uses it any more.

~ Compatibility

Extensions using any of those functions will not compile and run in Tcl 9.0
any more. They should be rewritten to use the same functions without the VA
parameter. This can be done as follows.

Before:

|int mypanic(const char *fmt, ...) {
|    va_list ap;
|    va_start(ap, fmt);
|    Tcl_PanicVA(fmt, ap);
|    va_end(ap);
|}

After:

|int mypanic(const char *fmt, ...) {
|    va_list ap;
|    char *arg1, *arg2, *arg3, *arg4;
|    va_start(ap, fmt);
|    arg1 = va_arg(argList, char *);
|    arg2 = va_arg(argList, char *);
|    arg3 = va_arg(argList, char *);
|    arg4 = va_arg(argList, char *);
|    va_end(ap);
|    Tcl_Panic(fmt, arg1, arg2, arg3, arg4);
|}

The number of args used (4, in this example) should be chosen to be the
maximum number of additional parameters that is used in any "mypanic" call.
Since this function is only ever called from the extensions itself, this can
be determined easily.

In addition, the extension must do its own inclusion of <stdarg.h>, as tcl.h
doesn't do that any more.

Extensions rewritten this way, will continue to compile and function with Tcl
8.x as well. I am not aware of any extension which actually calls any of those
VA functions.

~ Reference Implementation

A reference implementation is available in the '''novem-remove-va''' branch.
[https://core.tcl.tk/tcl/timeline?r=novem-remove-va]

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|
|

|
|

|

|

|

|

|

|

|
|
|
|
|
<
|
>

|
|
|
|
|
|
|
|
|
|
<
|
>
|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55

56
57
58
59
60
61
62
63
64
65
66
67
68
69

70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92

# TIP 422: Don't Use stdarg.h/va_list in Public API

	Author:         Jan Nijtmans <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        02-Jan-2013
	Post-History:   
	Tcl-Version:    9.0
	Keywords:	Tcl, API removal, varargs
-----

# Abstract

This TIP proposes to remove all functions which use the _va\_list_ type from
the public API, and it describes what extensions using this should do to make
their extension portable on the mingw-w64 gcc compiler on the AMD64 platform.

# Rationale

The use of _va\_list_ in public API has the problem that different compilers
have a different implementation of the _va\_list_ structure. The implication
of this is that extensions which are compiled with mingw-w64 for the AMD64
platform, and call any of those functions will fail with a MSVC-compiled Tcl
core. The reverse fails as well. For a brief description about this problem,
see: <http://www.bailopan.net/blog/?p=30.>  See also an earlier discusion in the
Tcl Core mailing list: <http://code.activestate.com/lists/tcl-core/10807/>

# Specification

This TIP proposes to remove the following 4 functions from the public API

 * Tcl\_AppendResultVA

 * Tcl\_AppendStringsToObjVA

 * Tcl\_SetErrorCodeVA

 * Tcl\_PanicVA

In addition, the inclusion of <stdarg.h> should move from tcl.h to tclInt.h,
as no public Tcl header uses it any more.

# Compatibility

Extensions using any of those functions will not compile and run in Tcl 9.0
any more. They should be rewritten to use the same functions without the VA
parameter. This can be done as follows.

Before:

	int mypanic(const char *fmt, ...) {
	    va_list ap;
	    va_start(ap, fmt);
	    Tcl_PanicVA(fmt, ap);
	    va_end(ap);

	}

After:

	int mypanic(const char *fmt, ...) {
	    va_list ap;
	    char *arg1, *arg2, *arg3, *arg4;
	    va_start(ap, fmt);
	    arg1 = va_arg(argList, char *);
	    arg2 = va_arg(argList, char *);
	    arg3 = va_arg(argList, char *);
	    arg4 = va_arg(argList, char *);
	    va_end(ap);
	    Tcl_Panic(fmt, arg1, arg2, arg3, arg4);

	}

The number of args used \(4, in this example\) should be chosen to be the
maximum number of additional parameters that is used in any "mypanic" call.
Since this function is only ever called from the extensions itself, this can
be determined easily.

In addition, the extension must do its own inclusion of <stdarg.h>, as tcl.h
doesn't do that any more.

Extensions rewritten this way, will continue to compile and function with Tcl
8.x as well. I am not aware of any extension which actually calls any of those
VA functions.

# Reference Implementation

A reference implementation is available in the **novem-remove-va** branch.
<https://core.tcl.tk/tcl/timeline?r=novem-remove-va> 

# Copyright

This document has been placed in the public domain.

Name change from tip/423.tip to tip/423.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40

TIP:		423
Title:		Formatting Timestamps with Milliseconds
Version:	$Revision: 1.1 $
Author:		Thomas Perschak <[email protected]>
State:		Draft
Type:		Project
Tcl-Version:	8.7
Vote:		Pending
Created:	07-Jun-2013
Post-History:
Keywords:	Tcl, time, millisecond resolution

~ Abstract

This TIP describes a change to '''clock format''' to allow it to handle
timestamps with sub-second accuracy.

~ Rationale

Currently, the '''clock format''' accepts only integer numbers for clock
formatting. Since the '''clock milliseconds''' command was introduced in Tcl
8.5, this limitation seems a bit restrictive.

In particular, the timestamp column in a number of databases (e.g.,
http://www.postgresql.org/docs/9.1/static/datatype-datetime.html) handles
high-resolution timestamps by allowing full ISO 8601 times, which look like
"04:05:06.789"; this would simplify database write operations.

~ Proposal

The '''clock format''' command should accept floating point values for
timestamps.

Another format letter should be added to '''clock format''' which puts the
milliseconds into the output string; the millisecond value should not be
written unless explicitly requested.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|
|

|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40

# TIP 423: Formatting Timestamps with Milliseconds

	Author:		Thomas Perschak <[email protected]>
	State:		Draft
	Type:		Project
	Tcl-Version:	8.7
	Vote:		Pending
	Created:	07-Jun-2013
	Post-History:
	Keywords:	Tcl, time, millisecond resolution
-----

# Abstract

This TIP describes a change to **clock format** to allow it to handle
timestamps with sub-second accuracy.

# Rationale

Currently, the **clock format** accepts only integer numbers for clock
formatting. Since the **clock milliseconds** command was introduced in Tcl
8.5, this limitation seems a bit restrictive.

In particular, the timestamp column in a number of databases \(e.g.,
<http://www.postgresql.org/docs/9.1/static/datatype-datetime.html\)> handles
high-resolution timestamps by allowing full ISO 8601 times, which look like
"04:05:06.789"; this would simplify database write operations.

# Proposal

The **clock format** command should accept floating point values for
timestamps.

Another format letter should be added to **clock format** which puts the
milliseconds into the output string; the millisecond value should not be
written unless explicitly requested.

# Copyright

This document has been placed in the public domain.

Name change from tip/424.tip to tip/424.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160

TIP:            424
Title:          Improving [exec]
Version:        $Revision: 1.8 $
Author:         Alexandre Ferrieux <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        07-Jul-2013
Post-History:   
Keywords:       Tcl,subprocess,execution
Tcl-Version:    8.7

~ Abstract

This extension overcomes day-1 limitations of [['''exec''']]'s syntax,
allowing for unconstrained arguments to commands, and opening the path to more
exotic redirections.

~ Summary Change

Replace:

|   exec foo bar baz > file

With:

|   exec | {foo bar baz} > file

~ Rationale

For decades people have rightfully complained about the stubborn limitation of
'''exec''' that prevents it from using commands or args resembling a
redirection. It's not just Quoting Hell; it is simply impossible to spawn the
equivalent of Bourne Shell's "echo \>" from pure Tcl (i.e., without resorting
to another shell).

The reason (excuse?) for this is an unfortunate design choice: stick as
closely as possible to the Bourne Shell's syntax, which indeed seamlessly
intertwines commands, arguments, and redirects. This is unfortunate, because
it overlooks a key difference between the two shells:

 * In Bourne Shell, since everything is about spawning commands, redirects are
   expected everywhere; hence their quoting is ubiquitous, and part of the
   language.

 * In Tcl, spawning processes is only a tiny part of the story. Consequently,
   redirect chars (<>|) are not special, and deserve no core-language quoting
   rules.

In this situation, it would have been possible to add an '''exec'''-specific
layer of quoting, just for these characters.  But as usual, the quoting char
itself (typically "'''\'''") would have itself needed quoting ("'''\\'''"),
which would have overburdened the backslash density of all but the simplest
pipelines...

More importantly, the realization that this was Really Wrong came fairly late
in Tcl's life; or at least late enough to consider any incompatible fix out of
the question.

So '''exec''' can be ''extended'', not ''fixed''.

A few such extensions have been suggested over the years, but none reached
critical mass. A possible interpretation of this is that they were considered
too "disruptive" - while necessary only for a corner case.

The current proposal addresses all the above concerns.  Here are its design
goals by decreasing importance:

 1. Current '''exec''''s unescapable warts should disappear

  > (Yeah, take care of that corner case.)

 2. Current '''exec''''s mapping to '''open |''' should be carried over

  > (This part of '''exec''''s design was Good)

 3. Simple pipelines should give easy-to-read lines (like current '''exec''')

  > (No disruption, Ma'am)

 4. Shell-ish advanced redirections like "'''3>&5'''" should be supported

  > (Not just the corner case: you get a free lunch too)

~ Definition

 * Extend '''exec''' "from its error space", by reserving a single pipe
   character passed as its first argument:

|       exec | ...    ;# activates the new syntax
|       open "|| ..." ;# same in [open]

 * Once the new syntax is unambiguously introduced, parse the rest as follows:

|       exec | $cmd1 {*}$redirs1 | ... | $cmdN {*}$redirsN ?&?
|       open "|[list | $cmd1 {*}$redirs1 | ... ]"

  > where:

  > * '''$cmd'''''K'' and '''$redirs'''''K'' are lists

  > * '''$cmd'''''K'' is a simple command-and-args, no extras

  > * '''$redirs'''''K'' is a list of current exec redirection operators

Examples:

|      exec | {echo >} ;# this returns ">"
|      exec | {cmd "<funny>xml</funny>"} 2>@ $ch < /dev/null | {cmd2 arg} >&2

Goals reached:

 1. Unescapable warts are gone because the '''$cmd''' vs '''$redir''' status
    is positional, not content-based: each command-and args is a separate
    sublist, with no in-band encoding of redirections.

 2. The above mapping is consistent with the existing '''open |[list foo
    bar]''' logic.  It respects the invariant saying, for '''open |''',
    that '''[string range $openarg 1 end]''' is always the list that would be
    passed, expanded, to '''exec'''. And it is handy to type
    '''open "|| {foo >} > file"'''

 3. Simple pipelines are simple.

  >  '''exec | $cmd1 | $cmd2 | $cmd3 > file'''

 4. Advanced redirections are imaginable since the redirection subsyntax
    now lives on its own. For example, with a putative "NUMBER>@" family
    of operators, one could define a nonlinear pipe graph:

|      lassign [chan pipe] pr pw
|      exec | {demuxer ...} 3>@ $pw | {filter ...} | {muxer ...} 3<@ $pr

    The definition of these advanced operators will be hosted by another TIP.

~ TL;DR

This very conservative syntax, in addition to preserving the overall style and
density of current '''exec''', overcomes all the limitations and reaches
Bourne Shell power.

Moreover, it leverages the existing internals, so a nearly free side-effect
is that it works with '''pid''' and '''close''' just like current '''exec'''
does.

~ Rejected Alternatives

 * Replace the leading "'''|'''" in '''exec |''' by '''--extended'''

 * Use a different toplevel command name.

  > '''exec2'''...

~ Reference Implementation

Branch "tip-improve-exec" on core.tcl.tk holds the implementation.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|
|

|

|

|

|

|
|

|

|
|
|
|
|

|

|
|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160

# TIP 424: Improving [exec]

	Author:         Alexandre Ferrieux <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        07-Jul-2013
	Post-History:   
	Keywords:       Tcl,subprocess,execution
	Tcl-Version:    8.7
-----

# Abstract

This extension overcomes day-1 limitations of [**exec**]'s syntax,
allowing for unconstrained arguments to commands, and opening the path to more
exotic redirections.

# Summary Change

Replace:

	   exec foo bar baz > file

With:

	   exec | {foo bar baz} > file

# Rationale

For decades people have rightfully complained about the stubborn limitation of
**exec** that prevents it from using commands or args resembling a
redirection. It's not just Quoting Hell; it is simply impossible to spawn the
equivalent of Bourne Shell's "echo \\>" from pure Tcl \(i.e., without resorting
to another shell\).

The reason \(excuse?\) for this is an unfortunate design choice: stick as
closely as possible to the Bourne Shell's syntax, which indeed seamlessly
intertwines commands, arguments, and redirects. This is unfortunate, because
it overlooks a key difference between the two shells:

 * In Bourne Shell, since everything is about spawning commands, redirects are
   expected everywhere; hence their quoting is ubiquitous, and part of the
   language.

 * In Tcl, spawning processes is only a tiny part of the story. Consequently,
   redirect chars \(<>\|\) are not special, and deserve no core-language quoting
   rules.

In this situation, it would have been possible to add an **exec**-specific
layer of quoting, just for these characters.  But as usual, the quoting char
itself \(typically "**\\**"\) would have itself needed quoting \("**\\\\**"\),
which would have overburdened the backslash density of all but the simplest
pipelines...

More importantly, the realization that this was Really Wrong came fairly late
in Tcl's life; or at least late enough to consider any incompatible fix out of
the question.

So **exec** can be _extended_, not _fixed_.

A few such extensions have been suggested over the years, but none reached
critical mass. A possible interpretation of this is that they were considered
too "disruptive" - while necessary only for a corner case.

The current proposal addresses all the above concerns.  Here are its design
goals by decreasing importance:

 1. Current **exec**'s unescapable warts should disappear

	  > \(Yeah, take care of that corner case.\)

 2. Current **exec**'s mapping to **open \|** should be carried over

	  > \(This part of **exec**'s design was Good\)

 3. Simple pipelines should give easy-to-read lines \(like current **exec**\)

	  > \(No disruption, Ma'am\)

 4. Shell-ish advanced redirections like "**3>&5**" should be supported

	  > \(Not just the corner case: you get a free lunch too\)

# Definition

 * Extend **exec** "from its error space", by reserving a single pipe
   character passed as its first argument:

		       exec | ...    ;# activates the new syntax
		       open "|| ..." ;# same in [open]

 * Once the new syntax is unambiguously introduced, parse the rest as follows:

		       exec | $cmd1 {*}$redirs1 | ... | $cmdN {*}$redirsN ?&?
		       open "|[list | $cmd1 {*}$redirs1 | ... ]"

	  > where:

	  > \* **$cmd**_K_ and **$redirs**_K_ are lists

	  > \* **$cmd**_K_ is a simple command-and-args, no extras

	  > \* **$redirs**_K_ is a list of current exec redirection operators

Examples:

	      exec | {echo >} ;# this returns ">"
	      exec | {cmd "<funny>xml</funny>"} 2>@ $ch < /dev/null | {cmd2 arg} >&2

Goals reached:

 1. Unescapable warts are gone because the **$cmd** vs **$redir** status
    is positional, not content-based: each command-and args is a separate
    sublist, with no in-band encoding of redirections.

 2. The above mapping is consistent with the existing **open \|[list foo
    bar]** logic.  It respects the invariant saying, for **open \|**,
    that **[string range $openarg 1 end]** is always the list that would be
    passed, expanded, to **exec**. And it is handy to type
    **open "\|\| \{foo >\} > file"**

 3. Simple pipelines are simple.

	  >  **exec \| $cmd1 \| $cmd2 \| $cmd3 > file**

 4. Advanced redirections are imaginable since the redirection subsyntax
    now lives on its own. For example, with a putative "NUMBER>@" family
    of operators, one could define a nonlinear pipe graph:

		      lassign [chan pipe] pr pw
		      exec | {demuxer ...} 3>@ $pw | {filter ...} | {muxer ...} 3<@ $pr

    The definition of these advanced operators will be hosted by another TIP.

# TL;DR

This very conservative syntax, in addition to preserving the overall style and
density of current **exec**, overcomes all the limitations and reaches
Bourne Shell power.

Moreover, it leverages the existing internals, so a nearly free side-effect
is that it works with **pid** and **close** just like current **exec**
does.

# Rejected Alternatives

 * Replace the leading "**\|**" in **exec \|** by **--extended**

 * Use a different toplevel command name.

	  > **exec2**...

# Reference Implementation

Branch "tip-improve-exec" on core.tcl.tk holds the implementation.

# Copyright

This document has been placed in the public domain.

Name change from tip/425.tip to tip/425.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104

TIP:            425
Title:          Internationalization of Default Panic Callback on Windows
Version:        $Revision: 1.6 $
Author:         Jan Nijtmans <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        17-Jul-2013
Post-History:   
Keywords:       Tcl,platform integration,i18n
Tcl-Version:    8.7

~ Abstract

The default panic proc on Windows console applications writes the
message in UTF-8 to stderr. Unfortunately, the Windows console
normally does not have UTF-8 as code page but some single-byte
code page like CP1252. When using characters outside the ASCII
range, that does not give the expected output in the console.
This TIP proposes to add a new Console panic proc to the stub
library, and modify the Tcl_Main() macro to use it.

~ Rationale

Many parts of Tcl are initernationalized in Tcl 8.6: The command
line handling, and the communication with all Win32 API functions.
But the Panic proc has - so far - not been modified accordingly
for Windows console applications, even though win32 has a
suitable API to do so.

On Windows, there actually are two different panic procs,
one for GUI applications and one for console applications, but
external embedders don't have an API for deciding which one
should be used other than provide their own. This TIP can
finally do that: The call
''Tcl_SetPanicProc(Tcl_ConsolePanic)'' will initialize the
Tcl subsystem for Console applications, while
''Tcl_SetPanicProc(NULL)'' will continue to use the default.

Making things worse, stderr is implemented by the C runtime,
(msvcrt??.dll) but if a application is embedding or dynamically
loading tcl.dll then the runtime of the embedder might be
different from tcl.dll/tclsh.exe's runtime. The embedder
providing the panic proc gives the highest chance that panic
messages arrive in the same runtime as the embedder.
For tclsh.exe this makes no difference.

~ Proposed Change

A new function ''Tcl_ConsolePanic'' is added to the stub library
on Windows and Cygwin, which can be installed by embedding
application as panic proc. The full signature is:

 > EXTERN void
   '''Tcl_ConsolePanic'''(
       const char *''format''
       ...);

On other platforms than Windows or Cygwin, ''Tcl_ConsolePanic''
is a macro equivalent to NULL, on those platforms
Tcl_SetPanicProc(Tcl_ConsolePanic) has the effect of resetting
the panic proc to the platform's default.

This function is meant to be used for Win32 or Cygwin console
applications, and can deliver the message in 3 possible ways

* If a (Windows) debugger is running, the message is sent there.

* If stderr is connected to a Windows console, the message is
sent there (Windows only).

* Otherwise, the UTF-8 BOM (3 bytes) is written followed by
the unmodified message (assumed to be in UTF-8).

The function ''Tcl_ConsolePanic'' does not assume any locale,
does not allocate memory, neither does it make any assumptions
on the initialized state of Tcl. This makes Tcl_Panic work fine
even in the final stage of a Tcl_Finalize() call.
If a Win32 Unicode API is available for the desired output,
''Tcl_ConsolePanic'' will do at most an UTF-8 to Unicode
conversion using the Win32 function MultiByteToWideChar().

The maximum number of (unicode) characters that is
written is 26000, as that is the maximum that
WriteConsoleW() can handle in a single call. See: 
[https://connect.microsoft.com/VisualStudio/feedback/details/635230]
If the message is longer than that, the string is
truncated and three dots appended to it. If
the message is sent to a character device, the
UTF-8 BOM is prepended.

The function is available from the stub library, in
order to bring the responsibility for correct linking
to the embedding application, in stead of Tcl. In
case of tclsh.exe, this makes no difference.

~ Reference Implementation

A reference implementation is available in the '''win-console-panic''' branch.
[https://core.tcl.tk/tcl/info/00a17823f0]

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|
|
|

|

|

|

|

|
|

|

|
|

|
|

|

|
|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104

# TIP 425: Internationalization of Default Panic Callback on Windows

	Author:         Jan Nijtmans <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        17-Jul-2013
	Post-History:   
	Keywords:       Tcl,platform integration,i18n
	Tcl-Version:    8.7
-----

# Abstract

The default panic proc on Windows console applications writes the
message in UTF-8 to stderr. Unfortunately, the Windows console
normally does not have UTF-8 as code page but some single-byte
code page like CP1252. When using characters outside the ASCII
range, that does not give the expected output in the console.
This TIP proposes to add a new Console panic proc to the stub
library, and modify the Tcl\_Main\(\) macro to use it.

# Rationale

Many parts of Tcl are initernationalized in Tcl 8.6: The command
line handling, and the communication with all Win32 API functions.
But the Panic proc has - so far - not been modified accordingly
for Windows console applications, even though win32 has a
suitable API to do so.

On Windows, there actually are two different panic procs,
one for GUI applications and one for console applications, but
external embedders don't have an API for deciding which one
should be used other than provide their own. This TIP can
finally do that: The call
_Tcl\_SetPanicProc\(Tcl\_ConsolePanic\)_ will initialize the
Tcl subsystem for Console applications, while
_Tcl\_SetPanicProc\(NULL\)_ will continue to use the default.

Making things worse, stderr is implemented by the C runtime,
\(msvcrt??.dll\) but if a application is embedding or dynamically
loading tcl.dll then the runtime of the embedder might be
different from tcl.dll/tclsh.exe's runtime. The embedder
providing the panic proc gives the highest chance that panic
messages arrive in the same runtime as the embedder.
For tclsh.exe this makes no difference.

# Proposed Change

A new function _Tcl\_ConsolePanic_ is added to the stub library
on Windows and Cygwin, which can be installed by embedding
application as panic proc. The full signature is:

 > EXTERN void
   **Tcl\_ConsolePanic**\(
       const char \*_format_
       ...\);

On other platforms than Windows or Cygwin, _Tcl\_ConsolePanic_
is a macro equivalent to NULL, on those platforms
Tcl\_SetPanicProc\(Tcl\_ConsolePanic\) has the effect of resetting
the panic proc to the platform's default.

This function is meant to be used for Win32 or Cygwin console
applications, and can deliver the message in 3 possible ways

* If a \(Windows\) debugger is running, the message is sent there.

* If stderr is connected to a Windows console, the message is
sent there \(Windows only\).

* Otherwise, the UTF-8 BOM \(3 bytes\) is written followed by
the unmodified message \(assumed to be in UTF-8\).

The function _Tcl\_ConsolePanic_ does not assume any locale,
does not allocate memory, neither does it make any assumptions
on the initialized state of Tcl. This makes Tcl\_Panic work fine
even in the final stage of a Tcl\_Finalize\(\) call.
If a Win32 Unicode API is available for the desired output,
_Tcl\_ConsolePanic_ will do at most an UTF-8 to Unicode
conversion using the Win32 function MultiByteToWideChar\(\).

The maximum number of \(unicode\) characters that is
written is 26000, as that is the maximum that
WriteConsoleW\(\) can handle in a single call. See: 
<https://connect.microsoft.com/VisualStudio/feedback/details/635230> 
If the message is longer than that, the string is
truncated and three dots appended to it. If
the message is sent to a character device, the
UTF-8 BOM is prepended.

The function is available from the stub library, in
order to bring the responsibility for correct linking
to the embedding application, in stead of Tcl. In
case of tclsh.exe, this makes no difference.

# Reference Implementation

A reference implementation is available in the **win-console-panic** branch.
<https://core.tcl.tk/tcl/info/00a17823f0> 

# Copyright

This document has been placed in the public domain.

Name change from tip/426.tip to tip/426.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106

TIP:		426
Title:		Determining the "Type" of Commands
State:		Draft
Type:		Project
Tcl-Version:	8.7
Vote:		Pending
Post-History:	
Version:	$Revision: 1.2 $
Author:		Donal K. Fellows <[email protected]>
Created:	31-Jul-2013
Keywords:	introspection, commands, Tcl, Tk

~ Abstract

This TIP describes a mechanism for determining what "type" of command
a particular command is. This can be used as a prelude to performing
other kinds of introspection, such as using '''info body''',
'''namespace origin''' or '''interp alias'''.

~ Rationale

Currently, in order to find out information about an arbitrary command
you have to apply a suitable introspection command and deal with any
errors arising in order to tell that you had a command of some other
type. It was made clear to me at EuroTcl 2013 that this was inelegant,
especially since we in principle had the information available to do
something neater.

The information in question is the pointer to the implementation
function, that's stored in the C record describing the command. All
that is needed is a way to surface that information to Tcl as a new
introspection command.

~ Proposed Change

This new introspection interface shall consist of one new subcommand
of '''info''' and a pair of new public C functions.

 > '''info cmdtype''' ''commandName''

The new '''info''' subcommand is to be called '''cmdtype''' (the name
is chosen so as to not conflict with abbreviations '''info commands'''
even though it does conflict with '''info cmdcount''') and it takes a
single argument, ''commandName'', which must be the name of an
existing Tcl command. The result of this subcommand shall be a string
describing what sort of command ''commandName'' is; if no other
information is available, the result shall be '''native'''.

NB: The Tcl implementation will not make any guarantees of the command
type for any particular command supplied by default in any
interpreter. User code should never assume that just because a command
is implemented one way in one particular version that it will continue
to be implemented that way in any future version.

~~ Supporting C API

The supporting public C functions shall be:

 > void '''Tcl_RegisterCommandTypeName'''(Tcl_ObjCmdProc
   *''implementationProc'', const char *''nameStr'')

 > const char * '''Tcl_GetCommandTypeName'''(Tcl_Command ''command'')

'''Tcl_RegisterCommandTypeName''' will associate a particular
implementation function, ''implementationProc'', with an (assumed
literal constant) string, ''nameStr''; if ''nameStr'' is supplied as
NULL, the mapping for ''implementationProc'' will be removed. The
''implementationProc'' argument must not be NULL. The use of a package
prefix within the name is ''recommended''.

'''Tcl_GetCommandTypeName''' shall take a command handle, ''command'',
and return the registered type name string (as previously passed to
Tcl_RegisterCommandTypeName) for the command implementation function
that the ''command'' is using. If there is no type name registered for
the command's implementation function, the literal string '''native'''
will be returned instead. The result will never be NULL.

~~ Predefined Command Types

The following command types are guaranteed to be among the set defined
by default, but others may be done as well.

 proc: Procedures defined by the '''proc''' command.

 alias: Aliases defined by the '''interp alias''' command.

 ensemble: Ensembles defined by the '''namespace ensemble''' command.

 import: Commands imported by the '''namespace import''' command.

 object: Object (or class) defined by instantiating any TclOO class.

~~ Impact on Tk

It is anticipated that Tk widget instances will report themselves
through this mechanism as well, with a prefix to their names of
'''tk::'''; that prefix ''should not'' be used by any other package.
Note however that not all widget types will be distinguishable; this
is part of the way that Tk is implemented.

The built-in widget creation functions ''may'' declare themselves to
be of type '''tk::widgetFactory'''.

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
|
>

|

|
|

|

|

|

|

|
|
|
|

|
|

|

|
|

|

|
|
|
|
|
|

|
|
|
|
|

|

|

|

|

|

|

|

|

|
|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106

# TIP 426: Determining the "Type" of Commands
	State:		Draft
	Type:		Project
	Tcl-Version:	8.7
	Vote:		Pending
	Post-History:	

	Author:		Donal K. Fellows <[email protected]>
	Created:	31-Jul-2013
	Keywords:	introspection, commands, Tcl, Tk
-----

# Abstract

This TIP describes a mechanism for determining what "type" of command
a particular command is. This can be used as a prelude to performing
other kinds of introspection, such as using **info body**,
**namespace origin** or **interp alias**.

# Rationale

Currently, in order to find out information about an arbitrary command
you have to apply a suitable introspection command and deal with any
errors arising in order to tell that you had a command of some other
type. It was made clear to me at EuroTcl 2013 that this was inelegant,
especially since we in principle had the information available to do
something neater.

The information in question is the pointer to the implementation
function, that's stored in the C record describing the command. All
that is needed is a way to surface that information to Tcl as a new
introspection command.

# Proposed Change

This new introspection interface shall consist of one new subcommand
of **info** and a pair of new public C functions.

 > **info cmdtype** _commandName_

The new **info** subcommand is to be called **cmdtype** \(the name
is chosen so as to not conflict with abbreviations **info commands**
even though it does conflict with **info cmdcount**\) and it takes a
single argument, _commandName_, which must be the name of an
existing Tcl command. The result of this subcommand shall be a string
describing what sort of command _commandName_ is; if no other
information is available, the result shall be **native**.

NB: The Tcl implementation will not make any guarantees of the command
type for any particular command supplied by default in any
interpreter. User code should never assume that just because a command
is implemented one way in one particular version that it will continue
to be implemented that way in any future version.

## Supporting C API

The supporting public C functions shall be:

 > void **Tcl\_RegisterCommandTypeName**\(Tcl\_ObjCmdProc
   *_implementationProc_, const char \*_nameStr_\)

	 > const char \* **Tcl\_GetCommandTypeName**\(Tcl\_Command _command_\)

**Tcl\_RegisterCommandTypeName** will associate a particular
implementation function, _implementationProc_, with an \(assumed
literal constant\) string, _nameStr_; if _nameStr_ is supplied as
NULL, the mapping for _implementationProc_ will be removed. The
_implementationProc_ argument must not be NULL. The use of a package
prefix within the name is _recommended_.

**Tcl\_GetCommandTypeName** shall take a command handle, _command_,
and return the registered type name string \(as previously passed to
Tcl\_RegisterCommandTypeName\) for the command implementation function
that the _command_ is using. If there is no type name registered for
the command's implementation function, the literal string **native**
will be returned instead. The result will never be NULL.

## Predefined Command Types

The following command types are guaranteed to be among the set defined
by default, but others may be done as well.

 proc: Procedures defined by the **proc** command.

 alias: Aliases defined by the **interp alias** command.

 ensemble: Ensembles defined by the **namespace ensemble** command.

 import: Commands imported by the **namespace import** command.

 object: Object \(or class\) defined by instantiating any TclOO class.

## Impact on Tk

It is anticipated that Tk widget instances will report themselves
through this mechanism as well, with a prefix to their names of
**tk::**; that prefix _should not_ be used by any other package.
Note however that not all widget types will be distinguishable; this
is part of the way that Tk is implemented.

The built-in widget creation functions _may_ declare themselves to
be of type **tk::widgetFactory**.

# Copyright

This document has been placed in the public domain.

Name change from tip/427.tip to tip/427.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162

TIP:            427
Title:          Introspection of Asynchronous Socket Connection
Version:        $Revision: 1.13 $
Author:         Reinhard Max <[email protected]>
Author:         Harald Oehlmann <[email protected]>
Author:         Reinhard Max <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        16-Mar-2014
Post-History:   
Keywords:       async socket connect,introspection,IPV6
Tcl-Version:    8.6.4

~ Abstract

This TIP describes a method to introspect the asynchronous connection
process by an extension of the '''fconfigure''' interface in addition
to '''fconfigure -error'''. This will enable better control over the
asynchronous connection process, even in cases where the event loop is
not in use.

~ Rationale

The '''socket''' core command supports two ways to establish a
client socket, ''synchronous'' and ''asynchronous''. In synchronous
mode (which is the default) the command does not return until the
connection attempt has completed (established or failed).

In asynchronous mode ('''-async option''') the command returns after DNS lookup and the connection is established in the background.
This is useful in situations where it is undesirable that a
process or thread blocks for completing a synchronous connection
attempt.
Classically, an asyncronously connecting socket would indicate that it
had connected (or failed to connect) by becoming writeable, which
'''fileevent writable''' can be used to detect.

A DNS name may have multiple IP addresses associated, e.g. for
IPv4/IPv6 dual stack hosts or for fail safety or load balancing
reasons as it is the case for google.com as of this writing.

In Tcl 8.5 the socket command only tried to connect to a single IPv4
address that was randomly picked from the list returned by DNS. In Tcl
8.6, the socket command tries to connect to all the IP addresses of a
DNS name in turn until one succeeds or all have failed.

This caused the following changes to the '''socket -async''' command from Tcl 8.5 to 8.6:

   * The socket introspection options to '''fconfigure''' (i.e.,
     '''-error''', '''-sockname''' and '''-peername''') can change
     between successive invocations while the connection is in
     progress as they reflect the state of the internal loop
     over the IP addresses.

   * The event loop must run in order to allow looping over the
     various possible IP addresses of a host.

The usage of '''socket -async''' is seen as helpful even without the event loop. An example is an application, which checks a list of
hosts for a connection.  The application may start many background
socket connects, do something else, and then collect the results.  Without the event loop (i.e., a '''fileevent writable'''), there is
no non-blocking way to discover if the asynchronous connect has
completed.

In addition, the following future points may be considered:

   * The connection process may internally get delegated to its own
     thread; this would allow the connection process to be
     asynchronous without requiring the event loop.

   * A future Windows implementation may use the Vista+ API
     ''WSAConnectByList'' (once we do not support Windows XP any
     more). Using this, no own looping over the addresses is
     necessary.  It allows the connection process to be a single OS
     call, but does not allow inspection of the different connection
     steps.

~ Proposed Change

~~ Current Introspection Change

The introspection functions should act as follows during an
asynchronous connection process:

   * '''fconfigure -error''' will return the empty string (no error)

   * '''fconfigure -sockname''' will return the empty string

   * '''fconfigure -peername''' will return the empty string

~~ Introspection Command to Inspect a Running Asynchronous Connect

An additional introspection function should inform if the asynchronous
connect is running or if it has terminated:

 > '''fconfigure''' ''channel'' '''-connecting'''

This option returns '''1''' as long as a socket is still in the process of connecting asynchronously and 0 when the asynchronous connection has completed (succeeded or failed) or the socket was opened synchronously.

~~ Non-Event Loop Operation

If the event loop runs, the state machine of a (possibly multiple-address try) async connection proceeds within an internal callback.

In addition to that (for the case the event loop does currently not run), it proceeds whenever a channel operation is attempted on the socket_

   * nonblocking I/O and [fconfigure] will advance it by one step (i.e. do whatever is doable without waiting).

   * blocking I/O will advance it to completion, in essence meaning "switch back to synchronous connect, and in case of success, do the blocking I/O I asked for".

~ Use Case of the Connecting Option

My own use case for the proposed option '''-connecting''' is as follows.
A TCL script is started within Rivet to do two tasks:

 * Verify many URLs for existance (e.g. open a socket and do a head request).

 * Do some data base work.

I don't use the event loop to get a linear program with controlled order.

So the program flow is as follows:

 * Open async sockets for all links to verify:

 > ''foreach host $hosts; set h($host) [socket -async $host 23]''

 * Do some data base work which takes time.

 * Check all open sockets and eventually advance the state machine by calling
   '''-connecting''':

 > ''foreach host $hosts; if {[fconfigure $h($host) -connecting]} {Connected $host}''

 * Do some more data base work which takes time.

 * Check '''-connecting''' and react until there are no open sockets any more.

I have cut the data base processing into two parts; I assume there
are normally two IP's to try, one IPV6 and one IPV4.

~ Alternatives

Two alternatives for the behavior of '''fconfigure -sockname''' and
'''fconfigure -peername''' are:

   * If there is only one destination IP address, return this before
     the connection is really established. This may also happen if the
     socket has processed its ''bind()'' system call in such a way
     that it knows it will not attempt another, and is currently
     processing its ''connect()'' to the remote host.

   * If an asynchronous connect is running, raise an error.

These are open to discussion.

~ Implementation

The fossil branch ''tip-427'' contains an implementation of
these extensions.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|
|

|

|
|
|
|

|

|
|

|

|
|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162

# TIP 427: Introspection of Asynchronous Socket Connection

	Author:         Reinhard Max <[email protected]>
	Author:         Harald Oehlmann <[email protected]>
	Author:         Reinhard Max <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        16-Mar-2014
	Post-History:   
	Keywords:       async socket connect,introspection,IPV6
	Tcl-Version:    8.6.4
-----

# Abstract

This TIP describes a method to introspect the asynchronous connection
process by an extension of the **fconfigure** interface in addition
to **fconfigure -error**. This will enable better control over the
asynchronous connection process, even in cases where the event loop is
not in use.

# Rationale

The **socket** core command supports two ways to establish a
client socket, _synchronous_ and _asynchronous_. In synchronous
mode \(which is the default\) the command does not return until the
connection attempt has completed \(established or failed\).

In asynchronous mode \(**-async option**\) the command returns after DNS lookup and the connection is established in the background.
This is useful in situations where it is undesirable that a
process or thread blocks for completing a synchronous connection
attempt.
Classically, an asyncronously connecting socket would indicate that it
had connected \(or failed to connect\) by becoming writeable, which
**fileevent writable** can be used to detect.

A DNS name may have multiple IP addresses associated, e.g. for
IPv4/IPv6 dual stack hosts or for fail safety or load balancing
reasons as it is the case for google.com as of this writing.

In Tcl 8.5 the socket command only tried to connect to a single IPv4
address that was randomly picked from the list returned by DNS. In Tcl
8.6, the socket command tries to connect to all the IP addresses of a
DNS name in turn until one succeeds or all have failed.

This caused the following changes to the **socket -async** command from Tcl 8.5 to 8.6:

   * The socket introspection options to **fconfigure** \(i.e.,
     **-error**, **-sockname** and **-peername**\) can change
     between successive invocations while the connection is in
     progress as they reflect the state of the internal loop
     over the IP addresses.

   * The event loop must run in order to allow looping over the
     various possible IP addresses of a host.

The usage of **socket -async** is seen as helpful even without the event loop. An example is an application, which checks a list of
hosts for a connection.  The application may start many background
socket connects, do something else, and then collect the results.  Without the event loop \(i.e., a **fileevent writable**\), there is
no non-blocking way to discover if the asynchronous connect has
completed.

In addition, the following future points may be considered:

   * The connection process may internally get delegated to its own
     thread; this would allow the connection process to be
     asynchronous without requiring the event loop.

   * A future Windows implementation may use the Vista\+ API
     _WSAConnectByList_ \(once we do not support Windows XP any
     more\). Using this, no own looping over the addresses is
     necessary.  It allows the connection process to be a single OS
     call, but does not allow inspection of the different connection
     steps.

# Proposed Change

## Current Introspection Change

The introspection functions should act as follows during an
asynchronous connection process:

   * **fconfigure -error** will return the empty string \(no error\)

   * **fconfigure -sockname** will return the empty string

   * **fconfigure -peername** will return the empty string

## Introspection Command to Inspect a Running Asynchronous Connect

An additional introspection function should inform if the asynchronous
connect is running or if it has terminated:

 > **fconfigure** _channel_ **-connecting**

This option returns **1** as long as a socket is still in the process of connecting asynchronously and 0 when the asynchronous connection has completed \(succeeded or failed\) or the socket was opened synchronously.

## Non-Event Loop Operation

If the event loop runs, the state machine of a \(possibly multiple-address try\) async connection proceeds within an internal callback.

In addition to that \(for the case the event loop does currently not run\), it proceeds whenever a channel operation is attempted on the socket\_

   * nonblocking I/O and [fconfigure] will advance it by one step \(i.e. do whatever is doable without waiting\).

   * blocking I/O will advance it to completion, in essence meaning "switch back to synchronous connect, and in case of success, do the blocking I/O I asked for".

# Use Case of the Connecting Option

My own use case for the proposed option **-connecting** is as follows.
A TCL script is started within Rivet to do two tasks:

 * Verify many URLs for existance \(e.g. open a socket and do a head request\).

 * Do some data base work.

I don't use the event loop to get a linear program with controlled order.

So the program flow is as follows:

 * Open async sockets for all links to verify:

	 > _foreach host $hosts; set h\($host\) [socket -async $host 23]_

 * Do some data base work which takes time.

 * Check all open sockets and eventually advance the state machine by calling
   **-connecting**:

	 > _foreach host $hosts; if \{[fconfigure $h($host) -connecting]\} \{Connected $host\}_

 * Do some more data base work which takes time.

 * Check **-connecting** and react until there are no open sockets any more.

I have cut the data base processing into two parts; I assume there
are normally two IP's to try, one IPV6 and one IPV4.

# Alternatives

Two alternatives for the behavior of **fconfigure -sockname** and
**fconfigure -peername** are:

   * If there is only one destination IP address, return this before
     the connection is really established. This may also happen if the
     socket has processed its _bind\(\)_ system call in such a way
     that it knows it will not attempt another, and is currently
     processing its _connect\(\)_ to the remote host.

   * If an asynchronous connect is running, raise an error.

These are open to discussion.

# Implementation

The fossil branch _tip-427_ contains an implementation of
these extensions.

# Copyright

This document has been placed in the public domain.

Name change from tip/428.tip to tip/428.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32

33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79

80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96

TIP:            428
Title:          Produce Error Dictionary from 'fconfigure -error'
Version:        $Revision: 1.26 $
Author:         Harald Oehlmann <[email protected]>
Author:         Harald Oehlmann <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        16-Mar-2014
Post-History:   
Keywords:       socket,non-blocking,error reporting,option dictionary
Tcl-Version:    8.7

~ Abstract

This TIP proposes a new method which allows to return the error message and the error code of a background socket error (as reported by '''fconfigure -error'''), similar to the option dictionaries produced by catch and try and consumed by return.

~ Rationale

The error message of a background channel error may be retrieved and
cleared by '''fconfigure''' ''channel'' '''-error''', but there is no access to the error code.

In addition, the error may not be handled in the TCL-like way using '''catch''' or '''try''' (or just let fail the program).

Specially the new '''try''' syntax (see example in the man page) is well suited to handle socket errors.
Example:

|try {set h [socket $host $port]}\
|trap {POSIX ECONNREFUSED} {} {
|    # handle not open port
|}

Drivers mostly use POSIX errors to report issues where the error code is more portable than the error message (AFAIK).

To handle an error by '''try''', the error must be thrown.

We are limited to an option to the command '''fconfigure''', as this is implemented within the driver interface.

Throwing the error would change the semantics of '''fconfigure''' and thus should not happen (consensus on the core list).
Instead, the new '''fconfigure''' operation should return the error message and the error code.

To finally throw the error, an utility function ('''chan throwerror $h''') may be defined in TCL.
This is not part of this TIP.

~ Proposed Change

The option '''fconfigure channel -error''' should be extended to take an optional argument as follows:

 > '''fconfigure''' ''channel'' '''-error''' ''?errorDictVar?''

If the optional argument ''errorDictVar'' is given, the following dict is written in the named variable of the caller environment:

   *   if there is no error, it should be set to '''-code 0'''

   *   if there is an error, it should be set to '''-code 1 -errorcode ''' ''errorCode''

This is executed in addition to the standard action of '''fconfigure''' ''channel'' '''-error'''.

~ Example

Usage example with failing async connect:

|% set h [connect -async localhost 30001]
|d00000af
|% fileevent $h writable {set x 1}
|% vwait x
|% fconfigure $h -error errorDict
|connection refused
|% set errorDict
|-code 1 -errorcode {POSIX ECONNREFUSED {connection refused}}
|% close $h

The following example demonstrates the implementation of '''chan throwerror''' eg to throw the error from the provided dict.

|proc throwerror {h} {
|    set errorMessage [fconfigure $h -errorDict]
|    return -options $errorDict $errorMessage
|}

~ Alternatives

   *   Revision 1.11 of this TIP proposed to really throw the error.

   *   Revision 1.21 of this TIP proposed a new option to return an error dict directly.

~ Implementation

The tip is implemented in fossil branch '''tip-428'''.

~ Remarks

The idea of this semantics and a feasability study is from Reinhard Max.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|
|
|
<
|
>
|

|

|

|
|

|

|

|

|

|

|

|

|

|

|
|
|
|
|
|
|
|
|

|

|
|
|
<
|
>
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29

30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76

77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96

# TIP 428: Produce Error Dictionary from 'fconfigure -error'

	Author:         Harald Oehlmann <[email protected]>
	Author:         Harald Oehlmann <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        16-Mar-2014
	Post-History:   
	Keywords:       socket,non-blocking,error reporting,option dictionary
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes a new method which allows to return the error message and the error code of a background socket error \(as reported by **fconfigure -error**\), similar to the option dictionaries produced by catch and try and consumed by return.

# Rationale

The error message of a background channel error may be retrieved and
cleared by **fconfigure** _channel_ **-error**, but there is no access to the error code.

In addition, the error may not be handled in the TCL-like way using **catch** or **try** \(or just let fail the program\).

Specially the new **try** syntax \(see example in the man page\) is well suited to handle socket errors.
Example:

	try {set h [socket $host $port]}\
	trap {POSIX ECONNREFUSED} {} {
	    # handle not open port

	}

Drivers mostly use POSIX errors to report issues where the error code is more portable than the error message \(AFAIK\).

To handle an error by **try**, the error must be thrown.

We are limited to an option to the command **fconfigure**, as this is implemented within the driver interface.

Throwing the error would change the semantics of **fconfigure** and thus should not happen \(consensus on the core list\).
Instead, the new **fconfigure** operation should return the error message and the error code.

To finally throw the error, an utility function \(**chan throwerror $h**\) may be defined in TCL.
This is not part of this TIP.

# Proposed Change

The option **fconfigure channel -error** should be extended to take an optional argument as follows:

 > **fconfigure** _channel_ **-error** _?errorDictVar?_

If the optional argument _errorDictVar_ is given, the following dict is written in the named variable of the caller environment:

   *   if there is no error, it should be set to **-code 0**

   *   if there is an error, it should be set to **-code 1 -errorcode ** _errorCode_

This is executed in addition to the standard action of **fconfigure** _channel_ **-error**.

# Example

Usage example with failing async connect:

	% set h [connect -async localhost 30001]
	d00000af
	% fileevent $h writable {set x 1}
	% vwait x
	% fconfigure $h -error errorDict
	connection refused
	% set errorDict
	-code 1 -errorcode {POSIX ECONNREFUSED {connection refused}}
	% close $h

The following example demonstrates the implementation of **chan throwerror** eg to throw the error from the provided dict.

	proc throwerror {h} {
	    set errorMessage [fconfigure $h -errorDict]
	    return -options $errorDict $errorMessage

	}

# Alternatives

   *   Revision 1.11 of this TIP proposed to really throw the error.

   *   Revision 1.21 of this TIP proposed a new option to return an error dict directly.

# Implementation

The tip is implemented in fossil branch **tip-428**.

# Remarks

The idea of this semantics and a feasability study is from Reinhard Max.

# Copyright

This document has been placed in the public domain.

Name change from tip/429.tip to tip/429.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67

TIP:            429
Title:          A 'string' Subcommand for Concatenation
Version:        $Revision: 1.6 $
Author:         Andreas Leitgeb <[email protected]>
Author:         Alexandre Ferrieux <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        27-Jul-2014
Post-History:   
Keywords:       Tcl,cat,scriptlet result
Tcl-Version:    8.6.2

~ Abstract

This TIP describes a new (sub)command '''string cat''' to concatenate an
arbitrary number of strings.

~ Rationale

Tcl has string concatenation built-in. But that is lacking in two specific
cases:

   * one cannot directly concat a braced string with anything else

   * scriptlets such as used for '''lmap''' are expected to contain commands,
     the last one of which returns a value. To have the scriptlet return a
     concatenated string or even just a single string literal, one currently
     needs to misuse some corner-case of a non-trivial command, like '''return
     -level 0 $x$y''' or '''string map {} "$x$y"''' just to have the scriptlet
     produce the string as its result.

~ Proposal

I propose a new subcommand '''string cat''', that will take an arbitrary
number of arguments (i.e., 0 or more), and concatenate them into a single
string that becomes the result of the command.

It would be equivalent to creating a '''list''' of the separate arguments and
use '''join''' on that list with an empty string as second argument.

Compiling that new command to bytecode should be trivial, as concatenation of
strings is already compileable. The added value would be allowing braced
string literals to be involved, and promoting the resulting stack-item to the
result of the command/scriptlet. (This simple compileability is also meant to
be a main advantage over '''join [[list ...]] ""''', where the contents of the
intermediate list are either a single word or many words, or '''lindex [[list
...]] 0''' where the contents of the intermediate list are a single word.)

The following equality will hold for any arbitrary contents of the variables
'''a''' and '''b''':

| string equals $a$b [string cat $a $b]

~ Rejected Alternatives

Lars has mailed on tclcore that TclX has a command '''cconcat''' that does essentially what my proposed '''string cat''' is supposed to do (not sure though whether that is compiled). This proposal sticks to the '''cat''' subcommand, as that is generally the preferred way over new toplevel commands.

Also, '''string concat''' is added to this section, for it is a bit longer than '''string cat''', and (as Lars put it) '''string cat''' is less likely to be misinterpreted as "concat, just moved into the string ensemble."

~ Reference implementation

Available as branch tip-429 on core.tcl.tk.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|

|
|

|
|

|
|
|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67

# TIP 429: A 'string' Subcommand for Concatenation

	Author:         Andreas Leitgeb <[email protected]>
	Author:         Alexandre Ferrieux <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        27-Jul-2014
	Post-History:   
	Keywords:       Tcl,cat,scriptlet result
	Tcl-Version:    8.6.2
-----

# Abstract

This TIP describes a new \(sub\)command **string cat** to concatenate an
arbitrary number of strings.

# Rationale

Tcl has string concatenation built-in. But that is lacking in two specific
cases:

   * one cannot directly concat a braced string with anything else

   * scriptlets such as used for **lmap** are expected to contain commands,
     the last one of which returns a value. To have the scriptlet return a
     concatenated string or even just a single string literal, one currently
     needs to misuse some corner-case of a non-trivial command, like **return
     -level 0 $x$y** or **string map \{\} "$x$y"** just to have the scriptlet
     produce the string as its result.

# Proposal

I propose a new subcommand **string cat**, that will take an arbitrary
number of arguments \(i.e., 0 or more\), and concatenate them into a single
string that becomes the result of the command.

It would be equivalent to creating a **list** of the separate arguments and
use **join** on that list with an empty string as second argument.

Compiling that new command to bytecode should be trivial, as concatenation of
strings is already compileable. The added value would be allowing braced
string literals to be involved, and promoting the resulting stack-item to the
result of the command/scriptlet. \(This simple compileability is also meant to
be a main advantage over **join [list ...] ""**, where the contents of the
intermediate list are either a single word or many words, or **lindex [list
...] 0** where the contents of the intermediate list are a single word.\)

The following equality will hold for any arbitrary contents of the variables
**a** and **b**:

	 string equals $a$b [string cat $a $b]

# Rejected Alternatives

Lars has mailed on tclcore that TclX has a command **cconcat** that does essentially what my proposed **string cat** is supposed to do \(not sure though whether that is compiled\). This proposal sticks to the **cat** subcommand, as that is generally the preferred way over new toplevel commands.

Also, **string concat** is added to this section, for it is a bit longer than **string cat**, and \(as Lars put it\) **string cat** is less likely to be misinterpreted as "concat, just moved into the string ensemble."

# Reference implementation

Available as branch tip-429 on core.tcl.tk.

# Copyright

This document has been placed in the public domain.

Name change from tip/43.tip to tip/43.md.

1
2
3
4
5
6
7
8
9

10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185

TIP:            43
Title:          How to be a TIP Editor
Version:        $Revision: 1.4 $
Author:         Donal K. Fellows <[email protected]>
State:          Draft
Type:           Informative
Vote:           Pending
Created:        07-Jul-2001
Post-History:   

~ Abstract

This TIP describes some of the rules and guidelines that the TIP
Editor uses when accepting TIPs for the first time.

~ Rules

There are some things that are hard rules, which should be obeyed even
if it means having to postpone acceptance of the TIP or rewrite it
yourself.

 * ''Every TIP ''must'' be relevant to Tcl and/or Tk.''

 > It's probably better to suggest that changes that affect just a
   single extension should be dealt with through the processes for
   feature requests for that extension, but where they are about
   providing some kind common interface across a whole group of
   extensions, it is fair to think of using a TIP as well.  I'd reckon
   that's up to the discretion of the editor, but no TIP should be
   rejected by the editor out of hand, and never without a proper
   written explanation.

 > Of course, ultimately whether a TIP is relevant to Tcl and/or Tk is
   up to the whole Tcl Core Team (as described in [0]) so you should
   try to ensure that their policy on TIP-suitability is what you are
   enforcing.

 * ''Every TIP ''must'' be in the TIP format (see [3] for details.)''

 > This is important because it allows the TIP rendering engine to
   handle all the formatting and indexing automatically for you.  Note
   that it is very picky about the format of the header, and not that
   choosy about the format of the content (though it is not a good
   idea to have a sub-item of a list without a previous main item.)
   Get it wrong, and the TIP archive engine will fail in all sorts of
   "interesting" ways.  Take particular note of the format of the
   ''Created:'' line, as it surprises many people in just how exact it
   must be.

 * ''Every author ''must'' be associated with a real email address.''

 > You should fill this in yourself if it is not already supplied and
   spam-protected addresses are not acceptable, since they tend to
   frustrate the main purpose of TIPs which is to foster collaboration
   on things to improve Tcl and Tk.  Proper email addresses help this
   by always allowing people to contact the author of the TIP to give
   suggestions to improve the TIP or to resolve issues they have with
   it.

 * ''Every TIP ''must'' have an Abstract.''

 > Not everyone has the desire, or the time, to read each TIP.
   Providing an abstract allows people to determine whether the TIP is
   relevant to what they are looking for at the moment.  Searches on
   the TIP archive also always search the abstract.

 > Abstracts should be formed of the section title whose text is
   precisely "Abstract" and then a single normal paragraph of no more
   than around 200 words; if it is longer than that then it is no
   longer a summary or abstract but a genuine major part of the
   document body.  While authors should write their own abstracts, it
   is reasonable for the editor to add one, particularly if the
   author's native language is not English.

 * ''Every TIP ''must'' have a Copyright declaration.''

 > World-wide copyright laws are funny things, and I'm not sure that
   it is safe to assume that the submission of the TIP constitutes
   permission for all the things that might be done with it in the
   future.  Work around this by getting every author to clarify the
   copyright position at time of submission by explicitly saying that
   the document is placed in the public domain.  (The way that TIPs
   are kept under CVS should assuage most concerns relating to
   misrepresentation through inappropriate modifications, and it is a
   definite aim that TIPs should be distributed as widely as possible
   to encourage a wide dissemination of the ideas contained.)

~ Guidelines

 * TIPs should be written in English (unless there are very good
   reasons otherwise) since that is the language most widely
   understood in the Tcl/Tk community.

 * TIP should be written so as to be readable!  This requirement is
   not strict, but it will make it much easier for the TCT to
   evaluate...

 * The Abstract should be written in a third-person voice, and
   ''definitely'' in English.  It isn't so important for the rest of
   the TIP, but the abstract will be seen quite a bit more widely and
   without as much context.  It also fits in with the style of the
   existing abstracts.

 * The section headings and title should be capitalised according to
   the rules for such things in English.  It looks neater that way.

 * Spell check before checking in.  No sense in having glaring errors
   in the initial version!  (I do not enforce the use of either US or
   UK spellings; that is rightfully the domain of the TIP author who
   might be based anywhere in the world.)  Be especially careful with
   the checking of the spellings of the names of file names, C
   identifiers or Tcl commands/variables/etc.

 * C identifiers and Tcl commands/variables/etc. should normally be
   ''emphasized'', as should file names.  This should be moderated by
   good sense though; the aim of such emphasis is to indicate that it
   is a reference to an entity in the code domain as opposed to the
   domain of the English language.

 * TIP numbers should be allocated by the TIP editor in sequence of
   the order they are checked into the CVS archive.  Make sure that
   the filename (''num.tip'') matches up with the ''TIP: num'' header
   or bizarre things may happen.

 * Where someone submits a TIP proposing a new Tk widget, invite them
   to supply an image (or two) of how the widget will look in
   operation.  These images will need to be checked in by hand, and
   will not be editable.  Images should be checked in in both a raster
   form (GIF, JPEG or PNG) and as Encapsulated PostScript (EPS) - make
   sure that you set the binary flag on the file when you do this.
   Where someone produces a diagram with a tool that can produce FIG
   files, it is nice if you can check that into CVS as well so that
   the diagram itself can be maintained if necessary.

 > As a convention, name the images with the TIP number as the first
   part of the name.  This makes it much easier to determine what TIP
   a particular image is associated with (and certainly beats grepping
   the whole set of TIPs!)

 * Once a TIP is checked in, it should normally be published to
   news:comp.lang.tcl, news:comp.lang.tcl.annouce and the tcl-core
   mailing list (though with some TIPs it is obvious that wider
   dissemination is less useful.)  It is a good idea to send a copy to
   the TIP author as well, as this lets them know not only that the
   TIP has been accepted but also what it looks like and that it has
   been distributed to the wider community.  The ''postnews.tcl''
   script that comes with the TIP renderer distribution is designed to
   do all this with a minimum of fuss.  A quick "Thank You" note is
   also courteous.

 * When a TIP has been accepted by the TCT in a TYANNOTT vote, put a
   note into the log to record what the vote was.  It is best to do
   this as part of the log message for when you change the Vote: and
   Status: headers...

 * If a TIP does not state whether it is an alteration to Tcl or Tk in
   either its title or its abstract, it is a good idea to add a
   Keywords: header (or a keyword in an existing such header) which
   includes that information.

 * Don't forget to use both '''bold''' and ''italic'' text when
   formatting strings that represent command syntaxes.  It makes them
   much clearer!

''I need to write something here about the production of PS and PDF
versions of the whole TIP archive, but that side of the code is not
yet finished and released.''

~ Notes

TIPs do not need to be tightly focussed.  Making them so does make
them easier to evaluate, but it might also remove the real rationale
behind the changes.  Instead, it is best that they form a coherent
logical entity, since I believe that it is that which makes for a good
TIP.

The title, section headings and list item headings must be plain text.
This is because there are output formats which are very picky about
what is allowed in those sorts of places (PDF bookmarks have
especially strict restrictions) and plain text has the virtue of being
accepted pretty much everywhere.

~ Copyright

This document is placed in the public domain.

<
|
<
|
|
|
|
|
|
>

|

|

|

|

|
|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|
|

|
|

|

|

|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185

# TIP 43: How to be a TIP Editor

	Author:         Donal K. Fellows <[email protected]>
	State:          Draft
	Type:           Informative
	Vote:           Pending
	Created:        07-Jul-2001
	Post-History:   
-----

# Abstract

This TIP describes some of the rules and guidelines that the TIP
Editor uses when accepting TIPs for the first time.

# Rules

There are some things that are hard rules, which should be obeyed even
if it means having to postpone acceptance of the TIP or rewrite it
yourself.

 * _Every TIP _must_ be relevant to Tcl and/or Tk._

	 > It's probably better to suggest that changes that affect just a
   single extension should be dealt with through the processes for
   feature requests for that extension, but where they are about
   providing some kind common interface across a whole group of
   extensions, it is fair to think of using a TIP as well.  I'd reckon
   that's up to the discretion of the editor, but no TIP should be
   rejected by the editor out of hand, and never without a proper
   written explanation.

	 > Of course, ultimately whether a TIP is relevant to Tcl and/or Tk is
   up to the whole Tcl Core Team \(as described in [[0]](0.md)\) so you should
   try to ensure that their policy on TIP-suitability is what you are
   enforcing.

 * _Every TIP _must_ be in the TIP format \(see [[3]](3.md) for details.\)_

	 > This is important because it allows the TIP rendering engine to
   handle all the formatting and indexing automatically for you.  Note
   that it is very picky about the format of the header, and not that
   choosy about the format of the content \(though it is not a good
   idea to have a sub-item of a list without a previous main item.\)
   Get it wrong, and the TIP archive engine will fail in all sorts of
   "interesting" ways.  Take particular note of the format of the
   _Created:_ line, as it surprises many people in just how exact it
   must be.

 * _Every author _must_ be associated with a real email address._

	 > You should fill this in yourself if it is not already supplied and
   spam-protected addresses are not acceptable, since they tend to
   frustrate the main purpose of TIPs which is to foster collaboration
   on things to improve Tcl and Tk.  Proper email addresses help this
   by always allowing people to contact the author of the TIP to give
   suggestions to improve the TIP or to resolve issues they have with
   it.

 * _Every TIP _must_ have an Abstract._

	 > Not everyone has the desire, or the time, to read each TIP.
   Providing an abstract allows people to determine whether the TIP is
   relevant to what they are looking for at the moment.  Searches on
   the TIP archive also always search the abstract.

	 > Abstracts should be formed of the section title whose text is
   precisely "Abstract" and then a single normal paragraph of no more
   than around 200 words; if it is longer than that then it is no
   longer a summary or abstract but a genuine major part of the
   document body.  While authors should write their own abstracts, it
   is reasonable for the editor to add one, particularly if the
   author's native language is not English.

 * _Every TIP _must_ have a Copyright declaration._

	 > World-wide copyright laws are funny things, and I'm not sure that
   it is safe to assume that the submission of the TIP constitutes
   permission for all the things that might be done with it in the
   future.  Work around this by getting every author to clarify the
   copyright position at time of submission by explicitly saying that
   the document is placed in the public domain.  \(The way that TIPs
   are kept under CVS should assuage most concerns relating to
   misrepresentation through inappropriate modifications, and it is a
   definite aim that TIPs should be distributed as widely as possible
   to encourage a wide dissemination of the ideas contained.\)

# Guidelines

 * TIPs should be written in English \(unless there are very good
   reasons otherwise\) since that is the language most widely
   understood in the Tcl/Tk community.

 * TIP should be written so as to be readable!  This requirement is
   not strict, but it will make it much easier for the TCT to
   evaluate...

 * The Abstract should be written in a third-person voice, and
   _definitely_ in English.  It isn't so important for the rest of
   the TIP, but the abstract will be seen quite a bit more widely and
   without as much context.  It also fits in with the style of the
   existing abstracts.

 * The section headings and title should be capitalised according to
   the rules for such things in English.  It looks neater that way.

 * Spell check before checking in.  No sense in having glaring errors
   in the initial version!  \(I do not enforce the use of either US or
   UK spellings; that is rightfully the domain of the TIP author who
   might be based anywhere in the world.\)  Be especially careful with
   the checking of the spellings of the names of file names, C
   identifiers or Tcl commands/variables/etc.

 * C identifiers and Tcl commands/variables/etc. should normally be
   _emphasized_, as should file names.  This should be moderated by
   good sense though; the aim of such emphasis is to indicate that it
   is a reference to an entity in the code domain as opposed to the
   domain of the English language.

 * TIP numbers should be allocated by the TIP editor in sequence of
   the order they are checked into the CVS archive.  Make sure that
   the filename \(_num.tip_\) matches up with the _TIP: num_ header
   or bizarre things may happen.

 * Where someone submits a TIP proposing a new Tk widget, invite them
   to supply an image \(or two\) of how the widget will look in
   operation.  These images will need to be checked in by hand, and
   will not be editable.  Images should be checked in in both a raster
   form \(GIF, JPEG or PNG\) and as Encapsulated PostScript \(EPS\) - make
   sure that you set the binary flag on the file when you do this.
   Where someone produces a diagram with a tool that can produce FIG
   files, it is nice if you can check that into CVS as well so that
   the diagram itself can be maintained if necessary.

	 > As a convention, name the images with the TIP number as the first
   part of the name.  This makes it much easier to determine what TIP
   a particular image is associated with \(and certainly beats grepping
   the whole set of TIPs!\)

 * Once a TIP is checked in, it should normally be published to
   news:comp.lang.tcl, news:comp.lang.tcl.annouce and the tcl-core
   mailing list \(though with some TIPs it is obvious that wider
   dissemination is less useful.\)  It is a good idea to send a copy to
   the TIP author as well, as this lets them know not only that the
   TIP has been accepted but also what it looks like and that it has
   been distributed to the wider community.  The _postnews.tcl_
   script that comes with the TIP renderer distribution is designed to
   do all this with a minimum of fuss.  A quick "Thank You" note is
   also courteous.

 * When a TIP has been accepted by the TCT in a TYANNOTT vote, put a
   note into the log to record what the vote was.  It is best to do
   this as part of the log message for when you change the Vote: and
   Status: headers...

 * If a TIP does not state whether it is an alteration to Tcl or Tk in
   either its title or its abstract, it is a good idea to add a
   Keywords: header \(or a keyword in an existing such header\) which
   includes that information.

 * Don't forget to use both **bold** and _italic_ text when
   formatting strings that represent command syntaxes.  It makes them
   much clearer!

_I need to write something here about the production of PS and PDF
versions of the whole TIP archive, but that side of the code is not
yet finished and released._

# Notes

TIPs do not need to be tightly focussed.  Making them so does make
them easier to evaluate, but it might also remove the real rationale
behind the changes.  Instead, it is best that they form a coherent
logical entity, since I believe that it is that which makes for a good
TIP.

The title, section headings and list item headings must be plain text.
This is because there are output formats which are very picky about
what is allowed in those sorts of places \(PDF bookmarks have
especially strict restrictions\) and plain text has the virtue of being
accepted pretty much everywhere.

# Copyright

This document is placed in the public domain.

Name change from tip/430.tip to tip/430.md.

1
2
3
4
5
6
7
8
9
10
11
12
13
14

15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112

TIP:            430
Title:          Add basic ZIP archive support to Tcl
Version:        $Revision: 1.6 $
Author:         Sean Woods <[email protected]>
Author:         Donal Fellows <[email protected]>
Author:         Poor Yorick <[email protected]>
Author:         Harald Oehlmann <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        03-Sep-2014
Post-History:   
Keywords:       virtual filesystem,zip,tclkit,boot,bootstrap
Tcl-Version:    8.6.3

~ Abstract

This proposal will add basic support for mounting zip archive files as virtual
filesystems to the Tcl core.

~ Target Tcl-Version

This TIP targets TCL Version 8.7 or 9.0, whatever comes first.

~ Rationale

Tcl/Tk relies on the presence of a file system containing Tcl scripts for
bootstrapping the interpreter.  When dealing with code packed in a
self-contained executable, a chicken-and-egg problem arises when developers
try to provide this bootstrap from their attached VFS with extensions like
TclVfs.  TclVfs runs in the Tcl interpreter.  The interpreter needs
''init.tcl'', which would mean that the filesystem containing ''init.tcl'' is
not present until after TclVfs mounts it yet that mount cannot happen until
after ''init.tcl'' has been loaded. Bootstrap filesystem mounts require
built-in support for the filesystem that they use.

With the inclusion of Zlib in the core (starting with 8.6, [244]), all that is
required to implement a zip file system based VFS is to add a C-level VFS
implementation to decode the zip archive format. Thus: this project.

Note that we are prioritizing the zip archive format also because it is
practical to generate the files without a Tcl installation being present; it
is a format with widespread OS support. This makes it much easier to bootstrap
a build of Tcl that uses it without requiring a native build of tclsh to be
present.

~ Specification

There shall be new commands added to safe interpreters withing Tcl. All of which 
shall be in the '''::zvfs''' namespace. These commands shall include:

 * '''zvfs::mount''' ?''archive''? ?''mountpoint''?

 > Mounts the ZIP file ''archive'' at the location given by ''mountpoint'',
   which will default to '''zipfs:''/''archive'' if absent. With no arguments
   this command describes all current mounts, returning a list of pairs.

 * '''zvfs::unmount''' ''archive''

 > Unmounts the ZIP file ''archive'', which must have been previously mounted.

Safe interpreters will not be given the mount or unmount commands. 
Already mounted file systems will be available via the '''glob''' and '''file'''
commands. These commands, and any commands related to building
archives will be marked with the unsafe bit within the '''zipfs''' ensemble,
and will be removed from any interpreter through the normal mechanism
to hide unsafe commands within the core.

~ Implementation

I have adapted Richard Hipp's work on Tcl As One Big Executable (TOBE) to
operate inside of a modern Tcl. That implementation consists of one C file
(''tclZipvfs.c'').  I have also prepared new behaviors for inside of
Tcl_AppInit() to detect if a zip filesystem is attached to the current
executable, and how to extract a "''main.tcl''" as well as the initial file
systems for both Tcl and Tk.

This work is checked in as the "''core_zip_vfs''" branch on both Tcl and Tk.

~ C API

* '''int TclZipfsInit(Tcl_Interp *interp);'''
> Initializes the C API for Zipfs. If called with a non-null ''interp'', adds the commands
   for the zipfs Tcl API to the interpreter. Returns '''TCL_OK''' on success, and '''TCL_ERROR''' in  
   all other cases.

* '''int TclZipfsMount(Tcl_Interp *interp, const char *zipname, const char *mntpt, const char *passwd);'''
> Mounts a zip file ''zipname'' to the mount point ''mntpt''. If ''passwd'' is non-null, that string is
   utilized as the password to decrypt the contents. ''mnpnt'' will always be relative to '''zipfs:'''

* '''int TclZipfsUnmount(Tcl_Interp *interp, const char *zipname);'''
> Unmount the file system created by a prior call to '''TclZipfsMount()'''

~ Bootstraping

The mount and unmount commands are usable within the core as just another feature engine. A call to TclZipfsInit() will be inserted into tclBasic.c, immediately after the code to initialize zlib.

A modified shell (tclkit.exe) will be generated by Make. This shell will:
* Check if the executable has a zip archive attached. If so, that 
archive shall be mounted as '''zipfs:/app'''. 
* If ''zipfs:/app" is present the interpreter will look for 
'''boot/tcl/init.tcl'''. If that file is present, the location for '''$tcl_library'''
will be set to '''zipfs:/app/boot/tcl'''.
* If ''zipfs:/app'' is present the interpreter will look for '''boot/tk/tk.tcl'. If present
the location for '''$tk_library''' will be set to '''zipfs:/boot/tk'''. 
* If the file '''pkgIndex.tcl''' is present, the '''$dir''' variable will be set to
'''zipfs:/app''' and the file will be sourced as if it were a package index.
* If the file '''main.tcl''' is present, the file '''zipfs://app/main.tcl''' will be registered with '''Tcl_SetStartupScript()'''

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|
|
|

|

|

|
|
|

|
|
|

|
|

|

|

|

|
|
|
|
|
|
|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112

# TIP 430: Add basic ZIP archive support to Tcl

	Author:         Sean Woods <[email protected]>
	Author:         Donal Fellows <[email protected]>
	Author:         Poor Yorick <[email protected]>
	Author:         Harald Oehlmann <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        03-Sep-2014
	Post-History:   
	Keywords:       virtual filesystem,zip,tclkit,boot,bootstrap
	Tcl-Version:    8.6.3
-----

# Abstract

This proposal will add basic support for mounting zip archive files as virtual
filesystems to the Tcl core.

# Target Tcl-Version

This TIP targets TCL Version 8.7 or 9.0, whatever comes first.

# Rationale

Tcl/Tk relies on the presence of a file system containing Tcl scripts for
bootstrapping the interpreter.  When dealing with code packed in a
self-contained executable, a chicken-and-egg problem arises when developers
try to provide this bootstrap from their attached VFS with extensions like
TclVfs.  TclVfs runs in the Tcl interpreter.  The interpreter needs
_init.tcl_, which would mean that the filesystem containing _init.tcl_ is
not present until after TclVfs mounts it yet that mount cannot happen until
after _init.tcl_ has been loaded. Bootstrap filesystem mounts require
built-in support for the filesystem that they use.

With the inclusion of Zlib in the core \(starting with 8.6, [[244]](244.md)\), all that is
required to implement a zip file system based VFS is to add a C-level VFS
implementation to decode the zip archive format. Thus: this project.

Note that we are prioritizing the zip archive format also because it is
practical to generate the files without a Tcl installation being present; it
is a format with widespread OS support. This makes it much easier to bootstrap
a build of Tcl that uses it without requiring a native build of tclsh to be
present.

# Specification

There shall be new commands added to safe interpreters withing Tcl. All of which 
shall be in the **::zvfs** namespace. These commands shall include:

 * **zvfs::mount** ?_archive_? ?_mountpoint_?

	 > Mounts the ZIP file _archive_ at the location given by _mountpoint_,
   which will default to **zipfs:_/_archive_ if absent. With no arguments
   this command describes all current mounts, returning a list of pairs.

 * **zvfs::unmount** _archive_

	 > Unmounts the ZIP file _archive_, which must have been previously mounted.

Safe interpreters will not be given the mount or unmount commands. 
Already mounted file systems will be available via the **glob** and **file**
commands. These commands, and any commands related to building
archives will be marked with the unsafe bit within the **zipfs** ensemble,
and will be removed from any interpreter through the normal mechanism
to hide unsafe commands within the core.

# Implementation

I have adapted Richard Hipp's work on Tcl As One Big Executable \(TOBE\) to
operate inside of a modern Tcl. That implementation consists of one C file
\(_tclZipvfs.c_\).  I have also prepared new behaviors for inside of
Tcl\_AppInit\(\) to detect if a zip filesystem is attached to the current
executable, and how to extract a "_main.tcl_" as well as the initial file
systems for both Tcl and Tk.

This work is checked in as the "_core\_zip\_vfs_" branch on both Tcl and Tk.

# C API

* **int TclZipfsInit\(Tcl\_Interp \*interp\);**
	> Initializes the C API for Zipfs. If called with a non-null _interp_, adds the commands
   for the zipfs Tcl API to the interpreter. Returns **TCL\_OK** on success, and **TCL\_ERROR** in  
   all other cases.

* **int TclZipfsMount\(Tcl\_Interp \*interp, const char \*zipname, const char \*mntpt, const char \*passwd\);**
	> Mounts a zip file _zipname_ to the mount point _mntpt_. If _passwd_ is non-null, that string is
   utilized as the password to decrypt the contents. _mnpnt_ will always be relative to **zipfs:**

* **int TclZipfsUnmount\(Tcl\_Interp \*interp, const char \*zipname\);**
	> Unmount the file system created by a prior call to **TclZipfsMount\(\)**

# Bootstraping

The mount and unmount commands are usable within the core as just another feature engine. A call to TclZipfsInit\(\) will be inserted into tclBasic.c, immediately after the code to initialize zlib.

A modified shell \(tclkit.exe\) will be generated by Make. This shell will:
* Check if the executable has a zip archive attached. If so, that 
archive shall be mounted as **zipfs:/app**. 
* If _zipfs:/app" is present the interpreter will look for 
**boot/tcl/init.tcl**. If that file is present, the location for **$tcl\_library**
will be set to **zipfs:/app/boot/tcl**.
* If _zipfs:/app_ is present the interpreter will look for **boot/tk/tk.tcl'. If present
the location for **$tk\_library** will be set to **zipfs:/boot/tk**. 
* If the file **pkgIndex.tcl** is present, the **$dir** variable will be set to
**zipfs:/app** and the file will be sourced as if it were a package index.
* If the file **main.tcl** is present, the file **zipfs://app/main.tcl** will be registered with **Tcl\_SetStartupScript\(\)**

# Copyright

This document has been placed in the public domain.

Name change from tip/431.tip to tip/431.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69

TIP:		431
Title:		Add 'tempdir' Subcommand to 'file'
Version:	$Revision: 1.2 $
Author:		Kevin Pasko <[email protected]>
State:		Draft
Type:		Project
Tcl-Version:	8.6.4
Vote:		Pending
Created:		10-Sep-2014
Keywords:	Tcl, directory, file
Post-History:

~ Abstract

This TIP proposes adding a new '''tempdir''' subcommand to the '''file'''
command, simplifying the effort required in creating uniquely named temporary
directories at the scripting level.

~ Rationale

Due to the non-atomic nature of the '''file mkdir''' command it is currently
impossible to create uniquely named temporary directories at the script level
without the possibility of race conditions.

~ Specification

The '''file tempdir''' command shall implement the functionality of the POSIX
standard mkdtemp() function. With no arguments '''file tempdir''' shall create
a uniquely named temporary directory in the native operating system's
temporary directory, with naming convention "'''tcl_'''''XXXXXX''" where each
''X'' is a randomly selected character (following the '''file tempfile'''
naming convention). Successful completion of '''file tempdir''' shall return
the absolute path of the created directory, otherwise an error shall be
thrown.

'''file tempdir''' shall have an optional argument, ''template'', to modify
the created directory's path and name. The ''template'' shall be decomposed
into (up to) two parts: the directory's path and rootname. If either part is
absent, relevant defaults (e.g., according to the native operating system)
shall be used. The entire temporary name shall then be formed from the path,
the root, and a generated unique string of (typically) six characters.

The command syntax should be defined as:

 > '''file tempdir''' ?''template''?

~ Considerations

 * The subcommand '''tempdir''' could be a candidate, later, for returning the
   native file system's temporary directory. Naming the subcommand something
   else such as '''mktempdir''' is another option, though strays from the '''file
   tempfile''' naming convention.

 * For future extensibility the '''template''' argument to '''file tempdir'''
   (since it is optional) could be specified in the key / value format,
   '''-template''', changing the command syntax to:

 > > '''file tempdir''' ?''options...''?

~ Reference Implementation

An example of temporary directory creation has already been developed into the
Tcl core, at the C level, within the platform specific layers of the
'''load''' command. The principal work remaining is to expose this via a Tcl
command.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|
|

|
|
|

|
|
|
|

|

|

|

|

|
|

|
|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69

# TIP 431: Add 'tempdir' Subcommand to 'file'

	Author:		Kevin Pasko <[email protected]>
	State:		Draft
	Type:		Project
	Tcl-Version:	8.6.4
	Vote:		Pending
	Created:		10-Sep-2014
	Keywords:	Tcl, directory, file
	Post-History:
-----

# Abstract

This TIP proposes adding a new **tempdir** subcommand to the **file**
command, simplifying the effort required in creating uniquely named temporary
directories at the scripting level.

# Rationale

Due to the non-atomic nature of the **file mkdir** command it is currently
impossible to create uniquely named temporary directories at the script level
without the possibility of race conditions.

# Specification

The **file tempdir** command shall implement the functionality of the POSIX
standard mkdtemp\(\) function. With no arguments **file tempdir** shall create
a uniquely named temporary directory in the native operating system's
temporary directory, with naming convention "**tcl\_**_XXXXXX_" where each
_X_ is a randomly selected character \(following the **file tempfile**
naming convention\). Successful completion of **file tempdir** shall return
the absolute path of the created directory, otherwise an error shall be
thrown.

**file tempdir** shall have an optional argument, _template_, to modify
the created directory's path and name. The _template_ shall be decomposed
into \(up to\) two parts: the directory's path and rootname. If either part is
absent, relevant defaults \(e.g., according to the native operating system\)
shall be used. The entire temporary name shall then be formed from the path,
the root, and a generated unique string of \(typically\) six characters.

The command syntax should be defined as:

 > **file tempdir** ?_template_?

# Considerations

 * The subcommand **tempdir** could be a candidate, later, for returning the
   native file system's temporary directory. Naming the subcommand something
   else such as **mktempdir** is another option, though strays from the **file
   tempfile** naming convention.

 * For future extensibility the **template** argument to **file tempdir**
   \(since it is optional\) could be specified in the key / value format,
   **-template**, changing the command syntax to:

	 > > **file tempdir** ?_options..._?

# Reference Implementation

An example of temporary directory creation has already been developed into the
Tcl core, at the C level, within the platform specific layers of the
**load** command. The principal work remaining is to expose this via a Tcl
command.

# Copyright

This document has been placed in the public domain.

Name change from tip/432.tip to tip/432.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88

TIP:		432
Title:		Support for New Windows File Dialogs in Vista and Later
Version:	$Revision: 1.4 $
Author:		Ashok P. Nadkarni <[email protected]>
State:		Final
Type:		Project
Vote:		Done
Created:	20-Sep-2014
Post-History:	
Tcl-Version:	8.6.3

~ Abstract

This TIP proposes changing the '''tk_getOpenFile''', '''tk_getSaveFile''' and
'''tk_chooseDirectory''' dialog box commands to display the new style file
dialogs available on newer Windows versions.

~ Rationale

As of Tk 8.6.2, the above commands translate to Windows native file dialogs
corresponding to the ones present in Windows XP (the earliest version of
Windows supported by Tcl 8.6). Vista and later Windows systems have newer
versions of these dialogs with additional features and a different look and
feel. Although the older dialogs are functional on these platforms, they have
the following issues:

 * They do not support the new features, such as breadcrumbs, enhanced
   navigation etc.

 * The look and feel is dated and inconsistent not only with other native
   applications, but even with Tk itself since the Ttk widgets adapt to the
   theme for the platform.

In addition, this TIP proposes some changes to behaviour with respect to
existing dialogs that would make the dialogs more consistent with Windows
conventions.

~ Proposed Changes

The proposal will result in the '''tk_getOpenFile''', '''tk_getSaveFile''' and
'''tk_chooseDirectory''' dialog box commands displaying the new Vista style
file dialogs if available and falling back to the older style otherwise.
Options to the commands and return value from the dialogs remain unchanged
except as noted below.

~~ Incompatible changes

If the '''-initialdir''' option is not specified, the new dialog will default
to the default Windows mechanism for choosing the initial directory displayed.
Documentation will be updated to state that the initial directory displayed
when this option is not present is system dependent.

~ Reference Implementation

A reference implementation is available in the apn-win-filedialogs branch.

The new dialogs require a new COM interface IFileDialog. The reference
implementation uses this interface if available and falls back to the old one
otherwise.

~ Discussion

 * The change in behaviour when '''-initialdir''' is not specified is driven
   by the fact that on Windows the current working directory for a GUI program
   is generally the directory where the program was installed. This is almost
   never useful and is contrary to what the user expects which is the last
   directory shown by the program (even across process invocations).

 * Should there be either a global setting or an option that forces the use of
   old style dialogs. Alternatively, should the new dialogs be only displayed
   if a (new) option is specified with the command.  The author is not in
   favor of either of these but applications that have documented screenshots
   may wish to preserve the old dialogs.  As of now, the reference
   implementation has a hidden option '''-xpstyle''' that can be used to
   select between old and new styles.  This is present mainly to allow
   debugging and testing of the older dialogs on newer platforms.

 * The new implementation calls '''CoInitialize''' to initialize COM. It is
   not clear when, and if, '''CoUnInitialize''' needs to be called. In fact,
   as documented in MSDN, even the '''SHBrowseForFolder''' call used by the
   current 8.6 code requires a prior call to '''CoInitialize''' which Tcl does
   not do.  Need discussion on whether Tcl should always call
   '''CoInitialize''' at thread startup and '''CoUnInitialize''' at thread
   shutdown.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|
|

|

|
|

|

|
|

|

|

|

|

|

|

|

|

|
|
|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88

# TIP 432: Support for New Windows File Dialogs in Vista and Later

	Author:		Ashok P. Nadkarni <[email protected]>
	State:		Final
	Type:		Project
	Vote:		Done
	Created:	20-Sep-2014
	Post-History:	
	Tcl-Version:	8.6.3
-----

# Abstract

This TIP proposes changing the **tk\_getOpenFile**, **tk\_getSaveFile** and
**tk\_chooseDirectory** dialog box commands to display the new style file
dialogs available on newer Windows versions.

# Rationale

As of Tk 8.6.2, the above commands translate to Windows native file dialogs
corresponding to the ones present in Windows XP \(the earliest version of
Windows supported by Tcl 8.6\). Vista and later Windows systems have newer
versions of these dialogs with additional features and a different look and
feel. Although the older dialogs are functional on these platforms, they have
the following issues:

 * They do not support the new features, such as breadcrumbs, enhanced
   navigation etc.

 * The look and feel is dated and inconsistent not only with other native
   applications, but even with Tk itself since the Ttk widgets adapt to the
   theme for the platform.

In addition, this TIP proposes some changes to behaviour with respect to
existing dialogs that would make the dialogs more consistent with Windows
conventions.

# Proposed Changes

The proposal will result in the **tk\_getOpenFile**, **tk\_getSaveFile** and
**tk\_chooseDirectory** dialog box commands displaying the new Vista style
file dialogs if available and falling back to the older style otherwise.
Options to the commands and return value from the dialogs remain unchanged
except as noted below.

## Incompatible changes

If the **-initialdir** option is not specified, the new dialog will default
to the default Windows mechanism for choosing the initial directory displayed.
Documentation will be updated to state that the initial directory displayed
when this option is not present is system dependent.

# Reference Implementation

A reference implementation is available in the apn-win-filedialogs branch.

The new dialogs require a new COM interface IFileDialog. The reference
implementation uses this interface if available and falls back to the old one
otherwise.

# Discussion

 * The change in behaviour when **-initialdir** is not specified is driven
   by the fact that on Windows the current working directory for a GUI program
   is generally the directory where the program was installed. This is almost
   never useful and is contrary to what the user expects which is the last
   directory shown by the program \(even across process invocations\).

 * Should there be either a global setting or an option that forces the use of
   old style dialogs. Alternatively, should the new dialogs be only displayed
   if a \(new\) option is specified with the command.  The author is not in
   favor of either of these but applications that have documented screenshots
   may wish to preserve the old dialogs.  As of now, the reference
   implementation has a hidden option **-xpstyle** that can be used to
   select between old and new styles.  This is present mainly to allow
   debugging and testing of the older dialogs on newer platforms.

 * The new implementation calls **CoInitialize** to initialize COM. It is
   not clear when, and if, **CoUnInitialize** needs to be called. In fact,
   as documented in MSDN, even the **SHBrowseForFolder** call used by the
   current 8.6 code requires a prior call to **CoInitialize** which Tcl does
   not do.  Need discussion on whether Tcl should always call
   **CoInitialize** at thread startup and **CoUnInitialize** at thread
   shutdown.

# Copyright

This document has been placed in the public domain.

Name change from tip/433.tip to tip/433.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77

78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103

104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133

TIP:            433
Title:          Add %M binding substitution
Version:        $Revision: 1.7 $
Author:		Joe Mistachkin <[email protected]>
Author:		Brian Griffin <[email protected]>
Author:         Don Porter <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        25-Feb-2015
Post-History:   
Tcl-Version:    8.6.4

~ Abstract

This TIP proposes one new binding substitution, '''%M''', to access
the number of script-based binding patterns matched so far for the event.

~ Background

In a presentation at the 2012 Tcl Conference
(http://www.tclcommunityassociation.org/wub/proceedings/Proceedings-2012/RonWold/Customizable-Keyboard-Shortcuts.pdf), Ron Wold pointed out that
coding global catch-all scripts bound to the same event in many widgets
is complicated because Tk's '''bind''' machinery allows any bind script to
use '''break''' to prevent later bind scripts from evaluating.  Among
a known set of bind scripts, this is a useful technique, but it interferes
with the non-coordinated introduction of additional bind scripts
on the same event.

An alternative strategy is to avoid any bind script using '''break''', but
to give the latter scripts the means to detect when an earlier
script has run so it can defer its own operations.  That motivates
the introduction of a means by which a bind script can discover something
about the history of other bind script evaluations on the same event.

~ Specification

Add to the set of substitutions made in scripts passed to '''bind'''
the new one, '''%M'''.  When the substring '''%M''' appears in a binding
script, it will be replaced with a count of the number of binding
script evaluations that have already been performed in the handling
of the current event.

~ Simple Example

The script...

|   pack [entry .e]
|   bind all <Key> {set z %M}
|   bind Entry <Key> {set y %M}
|   bind .e <Key> {set x %M}
|   event generate .e <Key-a>
|   list $x $y $z

will produce the result:

|   0 1 2

~ Use Case Example

One of the default bind scripts in Tk is

| event add <<NextWindow>> <Tab>
| bind all <<NextWindow>> {tk::TabToWindow [tk_focusNext %W]}

which permits a '''<Tab>''' anywhere in Tk to shift the focus.

Some widgets have their own uses for '''<Tab>''', though, notably

| bind Text <Tab> {
|     if {[%W cget -state] eq "normal"} {
|         tk::TextInsert %W \t
|         focus %W
|         break
|     }
| }

where a text widget in normal state accepts '''Tab'''s as entered
text like any other keypresses.  The '''break''' in this script serves
to prevent the focus shift that would otherwise take place.
Since the same Tk developers coded both bind scripts, the global
knowledge can make the system work as a whole.

However, a third party facility trying to join the party has
difficulty.  Consider a simple key logging facility,

| bind all <Key> {log_key %W %k %s %x %y %X %Y %A}

This will fail to log '''Tab'''s typed in a Text due to the '''break''' noted
above.

With the '''%M''' binding proposed here, an alternative set of
bind scripts is possible.

| bind all <<NextWindow>> {if {%M==0} {tk::TabToWindow [tk_focusNext %W]}}

| bind Text <Tab> {
|     if {[%W cget -state] eq "normal"} {
|         tk::TextInsert %W \t
|         focus %W
|     }
| }

In fact with the revised script bound to '''all''' it may be that
the script bound to '''<Tab>''' is no longer needed at all and the
general default binding

| bind Text <KeyPress> {tk::TextInsert %W %A}

is sufficient.

With this alternative in place the third-party keylogger would work.

~ Compatibility

Any '''%M''' in an existing bind script will now stop reproducing
itself literally, and will result in the new substitution.  This
has the potential to cause trouble with any bind scripts that
themselves make use of '''clock scan''' or '''clock format''' or 
any other command that invites the use of the literal string '''%M'''.
Dealing with such a situation is not difficult, but it is still
a potential incompatibility.

~ Prototype

This feature is already implemented and committed to both the
core-8-5-branch and the trunk, and is poised to be released
as part of Tk 8.5.18 and Tk 8.6.4.  Any objections to that
should be raised in TIP discussion and voting.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|

|

|
|

|

|
|
|
|
|
|

|

|

|
|

|

|

|
|
|
|
|
<
<
|
>
>
|
|

|

|

|

|

|
|
|
|
<
<
|
>
>
|
|

|

|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73

74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99

100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133

# TIP 433: Add %M binding substitution

	Author:		Joe Mistachkin <[email protected]>
	Author:		Brian Griffin <[email protected]>
	Author:         Don Porter <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        25-Feb-2015
	Post-History:   
	Tcl-Version:    8.6.4
-----

# Abstract

This TIP proposes one new binding substitution, **%M**, to access
the number of script-based binding patterns matched so far for the event.

# Background

In a presentation at the 2012 Tcl Conference
\(<http://www.tclcommunityassociation.org/wub/proceedings/Proceedings-2012/RonWold/Customizable-Keyboard-Shortcuts.pdf\),> Ron Wold pointed out that
coding global catch-all scripts bound to the same event in many widgets
is complicated because Tk's **bind** machinery allows any bind script to
use **break** to prevent later bind scripts from evaluating.  Among
a known set of bind scripts, this is a useful technique, but it interferes
with the non-coordinated introduction of additional bind scripts
on the same event.

An alternative strategy is to avoid any bind script using **break**, but
to give the latter scripts the means to detect when an earlier
script has run so it can defer its own operations.  That motivates
the introduction of a means by which a bind script can discover something
about the history of other bind script evaluations on the same event.

# Specification

Add to the set of substitutions made in scripts passed to **bind**
the new one, **%M**.  When the substring **%M** appears in a binding
script, it will be replaced with a count of the number of binding
script evaluations that have already been performed in the handling
of the current event.

# Simple Example

The script...

	   pack [entry .e]
	   bind all <Key> {set z %M}
	   bind Entry <Key> {set y %M}
	   bind .e <Key> {set x %M}
	   event generate .e <Key-a>
	   list $x $y $z

will produce the result:

	   0 1 2

# Use Case Example

One of the default bind scripts in Tk is

	 event add <<NextWindow>> <Tab>
	 bind all <<NextWindow>> {tk::TabToWindow [tk_focusNext %W]}

which permits a **<Tab>** anywhere in Tk to shift the focus.

Some widgets have their own uses for **<Tab>**, though, notably

	 bind Text <Tab> {
	     if {[%W cget -state] eq "normal"} {
	         tk::TextInsert %W \t
	         focus %W
	         break

	     }
	 }

where a text widget in normal state accepts **Tab**s as entered
text like any other keypresses.  The **break** in this script serves
to prevent the focus shift that would otherwise take place.
Since the same Tk developers coded both bind scripts, the global
knowledge can make the system work as a whole.

However, a third party facility trying to join the party has
difficulty.  Consider a simple key logging facility,

	 bind all <Key> {log_key %W %k %s %x %y %X %Y %A}

This will fail to log **Tab**s typed in a Text due to the **break** noted
above.

With the **%M** binding proposed here, an alternative set of
bind scripts is possible.

	 bind all <<NextWindow>> {if {%M==0} {tk::TabToWindow [tk_focusNext %W]}}

	 bind Text <Tab> {
	     if {[%W cget -state] eq "normal"} {
	         tk::TextInsert %W \t
	         focus %W

	     }
	 }

In fact with the revised script bound to **all** it may be that
the script bound to **<Tab>** is no longer needed at all and the
general default binding

	 bind Text <KeyPress> {tk::TextInsert %W %A}

is sufficient.

With this alternative in place the third-party keylogger would work.

# Compatibility

Any **%M** in an existing bind script will now stop reproducing
itself literally, and will result in the new substitution.  This
has the potential to cause trouble with any bind scripts that
themselves make use of **clock scan** or **clock format** or 
any other command that invites the use of the literal string **%M**.
Dealing with such a situation is not difficult, but it is still
a potential incompatibility.

# Prototype

This feature is already implemented and committed to both the
core-8-5-branch and the trunk, and is poised to be released
as part of Tk 8.5.18 and Tk 8.6.4.  Any objections to that
should be raised in TIP discussion and voting.

# Copyright

This document has been placed in the public domain.

Name change from tip/434.tip to tip/434.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

TIP:		434
Title: 		Specify Event Sources for 'vwait'
Version: 	$Revision: 1.1 $
Author:		Jos Decoster <[email protected]>
State:		Draft
Type:		Project
Tcl-Version:	8.6
Vote:	Pending
Created:	26-Feb-2015
Post-History:	

~ Abstract

This TIP proposes to extend the '''vwait''' Tcl command so the event sources can
be specified, as is possible with the '''Tcl_DoOneEvent''' C command.

~ Rationale

In some situations it can be required not to wait for specific event sources or
to wait for specific events sources only. You might want the program to only
react on timer events, and not on file or window events. You can write your own
version of the '''Tcl_VwaitObjCmd''' command in C, and call '''Tcl_DoOneEvent'''
with the flags you need. Making it possible to specify the event sources,
i.e. the arguments for the call to '''Tcl_DoOneEvent''' within
'''Tcl_VwaitObjCmd''', from the Tcl '''vwait''' command would make this
functionality available from the Tcl lebvel.

~ Specification

This document proposes to add optional arguments to the '''vwait''' command. If
these arguments are not specified, the current event source 
'''TCL_ALL_EVENTS''' will be used. If the optinal arguments are specified, they
are the event sources to be passed to '''Tcl_DoOneEvent''' within
'''Tcl_VwaitObjCmd'''. The flags set with the optinal arguments will be
or-ed. Possible flags are corresponding to the flags for the
'''Tcl_DoOneEvent''' command:

   * '''-all''' (default) - process all events

   * '''-file'''          - process file events

   * '''-idle'''          - process idle events

   * '''-timer'''         - process timer events

   * '''-window'''        - process window system events

Example: wait until variable '''a''' is written and only allow timer events to
be processed:

| vwait a -timer

~ Alternatives

A possible alternative is to add support for a '''-events <event_list>'''
argument.

A '''-dont_wait''' argument is not added, a call to '''update''' will have the
same effect.

~ Compatibility

No incompatibilities are introduced.

~ Reference Implementation

A reference implementation is available.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|
|

|

|

|
|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

# TIP 434: Specify Event Sources for 'vwait'

	Author:		Jos Decoster <[email protected]>
	State:		Draft
	Type:		Project
	Tcl-Version:	8.6
	Vote:	Pending
	Created:	26-Feb-2015
	Post-History:	
-----

# Abstract

This TIP proposes to extend the **vwait** Tcl command so the event sources can
be specified, as is possible with the **Tcl\_DoOneEvent** C command.

# Rationale

In some situations it can be required not to wait for specific event sources or
to wait for specific events sources only. You might want the program to only
react on timer events, and not on file or window events. You can write your own
version of the **Tcl\_VwaitObjCmd** command in C, and call **Tcl\_DoOneEvent**
with the flags you need. Making it possible to specify the event sources,
i.e. the arguments for the call to **Tcl\_DoOneEvent** within
**Tcl\_VwaitObjCmd**, from the Tcl **vwait** command would make this
functionality available from the Tcl lebvel.

# Specification

This document proposes to add optional arguments to the **vwait** command. If
these arguments are not specified, the current event source 
**TCL\_ALL\_EVENTS** will be used. If the optinal arguments are specified, they
are the event sources to be passed to **Tcl\_DoOneEvent** within
**Tcl\_VwaitObjCmd**. The flags set with the optinal arguments will be
or-ed. Possible flags are corresponding to the flags for the
**Tcl\_DoOneEvent** command:

   * **-all** \(default\) - process all events

   * **-file**          - process file events

   * **-idle**          - process idle events

   * **-timer**         - process timer events

   * **-window**        - process window system events

Example: wait until variable **a** is written and only allow timer events to
be processed:

	 vwait a -timer

# Alternatives

A possible alternative is to add support for a **-events <event\_list>**
argument.

A **-dont\_wait** argument is not added, a call to **update** will have the
same effect.

# Compatibility

No incompatibilities are introduced.

# Reference Implementation

A reference implementation is available.

# Copyright

This document has been placed in the public domain.

Name change from tip/435.tip to tip/435.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57

TIP:		435
Title: 		Safe Mutex Disposal API
Version: 	$Revision: 1.7 $
Author:		Donal Fellows <[email protected]>
Author:		Joe Mistachkin <[email protected]>
State:		Rejected
Type:		Project
Tcl-Version:	8.6.5
Vote:		Done
Created:	16-May-2015
Post-History:	

~ Abstract

This TIP proposes a new C API for improving mutex deletion.

~ Rationale

Context: Bug #57945b574a

There is a race condition in the code that disposes of mutexes, in that a
mutex must only be disposed of when it is not in use by another thread, yet
the disposal code does not lock it. This would not be a particular problem as
there is a ''global'' lock that protects the disposal code, except that during
the cleanup immediately after a fork (during the '''exec''' command, for
example) things can get deeply confused, and trigger deadlocks under heavy
load. We need to be careful and make sure that we really hold the global lock
when unlocking and disposing mutexes.

Because the pipeline-opening code isn't the only thing that might ever fork
internally, we should provide the capability to do this correctly as part of
Tcl's public API.

~ Specification

This TIP specifies a single new function:

 > void	'''Tcl_MutexUnlockAndFinalize'''(Tcl_Mutex *''mutex'');

The '''Tcl_MutexUnlockAndFinalize''' function (which takes a single argument,
the mutex to operate on) will atomically unlock the mutex and dispose of it
without giving an opportunity for another thread to lock the mutex between
unlocking and disposal.  The mutex must have previously been locked by
'''Tcl_MutexLock'''.

~ Implementation

See branch bug-57945b574a.

~ Acknowlegement

Thanks to Gustaf Neumann for his trouble tracking this down, and apologies for
the problems the fault has caused him.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|
|
|

|

|

|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57

# TIP 435: Safe Mutex Disposal API

	Author:		Donal Fellows <[email protected]>
	Author:		Joe Mistachkin <[email protected]>
	State:		Rejected
	Type:		Project
	Tcl-Version:	8.6.5
	Vote:		Done
	Created:	16-May-2015
	Post-History:	
-----

# Abstract

This TIP proposes a new C API for improving mutex deletion.

# Rationale

Context: Bug \#57945b574a

There is a race condition in the code that disposes of mutexes, in that a
mutex must only be disposed of when it is not in use by another thread, yet
the disposal code does not lock it. This would not be a particular problem as
there is a _global_ lock that protects the disposal code, except that during
the cleanup immediately after a fork \(during the **exec** command, for
example\) things can get deeply confused, and trigger deadlocks under heavy
load. We need to be careful and make sure that we really hold the global lock
when unlocking and disposing mutexes.

Because the pipeline-opening code isn't the only thing that might ever fork
internally, we should provide the capability to do this correctly as part of
Tcl's public API.

# Specification

This TIP specifies a single new function:

 > void	**Tcl\_MutexUnlockAndFinalize**\(Tcl\_Mutex \*_mutex_\);

The **Tcl\_MutexUnlockAndFinalize** function \(which takes a single argument,
the mutex to operate on\) will atomically unlock the mutex and dispose of it
without giving an opportunity for another thread to lock the mutex between
unlocking and disposal.  The mutex must have previously been locked by
**Tcl\_MutexLock**.

# Implementation

See branch bug-57945b574a.

# Acknowlegement

Thanks to Gustaf Neumann for his trouble tracking this down, and apologies for
the problems the fault has caused him.

# Copyright

This document has been placed in the public domain.

Name change from tip/436.tip to tip/436.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56

TIP:		436
Title:		Improve TclOO isa Introspection
State:		Final
Type:		Project
Tcl-Version:	8.6.5
Vote:		Done
Post-History:	
Version:	$Revision: 1.3 $
Author:		Donal Fellows <[email protected]>
Created:	30-Jun-2015

~ Abstract

The various '''info object isa''' introspectors should not produce errors when
given a non-object; the set membership tests should simply return boolean
false in those cases.

~ Rationale

The '''info object isa''' command is intended to be used to allow asking
whether some object is a member of a general set of entities; for example,
'''info object isa object''' allows querying whether you actually have a
handle to an object at all. However, the other membership sets all throw an
error when given a non-object. This complicates the use of the API when all
that is really needed is to return a '''false''' value in those cases.

Motivating example (with thanks to Will Duquette): is the '''proc''' a class?
No. It's not even an object, so it clearly cannot be a class and so the
following command should produce false (or 0) and not an error:

| info object isa class proc

~ Proposed Change

Where one of the '''info object isa''' introspectors:

 * '''info object isa''' ''class object''

 * '''info object isa metaclass''' ''object''

 * '''info object isa mixin''' ''object class''

 * '''info object isa object''' ''object''

 * '''info object isa typeof''' ''object class''

Would produce an error due to either the ''object'' or (where appropriate) the
''class'' object not passing some critical precondition to the test, the
result of the command will be '''0''' (i.e., boolean false). Errors will be
still generated when the wrong number of arguments are supplied.

Note that this rule is already followed by '''info object isa object'''.

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
>

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|
|

|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56

# TIP 436: Improve TclOO isa Introspection
	State:		Final
	Type:		Project
	Tcl-Version:	8.6.5
	Vote:		Done
	Post-History:	

	Author:		Donal Fellows <[email protected]>
	Created:	30-Jun-2015
-----

# Abstract

The various **info object isa** introspectors should not produce errors when
given a non-object; the set membership tests should simply return boolean
false in those cases.

# Rationale

The **info object isa** command is intended to be used to allow asking
whether some object is a member of a general set of entities; for example,
**info object isa object** allows querying whether you actually have a
handle to an object at all. However, the other membership sets all throw an
error when given a non-object. This complicates the use of the API when all
that is really needed is to return a **false** value in those cases.

Motivating example \(with thanks to Will Duquette\): is the **proc** a class?
No. It's not even an object, so it clearly cannot be a class and so the
following command should produce false \(or 0\) and not an error:

	 info object isa class proc

# Proposed Change

Where one of the **info object isa** introspectors:

 * **info object isa** _class object_

 * **info object isa metaclass** _object_

 * **info object isa mixin** _object class_

 * **info object isa object** _object_

 * **info object isa typeof** _object class_

Would produce an error due to either the _object_ or \(where appropriate\) the
_class_ object not passing some critical precondition to the test, the
result of the command will be **0** \(i.e., boolean false\). Errors will be
still generated when the wrong number of arguments are supplied.

Note that this rule is already followed by **info object isa object**.

# Copyright

This document has been placed in the public domain.

Name change from tip/437.tip to tip/437.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48

TIP:		437
Title:		Tk panedwindow options for proxy window
State:		Final
Type:		Project
Tcl-Version:	8.5.18
Vote:		Done
Post-History:	
Version:	$Revision: 1.5 $
Author:		Eric Boudaillier <[email protected]>
Author:		Fran�ois Vogel <[email protected]>
Created:	14-Jul-2015
Keywords:	Tk

~ Abstract

The proxy window (i.e., the moving sash) of the Tk paned window widget is hard
to see in some circumstances.  This TIP adds three options allowing more
control over the display of the proxy so that its visibility can be enhanced
where required.

~ Rationale

As identified in [Bug: 1247115, https://core.tcl.tk/tk/tktview/1247115], a
flat sashrelief is common for '''panedwindow''' widgets, when it separates two
widgets with sunken relief.  For example, the left part can be a tree and the
right part a text widget, both with a white background.  Under Windows, the
paned window has a light grey color, and in this configuration, the proxy
window is not well visible when it is moved over its managed widgets.

~ Proposed Change

It is proposed to add three options to the Tk '''panedwindow''' widget:
'''-proxybackground''', '''-proxyrelief''' and '''-proxyborderwidth'''.

 * '''-proxybackground''' controls the background of the proxy window.  If
   empty (the default), the background is that of the panedwindow widget,
   which is the current behaviour.

 * '''-proxyrelief''' controls the relief of the proxy window.  If empty (the
   default), the relief is that of the panedwindow widget, which is the
   current behaviour.

 * '''-proxyborderwidth''' controls the border width of the proxy window.  The
   default value is 2, which is the current value.

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
|
|
>

|

|

|

|
|

|

|
|

|
|

|
|

|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48

# TIP 437: Tk panedwindow options for proxy window
	State:		Final
	Type:		Project
	Tcl-Version:	8.5.18
	Vote:		Done
	Post-History:	

	Author:		Eric Boudaillier <[email protected]>
	Author:		François Vogel <[email protected]>
	Created:	14-Jul-2015
	Keywords:	Tk
-----

# Abstract

The proxy window \(i.e., the moving sash\) of the Tk paned window widget is hard
to see in some circumstances.  This TIP adds three options allowing more
control over the display of the proxy so that its visibility can be enhanced
where required.

# Rationale

As identified in [Bug: 1247115, <https://core.tcl.tk/tk/tktview/1247115],> a
flat sashrelief is common for **panedwindow** widgets, when it separates two
widgets with sunken relief.  For example, the left part can be a tree and the
right part a text widget, both with a white background.  Under Windows, the
paned window has a light grey color, and in this configuration, the proxy
window is not well visible when it is moved over its managed widgets.

# Proposed Change

It is proposed to add three options to the Tk **panedwindow** widget:
**-proxybackground**, **-proxyrelief** and **-proxyborderwidth**.

 * **-proxybackground** controls the background of the proxy window.  If
   empty \(the default\), the background is that of the panedwindow widget,
   which is the current behaviour.

 * **-proxyrelief** controls the relief of the proxy window.  If empty \(the
   default\), the relief is that of the panedwindow widget, which is the
   current behaviour.

 * **-proxyborderwidth** controls the border width of the proxy window.  The
   default value is 2, which is the current value.

# Copyright

This document has been placed in the public domain.

Name change from tip/438.tip to tip/438.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149

150
151
152
153
154
155
156
157
158
159
160
161
162
163

164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179

180
181
182
183
184
185
186
187

188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253

TIP:            438
Title:          Ensure Line Metrics are Up-to-Date
Version:        $Revision: 1.16 $
Author:         Fran�ois Vogel <[email protected]>
Author:         Jan Nijtmans <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        01-Nov-2015
Post-History:   
Keywords:       Tk,text
Tcl-Version:    8.6.5

~ Abstract

The text widget calculates line metrics asynchronously, for performance
reasons.  Because of this, some commands of the text widget may return wrong
results if the asynchronous calculations are not over.  This TIP is about
providing the user with ways to ensure that line metrics are up-to-date.

~ Rationale

The text widget features asynchronous calculation of the display height of
logical lines. The reasons for this and the details of the implementation are
explained at the beginning of tkTextDisp.c.

This approach has definite advantages among which responsivity of the text
widget is important. Yet, there are drawbacks in the fact the calculation is
asynchronous. Some commands of the text widget may return wrong results if the
asynchronous calculations are not finished at the time these commands are
called. For example this is the case of '''.text count -ypixels''', which was
solved by adding a modifier '''-update''' allowing the user to be sure any
possible out of date line height information is recalculated.

It appears that aside of '''.text count -ypixels''' there are several other
cases where wrong results can be produced by text widget commands. These cases
are illustrated in several bug reports:

 * [http://core.tcl.tk/tk/tktview/1566949]  (.text yview moveto)

 * [http://core.tcl.tk/tk/tktview/e51941c]  (.text yview)

In all these cases, forcing the update by calling '''.text count -update
-ypixels 1.0 end''' before calling '''.text yview''', or '''.text yview
moveto''' solves the issue presented in the ticket. This has however a
performance cost, of course, but the above tickets show that there are cases
where the programmer needs accurate results, be it at the cost of the time
needed to get the line heights calculations up-to-date.

This TIP is about providing the user/programmer with (better) ways to ensure
that line metrics are up-to-date.

Indeed it is not appropriate to let the concerned commands always force update
of the line metrics or wait for the end of the update calculation each time
they are called: performance impact would be way too large.

Also, it has to be noted that the '''update''' command is of no help here since
the line metrics calculation is done within the event loop in a chained
sequence of [after 1] handlers.

~ Proposed Change

It is proposed to add two new commands to the text widget:

 > ''pathName'' '''sync''' ''?-command command?''

 > ''pathName'' '''pendingsync'''

Also a new virtual event '''<<WidgetViewSync>>''' will be added.

Description:

''pathName'' '''sync'''

    Immediately brings the line metrics up-to-date by forcing computation of
    any outdated line pixel heights. Indeed, to maintain a responsive
    user-experience, the text widget caches line heights and re-calculates them
    in the background. The command returns immediately if there is no such
    outdated line heights, otherwise it returns only at the end of the
    computation. The command returns an empty string.

Implementation details: The command executes:

|    TkTextUpdateLineMetrics(textPtr, 1,
|	      TkBTreeNumLines(textPtr->sharedTextPtr->tree, textPtr), -1);

''pathName'' '''sync''' '''-command''' ''command''

    Schedule ''command'' to be executed exactly once as soon as all line
    calculations are up-to-date. If there are no pending line metrics
    calculations, the scheduling is immediate. The command returns the empty
    string. '''bgerror''' is called on ''command'' failure.

''pathName'' '''pendingsync'''

    Returns 1 if the line calculations are not up-to-date, 0 otherwise.

'''<<WidgetViewSync>>'''

    A widget can have a period of time during which the internal data model is
    not in sync with the view. The '''sync''' method forces the view to be in
    sync with the data. The '''<<WidgetViewSync>>''' virtual event fires when
    the internal data model starts to be out of sync with the widget view, and
    also when it becomes again in sync with the widget view. For the text
    widget, it fires when line metrics become outdated, and when they are
    up-to-date again. Note that this means it fires in particular when
    ''pathName'' '''sync''' returns (if there was pending updates). The detail
    field (%d substitution) is either true (when the widget is in sync) or
    false (when it is not).

All '''sync''', '''pendingsync''' and '''<<WidgetViewSync>>''' apply to
each text widget independently of its peers.

The names '''sync''', '''pendingsync''' and '''<<WidgetViewSync>>''' are chosen
because of the potential for generalization to other widgets they have.

The text widget documentation will be augmented by a short section describing
the asynchronous update of line metrics, the reasons for that background
update, the drawbacks regarding possibly wrong results in '''.text yview''' or
'''.text yview moveto''', and the way to solve these issues by using the new
commands. Example code as below will be provided in the documentation, since
this code will not be included in the library (i.e. in ''text.tcl'')).

The existing '''-update''' modifier switch of '''.text count''' will become
obsolete. It will be declared as deprecated in the text widget documentation
page while being still supported for backwards compatibility reasons.

Using the new commands, ways to ensure accurate results in '''.text yview''',
or '''.text yview moveto''' are as in the following example:

|    ## Example 1:
|
|    # runtime, immediately complete line metrics at any cost (GUI unresponsive)
|    $w sync
|    $w yview moveto $fraction
|
|    ## Example 2:
|
|    # runtime, synchronously wait for up-to-date line metrics (GUI responsive)
|    $w sync -command [list $w yview moveto $fraction]
|
|    ## Example 3:
|
|    # init
|    set yud($w) 0
|    proc updateaction w {
|        set ::yud($w) 1
|        # any other update action here...
|    }

|
|    # runtime, synchronously wait for up-to-date line metrics (GUI responsive)
|    $w sync -command [list updateaction $w]
|    vwait yud($w)
|    $w yview moveto $fraction
|
|    ## Example 4:
|
|    # init
|    set todo($w) {}
|    proc updateaction w {
|        foreach cmd $::todo($w) {uplevel #0 $cmd}
|        set todo($w) {}
|    }

|
|    # runtime
|    lappend todo($w) [list $w yview moveto $fraction]
|    $w sync -command [list updateaction $w]
|
|    ## Example 5:
|
|    # init
|    set todo($w) {}
|
|    bind $w <<WidgetViewSync>> {
|        if {%d} {
|            foreach cmd $todo(%W) {eval $cmd}
|            set todo(%W) {}
|        }
|    }

|
|    # runtime
|    if {![$w pendingsync]} {
|        $w yview moveto $fraction
|    } else {
|        lappend todo($w) [list $w yview moveto $fraction]
|    }

~ Rejected alternatives

 * Use a script-visible array variable such as '''::tk::metricsDone($w)'''
   instead of an event.

 * Don't change the source code and better document the '''.text count -update
   -ypixels''' trick. This is believed to be suboptimal considering that
   '''.text count''' indeed performs counting (which has a cost). This
   performance drawback could however be very much alleviated by counting
   between the two same indices: there would be no cost at all if this case
   was detected and was a short-cut in function TextWidgetObjCmd.

 * Instead of a new text widget sub-command, follow the lines of the existing
   example of '''text count''' and provide a new modifier switch '''-update'''
   to all sub-commands that may need it. The list of such sub-commands include
   '''text yview''', '''text yview moveto''', and '''text yview scroll'''.

 * '''update idletasks''' could force line metrics calculation update (in
   addition to what this command already does). This is certainly not the
   right thing to do since it is not very flexible. It would impact the
   performance of all text widgets whereas perhaps only one of them needs
   up-to-date line heights. Also, one could want to update idletasks (in the
   current sense: idle tasks) but not the line heights calculation, or the
   opposite. All in all, linking the event loop and the line heights
   calculation seems bad.

 * For each sub-command that needs up-to-date line heights to provide fully
   correct results, detect whether it is the case or not at the time they are
   called. If so, fine. If not, there could be two ways forward:

 > 1. Force the update. This is not believed to be desirable, again for
   performance reasons. While there are cases where accurate results are
   mandatory (see the tickets above), most of the time one can live with
   approximate results. Any mismatch is temporary, since the asynchronous line
   height calculations will always catch up eventually. It is preferred to let
   the programmer decide if this update is needed or not.

 > 2. Decide that the line height of not yet up-to-date lines is equal to some
   reasonable value, for instance the height of the first displayed line
   (which is likely up-to-date). For text widgets using only a single font,
   this would be OK since all line heights are then the same. However this
   would not solve all cases, for instance in '''text yview''' where the total
   number of pixels used by the text widget contents is needed, because this
   total pixel height calculation involves the total number of display (not
   logical) lines. Assessing the total number of display lines has a
   performance cost similar to proper line heights calculation, which voids
   that path.

 * It has been proposed that the detail field %d for the
   '''<<WidgetViewSync>>''' event contain the number of outdated lines, while
   this event would fire at each [after 1] partial update of the line metrics.
   This was rejected since no use case of this value could be exhibited, and it
   was believed that firing the event twice (when out of sync and when again in
   sync) was sufficient.

 * It has been proposed that the '''text pendingsync''' command return the
   number of currently outdated lines. This was rejected because no use case
   could be found, and because this TIP aims at generalization and it might be
   hard to define the equivalent of "number of lines to do" for other widgets.
   Anyway, using a boolean now (noted as "1" and "0", rather than "true" and
   "false") leaves room to change our minds later with minimal incompatibility,
   since [if {[.t pendingsync]}] will keep its semantics with an integer.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|
|

|

|

|

|
|
|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|
|

|
|
|

|

|

|
|

|

|

|
|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
<
>
>
|
|
|
|
|
|
<
|
>
|

|

|
|
|

|

|

|
|

|
|

|

|

|

|

|

|
|

|

|
|

|

|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147

148
149
150
151
152
153
154
155
156
157
158
159
160
161

162
163
164
165
166
167
168
169
170
171
172
173
174
175
176

177
178
179
180
181
182
183
184

185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253

# TIP 438: Ensure Line Metrics are Up-to-Date

	Author:         François Vogel <[email protected]>
	Author:         Jan Nijtmans <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        01-Nov-2015
	Post-History:   
	Keywords:       Tk,text
	Tcl-Version:    8.6.5
-----

# Abstract

The text widget calculates line metrics asynchronously, for performance
reasons.  Because of this, some commands of the text widget may return wrong
results if the asynchronous calculations are not over.  This TIP is about
providing the user with ways to ensure that line metrics are up-to-date.

# Rationale

The text widget features asynchronous calculation of the display height of
logical lines. The reasons for this and the details of the implementation are
explained at the beginning of tkTextDisp.c.

This approach has definite advantages among which responsivity of the text
widget is important. Yet, there are drawbacks in the fact the calculation is
asynchronous. Some commands of the text widget may return wrong results if the
asynchronous calculations are not finished at the time these commands are
called. For example this is the case of **.text count -ypixels**, which was
solved by adding a modifier **-update** allowing the user to be sure any
possible out of date line height information is recalculated.

It appears that aside of **.text count -ypixels** there are several other
cases where wrong results can be produced by text widget commands. These cases
are illustrated in several bug reports:

 * <http://core.tcl.tk/tk/tktview/1566949>   \(.text yview moveto\)

 * <http://core.tcl.tk/tk/tktview/e51941c>   \(.text yview\)

In all these cases, forcing the update by calling **.text count -update
-ypixels 1.0 end** before calling **.text yview**, or **.text yview
moveto** solves the issue presented in the ticket. This has however a
performance cost, of course, but the above tickets show that there are cases
where the programmer needs accurate results, be it at the cost of the time
needed to get the line heights calculations up-to-date.

This TIP is about providing the user/programmer with \(better\) ways to ensure
that line metrics are up-to-date.

Indeed it is not appropriate to let the concerned commands always force update
of the line metrics or wait for the end of the update calculation each time
they are called: performance impact would be way too large.

Also, it has to be noted that the **update** command is of no help here since
the line metrics calculation is done within the event loop in a chained
sequence of [after 1] handlers.

# Proposed Change

It is proposed to add two new commands to the text widget:

 > _pathName_ **sync** _?-command command?_

 > _pathName_ **pendingsync**

Also a new virtual event **<<WidgetViewSync>>** will be added.

Description:

_pathName_ **sync**

    Immediately brings the line metrics up-to-date by forcing computation of
    any outdated line pixel heights. Indeed, to maintain a responsive
    user-experience, the text widget caches line heights and re-calculates them
    in the background. The command returns immediately if there is no such
    outdated line heights, otherwise it returns only at the end of the
    computation. The command returns an empty string.

Implementation details: The command executes:

	    TkTextUpdateLineMetrics(textPtr, 1,
		      TkBTreeNumLines(textPtr->sharedTextPtr->tree, textPtr), -1);

_pathName_ **sync** **-command** _command_

    Schedule _command_ to be executed exactly once as soon as all line
    calculations are up-to-date. If there are no pending line metrics
    calculations, the scheduling is immediate. The command returns the empty
    string. **bgerror** is called on _command_ failure.

_pathName_ **pendingsync**

    Returns 1 if the line calculations are not up-to-date, 0 otherwise.

**<<WidgetViewSync>>**

    A widget can have a period of time during which the internal data model is
    not in sync with the view. The **sync** method forces the view to be in
    sync with the data. The **<<WidgetViewSync>>** virtual event fires when
    the internal data model starts to be out of sync with the widget view, and
    also when it becomes again in sync with the widget view. For the text
    widget, it fires when line metrics become outdated, and when they are
    up-to-date again. Note that this means it fires in particular when
    _pathName_ **sync** returns \(if there was pending updates\). The detail
    field \(%d substitution\) is either true \(when the widget is in sync\) or
    false \(when it is not\).

All **sync**, **pendingsync** and **<<WidgetViewSync>>** apply to
each text widget independently of its peers.

The names **sync**, **pendingsync** and **<<WidgetViewSync>>** are chosen
because of the potential for generalization to other widgets they have.

The text widget documentation will be augmented by a short section describing
the asynchronous update of line metrics, the reasons for that background
update, the drawbacks regarding possibly wrong results in **.text yview** or
**.text yview moveto**, and the way to solve these issues by using the new
commands. Example code as below will be provided in the documentation, since
this code will not be included in the library \(i.e. in _text.tcl_\)\).

The existing **-update** modifier switch of **.text count** will become
obsolete. It will be declared as deprecated in the text widget documentation
page while being still supported for backwards compatibility reasons.

Using the new commands, ways to ensure accurate results in **.text yview**,
or **.text yview moveto** are as in the following example:

	    ## Example 1:

	    # runtime, immediately complete line metrics at any cost (GUI unresponsive)
	    $w sync
	    $w yview moveto $fraction

	    ## Example 2:

	    # runtime, synchronously wait for up-to-date line metrics (GUI responsive)
	    $w sync -command [list $w yview moveto $fraction]

	    ## Example 3:

	    # init
	    set yud($w) 0
	    proc updateaction w {
	        set ::yud($w) 1
	        # any other update action here...

	    }

	    # runtime, synchronously wait for up-to-date line metrics (GUI responsive)
	    $w sync -command [list updateaction $w]
	    vwait yud($w)
	    $w yview moveto $fraction

	    ## Example 4:

	    # init
	    set todo($w) {}
	    proc updateaction w {
	        foreach cmd $::todo($w) {uplevel #0 $cmd}
	        set todo($w) {}

	    }

	    # runtime
	    lappend todo($w) [list $w yview moveto $fraction]
	    $w sync -command [list updateaction $w]

	    ## Example 5:

	    # init
	    set todo($w) {}

	    bind $w <<WidgetViewSync>> {
	        if {%d} {
	            foreach cmd $todo(%W) {eval $cmd}
	            set todo(%W) {}

	        }
	    }

	    # runtime
	    if {![$w pendingsync]} {
	        $w yview moveto $fraction
	    } else {
	        lappend todo($w) [list $w yview moveto $fraction]

	    }

# Rejected alternatives

 * Use a script-visible array variable such as **::tk::metricsDone\($w\)**
   instead of an event.

 * Don't change the source code and better document the **.text count -update
   -ypixels** trick. This is believed to be suboptimal considering that
   **.text count** indeed performs counting \(which has a cost\). This
   performance drawback could however be very much alleviated by counting
   between the two same indices: there would be no cost at all if this case
   was detected and was a short-cut in function TextWidgetObjCmd.

 * Instead of a new text widget sub-command, follow the lines of the existing
   example of **text count** and provide a new modifier switch **-update**
   to all sub-commands that may need it. The list of such sub-commands include
   **text yview**, **text yview moveto**, and **text yview scroll**.

 * **update idletasks** could force line metrics calculation update \(in
   addition to what this command already does\). This is certainly not the
   right thing to do since it is not very flexible. It would impact the
   performance of all text widgets whereas perhaps only one of them needs
   up-to-date line heights. Also, one could want to update idletasks \(in the
   current sense: idle tasks\) but not the line heights calculation, or the
   opposite. All in all, linking the event loop and the line heights
   calculation seems bad.

 * For each sub-command that needs up-to-date line heights to provide fully
   correct results, detect whether it is the case or not at the time they are
   called. If so, fine. If not, there could be two ways forward:

	 > 1. Force the update. This is not believed to be desirable, again for
   performance reasons. While there are cases where accurate results are
   mandatory \(see the tickets above\), most of the time one can live with
   approximate results. Any mismatch is temporary, since the asynchronous line
   height calculations will always catch up eventually. It is preferred to let
   the programmer decide if this update is needed or not.

	 > 2. Decide that the line height of not yet up-to-date lines is equal to some
   reasonable value, for instance the height of the first displayed line
   \(which is likely up-to-date\). For text widgets using only a single font,
   this would be OK since all line heights are then the same. However this
   would not solve all cases, for instance in **text yview** where the total
   number of pixels used by the text widget contents is needed, because this
   total pixel height calculation involves the total number of display \(not
   logical\) lines. Assessing the total number of display lines has a
   performance cost similar to proper line heights calculation, which voids
   that path.

 * It has been proposed that the detail field %d for the
   **<<WidgetViewSync>>** event contain the number of outdated lines, while
   this event would fire at each [after 1] partial update of the line metrics.
   This was rejected since no use case of this value could be exhibited, and it
   was believed that firing the event twice \(when out of sync and when again in
   sync\) was sufficient.

 * It has been proposed that the **text pendingsync** command return the
   number of currently outdated lines. This was rejected because no use case
   could be found, and because this TIP aims at generalization and it might be
   hard to define the equivalent of "number of lines to do" for other widgets.
   Anyway, using a boolean now \(noted as "1" and "0", rather than "true" and
   "false"\) leaves room to change our minds later with minimal incompatibility,
   since [if {[.t pendingsync]\}] will keep its semantics with an integer.

# Copyright

This document has been placed in the public domain.

Name change from tip/439.tip to tip/439.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161

TIP:            439
Title:          Semantic Versioning
Version:        $Revision: 1.8 $
Author:         Jan Nijtmans <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        08-Dec-2015
Post-History:   
Tcl-Version:    8.7

~ Abstract

The version schema used by Tcl and Tk has the form MAJOR.MINOR.PATCH, which is
the same schema used by "Semantic Versioning" [http://semver.org/]. For alpha
and beta releases the schema is MAJOR.MINORaPATCH resp MAJOR.MINORbPATCH,
which is not following the "Semantic Versioning" rules, but it's close.
This TIP proposes to start using "Semantic Versioning" for Tcl and Tk,
starting with Tcl/Tk 8.7, without making it mandatory for extensions and
Tcl modules: existing extensions and modules written for Tcl/Tk 8.6
or lower must cooperate unmodified with later 8.x versions as well.

~ Rationale

Semantic Versioning is an attempt to assign meaning to a software
version number. It has a very simple rule:

 * Given a version number MAJOR.MINOR.PATCH, increment the:

 > * MAJOR version when you make incompatible API changes,

 > * MINOR version when you add functionality in a backwards-compatible
     manner, and

 > * PATCH version when you make backwards-compatible bug fixes.

 * Additional labels for pre-release and build metadata are available as
   extensions to the MAJOR.MINOR.PATCH format.

As the version number of Tcl has the same form MAJOR.MINOR.PATCH, nothing
needs to be done here: future Tcl releases can be done following the Semantic
Versioning rules. Tcl/Tk alpha/beta releases have the form
MAJOR.MINOR[ab]PATCH, while Semantic Versioning dictates a form siminar to
MAJOR.MINOR.0(-alpha.|-beta.)PATCH: numeric and non-numeric parts must be
preceded by a dash, and separated by additional dots.

So, it is just a small step to adopt the Semantic Versioning idea. This TIP
proposes to do just that, and describes the implications it has on Tcl and
Tk. Alpha releases allow two forms of the version number, the semantic form
MAJOR.MINOR.0-alpha.PATCH or the legacy form MAJOR.MINORaPATCH.  For beta
releases this will be MAJOR.MINOR.0-beta.PATCH resp MAJOR.MINORbPATCH.  In Tcl
9.0, the legacy form might be removed and possibly enhanced to support all
semantic versioning forms, but this is outside the scope of this TIP.

Semantic Versioning will only be adopted for Tcl 8.7 and higher, so Tcl 8.5.x
and 8.6.x will not be affected. This means that it is possible to introduce a
minor new feature in 8.6.6, which would mandate a MINOR increment under the
Semantic Versioning rules. This TIP doesn't apply to Tcl extensions either,
each extension writer is free in whatever version strategy they choose.

~ Proposed Change

This TIP proposes to adopt Semantic Versioning for Tcl and Tk 8.7 and higher.
An exception will be made for Tcl extensions and Tcl modules, each extension
and module author will be free to choose whether or not to adopt Semantic
Versioning. Existing extensions/modules will continue to cooperate unchanged
with future Tcl and Tk 8.x releases.

One of the implications of this change is that there - most likely -
will be future Tcl 8.7/8.8/8.9/8.10 releases. Since all Tcl minor
releases can be installed next to each other, this would be a
maintenance burden since all of those versions need to be maintained
during serveral years in the future. Solution: drop the minor number
from all Tcl and Tk filenames and installation directories. This way,
Tcl 8.7 and 8.8 cannot be installed next to each other any more,
they share the same installation directory if the installation
''prefix'' is the same. Does that matter? No, because you always
can choose a different prefix. Actually, there is no need for 
Tcl 8.7 any more when Tcl 8.8 is available, as they are 100%
upwards compatible: the Semantic Versioning rules assure this.
As soon as Tcl 8.8 is released, no new Tcl 8.7 releases will
come out any more. Any incompatible change will have to wait
for Tcl 9 (or 10 or 11...). Tcl 8.5 and 8.6 releases will
continue to be supported as long as there is sufficient interest,
this TIP doesn't change anything on that.

An important implication of dropping the minor number in
the Tcl installation script directory is that it would
become "<prefix>/lib/tcl8" in stead of "<prefix>/lib/tcl8.7".
This is a problem, because this is the same directory used for
Tcl Modules, which still need to support existing extensions.
This TIP therefore proposes to change TCL_LIBRARY to
'<prefix>/share/tcl8', that won't conflict with anything used thus far.

Since Tcl now starts using TCL_LIBRARY being a subdirectory of
"<prefix>/share", it seems logical to start using this directory
for man-pages as well. Therefore, it is proposed to upgrade
"autoconf" to the latest version (2.69), which brings the
man-page change without further hurdle.

All together, the proposed directory structure (UNIX)

 > ''<prefix>/bin/tclsh8'' '''Tcl executable'''

 > ''<prefix>/lib/libtcl8.so'' '''Tcl shared library'''

 > ''<prefix>/lib/tcl8'' '''Tcl Modules'''

 > ''<prefix>/lib/tcl8/8.6''

 > ''<prefix>/lib/tcl8/8.7''

 > ''<prefix>/lib/tcl8.6'' '''TCL_LIBRARY for Tcl 8.6'''

 > ''<prefix>/share/tcl8'' '''TCL_LIBRARY for Tcl 8.x (x>=7)'''

 > ''<prefix>/share/man'' '''Tcl manual pages'''

Regarding backporting, any change done in Tcl 8.7 could in principle
be backported to Tcl 8.6 and further to 8.5, if desired. This TIP
doesn't put a restriction to that, as Semantic Versioning only starts
with version 8.7. Still it would be desirable for the TCT to describe a
procedure for that, this is outside the scope of this TIP.

Semantic Versioning requires that any API which is removed in Tcl 9.0 must be
made deprecated in Tcl 8.7 (or 8.8). A new macro TCL_DEPRECATED will be
introduced for that in the *Decls.h files. If Tcl 8.7 is compiled with the
flag TCL_NO_DEPRECATED, all deprecated API is removed, by making those entries
MODULE_SCOPE, and putting 0 in the corresponding stub entries.  This can be
used by extensions to see whether they are compatible with the next major Tcl
release or not.

The following API's are declared deprecated in Tcl 8.7 and will be
removed in Tcl 9.0. They are already removed in the "novem" branch:

 * Tcl_Backslash

 * Tcl_EvalFile

 * Tcl_GetDefaultEncodingDir

 * Tcl_SetDefaultEncodingDir

 * Tcl_EvalTokens

 * Tcl_CreateMathFunc

 * Tcl_GetMathFuncInfo

 * Tcl_ListMathFuncs

An implementation of this TIP can be found at [http://core.tcl.tk/tcl]; branch
"semver".

~ Rejected Alternatives

TODO

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|
|
|

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161

# TIP 439: Semantic Versioning

	Author:         Jan Nijtmans <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        08-Dec-2015
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

The version schema used by Tcl and Tk has the form MAJOR.MINOR.PATCH, which is
the same schema used by "Semantic Versioning" <http://semver.org/> . For alpha
and beta releases the schema is MAJOR.MINORaPATCH resp MAJOR.MINORbPATCH,
which is not following the "Semantic Versioning" rules, but it's close.
This TIP proposes to start using "Semantic Versioning" for Tcl and Tk,
starting with Tcl/Tk 8.7, without making it mandatory for extensions and
Tcl modules: existing extensions and modules written for Tcl/Tk 8.6
or lower must cooperate unmodified with later 8.x versions as well.

# Rationale

Semantic Versioning is an attempt to assign meaning to a software
version number. It has a very simple rule:

 * Given a version number MAJOR.MINOR.PATCH, increment the:

	 > \* MAJOR version when you make incompatible API changes,

	 > \* MINOR version when you add functionality in a backwards-compatible
     manner, and

	 > \* PATCH version when you make backwards-compatible bug fixes.

 * Additional labels for pre-release and build metadata are available as
   extensions to the MAJOR.MINOR.PATCH format.

As the version number of Tcl has the same form MAJOR.MINOR.PATCH, nothing
needs to be done here: future Tcl releases can be done following the Semantic
Versioning rules. Tcl/Tk alpha/beta releases have the form
MAJOR.MINOR[ab]PATCH, while Semantic Versioning dictates a form siminar to
MAJOR.MINOR.0\(-alpha.\|-beta.\)PATCH: numeric and non-numeric parts must be
preceded by a dash, and separated by additional dots.

So, it is just a small step to adopt the Semantic Versioning idea. This TIP
proposes to do just that, and describes the implications it has on Tcl and
Tk. Alpha releases allow two forms of the version number, the semantic form
MAJOR.MINOR.0-alpha.PATCH or the legacy form MAJOR.MINORaPATCH.  For beta
releases this will be MAJOR.MINOR.0-beta.PATCH resp MAJOR.MINORbPATCH.  In Tcl
9.0, the legacy form might be removed and possibly enhanced to support all
semantic versioning forms, but this is outside the scope of this TIP.

Semantic Versioning will only be adopted for Tcl 8.7 and higher, so Tcl 8.5.x
and 8.6.x will not be affected. This means that it is possible to introduce a
minor new feature in 8.6.6, which would mandate a MINOR increment under the
Semantic Versioning rules. This TIP doesn't apply to Tcl extensions either,
each extension writer is free in whatever version strategy they choose.

# Proposed Change

This TIP proposes to adopt Semantic Versioning for Tcl and Tk 8.7 and higher.
An exception will be made for Tcl extensions and Tcl modules, each extension
and module author will be free to choose whether or not to adopt Semantic
Versioning. Existing extensions/modules will continue to cooperate unchanged
with future Tcl and Tk 8.x releases.

One of the implications of this change is that there - most likely -
will be future Tcl 8.7/8.8/8.9/8.10 releases. Since all Tcl minor
releases can be installed next to each other, this would be a
maintenance burden since all of those versions need to be maintained
during serveral years in the future. Solution: drop the minor number
from all Tcl and Tk filenames and installation directories. This way,
Tcl 8.7 and 8.8 cannot be installed next to each other any more,
they share the same installation directory if the installation
_prefix_ is the same. Does that matter? No, because you always
can choose a different prefix. Actually, there is no need for 
Tcl 8.7 any more when Tcl 8.8 is available, as they are 100%
upwards compatible: the Semantic Versioning rules assure this.
As soon as Tcl 8.8 is released, no new Tcl 8.7 releases will
come out any more. Any incompatible change will have to wait
for Tcl 9 \(or 10 or 11...\). Tcl 8.5 and 8.6 releases will
continue to be supported as long as there is sufficient interest,
this TIP doesn't change anything on that.

An important implication of dropping the minor number in
the Tcl installation script directory is that it would
become "<prefix>/lib/tcl8" in stead of "<prefix>/lib/tcl8.7".
This is a problem, because this is the same directory used for
Tcl Modules, which still need to support existing extensions.
This TIP therefore proposes to change TCL\_LIBRARY to
'<prefix>/share/tcl8', that won't conflict with anything used thus far.

Since Tcl now starts using TCL\_LIBRARY being a subdirectory of
"<prefix>/share", it seems logical to start using this directory
for man-pages as well. Therefore, it is proposed to upgrade
"autoconf" to the latest version \(2.69\), which brings the
man-page change without further hurdle.

All together, the proposed directory structure \(UNIX\)

 > _<prefix>/bin/tclsh8_ **Tcl executable**

 > _<prefix>/lib/libtcl8.so_ **Tcl shared library**

 > _<prefix>/lib/tcl8_ **Tcl Modules**

 > _<prefix>/lib/tcl8/8.6_

 > _<prefix>/lib/tcl8/8.7_

 > _<prefix>/lib/tcl8.6_ **TCL\_LIBRARY for Tcl 8.6**

 > _<prefix>/share/tcl8_ **TCL\_LIBRARY for Tcl 8.x \(x>=7\)**

 > _<prefix>/share/man_ **Tcl manual pages**

Regarding backporting, any change done in Tcl 8.7 could in principle
be backported to Tcl 8.6 and further to 8.5, if desired. This TIP
doesn't put a restriction to that, as Semantic Versioning only starts
with version 8.7. Still it would be desirable for the TCT to describe a
procedure for that, this is outside the scope of this TIP.

Semantic Versioning requires that any API which is removed in Tcl 9.0 must be
made deprecated in Tcl 8.7 \(or 8.8\). A new macro TCL\_DEPRECATED will be
introduced for that in the \*Decls.h files. If Tcl 8.7 is compiled with the
flag TCL\_NO\_DEPRECATED, all deprecated API is removed, by making those entries
MODULE\_SCOPE, and putting 0 in the corresponding stub entries.  This can be
used by extensions to see whether they are compatible with the next major Tcl
release or not.

The following API's are declared deprecated in Tcl 8.7 and will be
removed in Tcl 9.0. They are already removed in the "novem" branch:

 * Tcl\_Backslash

 * Tcl\_EvalFile

 * Tcl\_GetDefaultEncodingDir

 * Tcl\_SetDefaultEncodingDir

 * Tcl\_EvalTokens

 * Tcl\_CreateMathFunc

 * Tcl\_GetMathFuncInfo

 * Tcl\_ListMathFuncs

An implementation of this TIP can be found at <http://core.tcl.tk/tcl> ; branch
"semver".

# Rejected Alternatives

TODO

# Copyright

This document has been placed in the public domain.

Name change from tip/44.tip to tip/44.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41

42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115

116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148

149
150
151

152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184

TIP:            44
Title:          Move Tk's Private Commands and Variables into ::tk Namespace
Version:        $Revision: 1.9 $
Author:         Don Porter <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        16-Jul-2001
Post-History:   
Tcl-Version:    8.4

~ Abstract

This TIP proposes that Tk's private commands and variables be moved
into the namespace ''::tk'' or its descendent namespace(s).

~ Background

Tk defines several commands and variables in the global namespace
that are not intended for public use.  Some of the commands are
used for widget bindings.  Some of the variables are used to maintain
widget state information.  The definition of these "private" commands
and variables in the global namespace is a legacy held over from Tk 4
and the pre-8 versions of Tcl in which there was only one namespace.

Fortunately, the coders of Tk have maintained good discipline in
naming these commands and variables intended for Tk's internal
use only.  The commands and variables matching the glob pattern
''::tk[[A-Z]]*'' are private.  Consider this interactive tktest session
with Tk 8.3.3:

| $ cd tk/unix
| $ make runtest
| ...
| % # Put Tk through its paces to define all commands and variables
| % source ./../tests/all.tcl
| ...
| % llength [info commands {tk[A-Z]*}]
| 183
| % llength [info vars {tk[A-Z]*}]
| 5

So, on Unix, there are 183 private commands and 5 private variables
polluting the global namespace.  The number and list of commands
and variables varies a bit from platform to platform due to
differences in widget bindings.

More recently, private commands in Tk have been added in the ''::tk''
namespace; two examples are ''tk::PlaceWindow'' and ''tk::SetFocusGrab''.
Likewise the private variable ''tk::FocusGrab'' has also been added
in the ''::tk'' namespace.

There are three reasons why it is better for Tk's private commands
and variables to be moved out of the global namespace and into the
''::tk'' namespace.

 1. The large number of commands and variable makes it more difficult
    to use interactive ''[[info commands]]'' and ''[[info vars]]'' or
    ''[[info globals]]'' introspection to learn about what application
    specific commands and variables are defined.

 2. Placing private commands and variables in the global namespace
    gives them a higher profile, and increases the likelihood that
    they will be used publicly, against the intent of Tk's interface.

 3. By making more use of its own namespace for keeping track of its
    own internals, Tk becomes a better example for authors of other
    packages to copy.

~ Proposal

All commands and variables created by Tk and matching the glob pattern
''::tk[[A-Z]]*'' shall be renamed to a name contained within the
''::tk'' namespace or one of the descendent namespaces of ''::tk''.

The global variable ''::histNum'' created by ''tk/library/console.tcl''
shall also be renamed to ''::tk::HistNum''.

All commands and variables created by the proposal will be given names
that begin with an uppercase character (''[[A-Z]]'') to indicate their
internal status according to the conventions of the Tcl Style Guide
[http://purl.org/tcl/home/doc/styleGuide.pdf].

~ Compatibility and Migration

This proposal only deals with the internals of Tk, so technically there
are no compatibility issues, because Tk users should not be depending
on these private commands and variables.

That said, because these commands and variables have had a high
profile in the global namespace, it seems likely that some users
have written code that depends on them.  To aid such users in a
migration away from that dependence, it is also proposed that
Tk provide two additional unsupported commands:

| ::tk::unsupported::ExposePrivateCommand commandName

and

| ::tk::unsupported::ExposePrivateVariable variableName

The command ''[[::tk::unsupported::ExposePrivateCommand commandName]]''
restores the existence of the Tk private command ''commandName'' in
the global namespace as it was before adoption of this proposal.
The command ''[[::tk::unsupported::ExposePrivateVariable variableName]]''
restores the existence of the Tk private variable ''variableName'' in
the global namespace as it was before adoption of this proposal.
For example, a Tk user who had written code that made use of the Tk
private command ''tkCancelRepeat'' can add the following code to
continue working with Tk after acceptance of this proposal:

| if {![llength [info commands tkCancelRepeat]]} {
|     tk::unsupported::ExposePrivateCommand tkCancelRepeat
| }

These migration commands are in the namespace ''tk::unsupported'',
a new namespace to be used for unsupported commands in Tk.  This
namespace may and should be used for any other unsupported commands
to be created in Tk.  Their implementation is in the new file
''tk/library/unsupported.tcl''.

~ Reference Implementation

This proposal has already been implemented and committed to Tk's
CVS repository on the branch tagged ''dgp-privates-into-namespace''.
That branch is up to date with Tk's HEAD branch as of July 16, 2001.

To make an anonymous checkout of the reference implementation into
a directory named ''tkprivate'', run the following CVS commands:

| $ cvs -d :pserver:[email protected]:/cvsroot/tktoolkit \
|   login
| (Logging in to [email protected])
| CVS password: <Enter>
| $ cvs -z3 -d :pserver:[email protected]:/cvsroot/tktoolkit \
|   co -r dgp-privates-into-namespace -d tkprivate tk

The reference implementation has the same results on the Tk
test suite as the HEAD revision.

In the tktest of the reference implementation:

| $ make runtest
| ...
| % source ./../tests/all.tcl
| ...
| % llength [info commands {tk[A-Z]*}]
| 0

| % llength [info vars {tk[A-Z]*}]
| 0

~ See Also

Feature Request 220936
[http://sf.net/tracker/?func=detail&aid=220936&group_id=12997&atid=362997]

~ Related Ideas / Future Work

The ideas in this section are ''not'' part of this proposal.  They are
related ideas mentioned here as explicitly outside the scope of this
proposal so they will not be counter-proposed.

 * ''Shouldn't Tk's public commands and variables be moved to ::tk too?''

 > Well, yes, I think they should, but that change clearly involves 
   sorting out more difficult compatibility/migration issues.  The
   current proposal is limited to the less controversial topic of
   Tk's private commands and variables.  We'll tackle the rest later.

 * ''Shouldn't Tk make use of [[namespace code]] in its bindings?''

 * ''Wouldn't it make Tk better organized if commands like
   [[tk::IconList_Add]] were further renamed [[tk::IconList::Add]]?''

 > Perhaps so.  There may be many ways in which Tk can and should 
   make better or more idiomatic use of namespaces.  That's not the
   point of this proposal, though.  The point is to get these commands
   and variables out of the global namespace.  Once that is accomplished,
   then these other matters are unquestionably internal and can proceed
   at the discretion of the maintainers without further TIP review.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|
|
|
|
|
|
|
|
|
<
>

|
|
|
|

|

|
|

|

|
|

|
|

|

|

|

|

|

|
|

|
|

|

|
|
<
|
>
|

|

|

|

|

|
|
|
|
|
|

|
|
|
|
|
<
>
|
<
|
>
|

|

|

|

|

|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39

40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112

113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146

147
148

149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184

# TIP 44: Move Tk's Private Commands and Variables into ::tk Namespace

	Author:         Don Porter <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        16-Jul-2001
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

This TIP proposes that Tk's private commands and variables be moved
into the namespace _::tk_ or its descendent namespace\(s\).

# Background

Tk defines several commands and variables in the global namespace
that are not intended for public use.  Some of the commands are
used for widget bindings.  Some of the variables are used to maintain
widget state information.  The definition of these "private" commands
and variables in the global namespace is a legacy held over from Tk 4
and the pre-8 versions of Tcl in which there was only one namespace.

Fortunately, the coders of Tk have maintained good discipline in
naming these commands and variables intended for Tk's internal
use only.  The commands and variables matching the glob pattern
_::tk[A-Z]\*_ are private.  Consider this interactive tktest session
with Tk 8.3.3:

	 $ cd tk/unix
	 $ make runtest
	 ...
	 % # Put Tk through its paces to define all commands and variables
	 % source ./../tests/all.tcl
	 ...
	 % llength [info commands {tk[A-Z]*}]
	 183
	 % llength [info vars {tk[A-Z]*}]

	 5

So, on Unix, there are 183 private commands and 5 private variables
polluting the global namespace.  The number and list of commands
and variables varies a bit from platform to platform due to
differences in widget bindings.

More recently, private commands in Tk have been added in the _::tk_
namespace; two examples are _tk::PlaceWindow_ and _tk::SetFocusGrab_.
Likewise the private variable _tk::FocusGrab_ has also been added
in the _::tk_ namespace.

There are three reasons why it is better for Tk's private commands
and variables to be moved out of the global namespace and into the
_::tk_ namespace.

 1. The large number of commands and variable makes it more difficult
    to use interactive _[info commands]_ and _[info vars]_ or
    _[info globals]_ introspection to learn about what application
    specific commands and variables are defined.

 2. Placing private commands and variables in the global namespace
    gives them a higher profile, and increases the likelihood that
    they will be used publicly, against the intent of Tk's interface.

 3. By making more use of its own namespace for keeping track of its
    own internals, Tk becomes a better example for authors of other
    packages to copy.

# Proposal

All commands and variables created by Tk and matching the glob pattern
_::tk[A-Z]\*_ shall be renamed to a name contained within the
_::tk_ namespace or one of the descendent namespaces of _::tk_.

The global variable _::histNum_ created by _tk/library/console.tcl_
shall also be renamed to _::tk::HistNum_.

All commands and variables created by the proposal will be given names
that begin with an uppercase character \(_[A-Z]_\) to indicate their
internal status according to the conventions of the Tcl Style Guide
<http://purl.org/tcl/home/doc/styleGuide.pdf> .

# Compatibility and Migration

This proposal only deals with the internals of Tk, so technically there
are no compatibility issues, because Tk users should not be depending
on these private commands and variables.

That said, because these commands and variables have had a high
profile in the global namespace, it seems likely that some users
have written code that depends on them.  To aid such users in a
migration away from that dependence, it is also proposed that
Tk provide two additional unsupported commands:

	 ::tk::unsupported::ExposePrivateCommand commandName

and

	 ::tk::unsupported::ExposePrivateVariable variableName

The command _[::tk::unsupported::ExposePrivateCommand commandName]_
restores the existence of the Tk private command _commandName_ in
the global namespace as it was before adoption of this proposal.
The command _[::tk::unsupported::ExposePrivateVariable variableName]_
restores the existence of the Tk private variable _variableName_ in
the global namespace as it was before adoption of this proposal.
For example, a Tk user who had written code that made use of the Tk
private command _tkCancelRepeat_ can add the following code to
continue working with Tk after acceptance of this proposal:

	 if {![llength [info commands tkCancelRepeat]]} {
	     tk::unsupported::ExposePrivateCommand tkCancelRepeat

	 }

These migration commands are in the namespace _tk::unsupported_,
a new namespace to be used for unsupported commands in Tk.  This
namespace may and should be used for any other unsupported commands
to be created in Tk.  Their implementation is in the new file
_tk/library/unsupported.tcl_.

# Reference Implementation

This proposal has already been implemented and committed to Tk's
CVS repository on the branch tagged _dgp-privates-into-namespace_.
That branch is up to date with Tk's HEAD branch as of July 16, 2001.

To make an anonymous checkout of the reference implementation into
a directory named _tkprivate_, run the following CVS commands:

	 $ cvs -d :pserver:[email protected]:/cvsroot/tktoolkit \
	   login
	 (Logging in to [email protected])
	 CVS password: <Enter>
	 $ cvs -z3 -d :pserver:[email protected]:/cvsroot/tktoolkit \
	   co -r dgp-privates-into-namespace -d tkprivate tk

The reference implementation has the same results on the Tk
test suite as the HEAD revision.

In the tktest of the reference implementation:

	 $ make runtest
	 ...
	 % source ./../tests/all.tcl
	 ...
	 % llength [info commands {tk[A-Z]*}]

	 0
	 % llength [info vars {tk[A-Z]*}]

	 0

# See Also

Feature Request 220936
<http://sf.net/tracker/?func=detail&aid=220936&group_id=12997&atid=362997> 

# Related Ideas / Future Work

The ideas in this section are _not_ part of this proposal.  They are
related ideas mentioned here as explicitly outside the scope of this
proposal so they will not be counter-proposed.

 * _Shouldn't Tk's public commands and variables be moved to ::tk too?_

	 > Well, yes, I think they should, but that change clearly involves 
   sorting out more difficult compatibility/migration issues.  The
   current proposal is limited to the less controversial topic of
   Tk's private commands and variables.  We'll tackle the rest later.

 * _Shouldn't Tk make use of [namespace code] in its bindings?_

 * _Wouldn't it make Tk better organized if commands like
   [tk::IconList_Add] were further renamed [tk::IconList::Add]?_

	 > Perhaps so.  There may be many ways in which Tk can and should 
   make better or more idiomatic use of namespaces.  That's not the
   point of this proposal, though.  The point is to get these commands
   and variables out of the global namespace.  Once that is accomplished,
   then these other matters are unquestionably internal and can proceed
   at the discretion of the maintainers without further TIP review.

# Copyright

This document has been placed in the public domain.

Name change from tip/440.tip to tip/440.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43

TIP:            440
Title:          Add engine to tcl_platform Array
Version:        $Revision: 1.13 $
Author:         Joe Mistachkin <[email protected]>
Author:         Jan Nijtmans <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        14-Jan-2016
Post-History:   
Keywords:       language implementation,platform
Tcl-Version:    8.5

~ Abstract

This TIP proposes a mechanism for determining the implementation of the Tcl
language currently in use.

~ Rationale

There is more than one implementation of the Tcl language (see
[http://wiki.tcl.tk/13992]). These implementations differ greatly in their
degree of compatibility and completeness. At the script level, there is
currently no standard way to determine which implementation of the Tcl
language is being used.

~ Specification

The '''engine''' element will be added to the '''tcl_platform''' array. Its
value will be set to '''"Tcl"'''.

~ Reference Implementation

A reference implementation of this TIP is available
[https://core.tcl.tk/tcl/timeline?r=tclPlatformEngine].

The TH1, Jim, Picol, JTcl, and Eagle implementations of the Tcl language already
implement this feature, each using the name of the project as the value of
the '''tcl_platform(engine)''' element.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|
|

|

|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43

# TIP 440: Add engine to tcl_platform Array

	Author:         Joe Mistachkin <[email protected]>
	Author:         Jan Nijtmans <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        14-Jan-2016
	Post-History:   
	Keywords:       language implementation,platform
	Tcl-Version:    8.5
-----

# Abstract

This TIP proposes a mechanism for determining the implementation of the Tcl
language currently in use.

# Rationale

There is more than one implementation of the Tcl language \(see
<http://wiki.tcl.tk/13992> \). These implementations differ greatly in their
degree of compatibility and completeness. At the script level, there is
currently no standard way to determine which implementation of the Tcl
language is being used.

# Specification

The **engine** element will be added to the **tcl\_platform** array. Its
value will be set to **"Tcl"**.

# Reference Implementation

A reference implementation of this TIP is available
<https://core.tcl.tk/tcl/timeline?r=tclPlatformEngine> .

The TH1, Jim, Picol, JTcl, and Eagle implementations of the Tcl language already
implement this feature, each using the name of the project as the value of
the **tcl\_platform\(engine\)** element.

# Copyright

This document has been placed in the public domain.

Name change from tip/441.tip to tip/441.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50

TIP:            441
Title:          Add -justify Configuration Option to the listbox Widget
Version:        $Revision: 1.3 $
Author:         Fran�ois Vogel <[email protected]>
Author:         Fran�ois Vogel <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        18-Jan-2016
Post-History:   
Keywords:       Tk,listbox
Tcl-Version:    8.6.5

~ Abstract

Despite the '''listbox''' widget already having numerous configuration
options, some users need more refinements and have requested the possibility
to control the justification of the text displayed in the items of the
listbox. This TIP proposes to add this option.

~ Rationale

Currently the '''listbox''' widget always aligns its items leftwards. Some
users miss a configuration options allowing to justify items in the
'''listbox''' widget. These RFE include:

  * RFE 454303, [https://core.tcl.tk/tk/tktview/454303]

  * RFE 3f456a5bb9, [https://core.tcl.tk/tk/tktview/3f456a5bb9]

~ Proposed Change

It is proposed to add the '''-justify''' configuration option to the Tk
'''listbox''' widget.

Possible values are as already documented in the '''options''' manual page
(i.e., '''left''', '''center''', or '''right'''), and translate internally
into standard ''Tk_Justify'' values, i.e., TK_JUSTIFY_LEFT, TK_JUSTIFY_CENTER,
and TK_JUSTIFY_RIGHT, respectively.

Default value is '''left''' on all platforms, for backwards compatibility reasons.

~ Reference Implementation

A reference implementation is available in branch tip-441 of the fossil
repository.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|
|

|
|
|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50

# TIP 441: Add -justify Configuration Option to the listbox Widget

	Author:         François Vogel <[email protected]>
	Author:         François Vogel <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        18-Jan-2016
	Post-History:   
	Keywords:       Tk,listbox
	Tcl-Version:    8.6.5
-----

# Abstract

Despite the **listbox** widget already having numerous configuration
options, some users need more refinements and have requested the possibility
to control the justification of the text displayed in the items of the
listbox. This TIP proposes to add this option.

# Rationale

Currently the **listbox** widget always aligns its items leftwards. Some
users miss a configuration options allowing to justify items in the
**listbox** widget. These RFE include:

  * RFE 454303, <https://core.tcl.tk/tk/tktview/454303> 

  * RFE 3f456a5bb9, <https://core.tcl.tk/tk/tktview/3f456a5bb9> 

# Proposed Change

It is proposed to add the **-justify** configuration option to the Tk
**listbox** widget.

Possible values are as already documented in the **options** manual page
\(i.e., **left**, **center**, or **right**\), and translate internally
into standard _Tk\_Justify_ values, i.e., TK\_JUSTIFY\_LEFT, TK\_JUSTIFY\_CENTER,
and TK\_JUSTIFY\_RIGHT, respectively.

Default value is **left** on all platforms, for backwards compatibility reasons.

# Reference Implementation

A reference implementation is available in branch tip-441 of the fossil
repository.

# Copyright

This document has been placed in the public domain.

Name change from tip/442.tip to tip/442.md.

1
2
3
4
5
6
7
8
9
10
11
12
13
14

15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85

86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103

TIP:            442
Title:          Display text in progressbars
Version:        $Revision: 1.11 $
Author:         Ren� Zaumseil <[email protected]>
Author:         Kevin B Kenny <[email protected]>
Author:         Andreas Leitgeb <[email protected]>
Author:         Kevin Kenny <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        17-Feb-2016
Post-History:   
Keywords:       Tk
Tcl-Version:    8.7

~ Abstract

Horizontal progress bars should support the ability to display text inside the progress bar.
Buttons should allow justification of multiline texts.

~ Rationale

It is often useful to be able to display text directly on top of a progress
bar. This text might be a description of the progress percentage, what is
currently being done, or even just a label giving the overall task that is
progressing.

The '''ttk::progressbar''' command can easily enhanced to provide this
support, and there is no interference with existing code as this
functionality
can be done by just introducing new options.  The options required are from
the list of usual Tk well-known option names.

Also the '''ttk::button''' command can easily be enhanced to provide justification of multiline text.

~ Specification

Text will be displayed only on horizontal '''ttk::progressbar'''.
To control the text appearance the following new options will be added:

 -text: The string to display.

 -font: The font used to render the text.

 -foreground: The color of the text.

 -anchor: The anchoring of the text.

 -justify: The justification of the string.

 -wraplength: The length at which the string will be automatically wrapped.

To justify multiline text in '''ttk::button''' a new option will be added:

 -justify: The justification of the string.

~ Notes for future improvements

The underlying Tk text rendering engine supports rotated text, which would
make support on vertical progress bars possible. But control of the rotation
angle might be required (according to whether the text is rotated left or
right, or stays unrotated).

The most contrasting color of the text will depend where on the progress bar
it is placed. This is not an effect that is simply reproduced with the Tk
script level, but is easy to apply during rendering.

~ Implementation

A patch implementing these changes and updating the documentation is available
in the fossil repository in the tip-442 branch.

Implementation is heavily borrowed from the '''ttk::label''' widget featuring
these same options. The names, meanings, and default values of the options are
the same as for '''ttk::label'''. The rendering and processing is the same as
for this latter widget.

~ Example of use

|    package require Tk
|    proc moveit {} {
|      for {set i 0} {$i < 100} {incr i} {
|        .p step ; update ; after 100
|      }
|    }

|    pack [ttk::progressbar .p -value 0 -maximum 50 -orient horizontal -length 500]
|    .p configure -anchor c -foreground blue -justify right \
|            -text "-anchor c -foreground blue -justify right -wraplength 100" \
|            -wraplength 100
|    moveit
|    .p configure -anchor e -font {Arial 10 bold} -foreground green -justify center \
|            -text "-anchor e -font {Arial 10 bold} -foreground green -justify center -wraplength 250" \
|            -wraplength 250
|    moveit
|    .p configure -text "-anchor w -foreground red -justify left -wraplength 50" \
|            -anchor w -foreground red -justify left -wraplength 50
|    moveit
|    .p configure -orient vertical -text "Cannot be seen"
|    moveit

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|
|
|
|
<
<
>
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82

83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103

# TIP 442: Display text in progressbars

	Author:         René Zaumseil <[email protected]>
	Author:         Kevin B Kenny <[email protected]>
	Author:         Andreas Leitgeb <[email protected]>
	Author:         Kevin Kenny <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        17-Feb-2016
	Post-History:   
	Keywords:       Tk
	Tcl-Version:    8.7
-----

# Abstract

Horizontal progress bars should support the ability to display text inside the progress bar.
Buttons should allow justification of multiline texts.

# Rationale

It is often useful to be able to display text directly on top of a progress
bar. This text might be a description of the progress percentage, what is
currently being done, or even just a label giving the overall task that is
progressing.

The **ttk::progressbar** command can easily enhanced to provide this
support, and there is no interference with existing code as this
functionality
can be done by just introducing new options.  The options required are from
the list of usual Tk well-known option names.

Also the **ttk::button** command can easily be enhanced to provide justification of multiline text.

# Specification

Text will be displayed only on horizontal **ttk::progressbar**.
To control the text appearance the following new options will be added:

 -text: The string to display.

 -font: The font used to render the text.

 -foreground: The color of the text.

 -anchor: The anchoring of the text.

 -justify: The justification of the string.

 -wraplength: The length at which the string will be automatically wrapped.

To justify multiline text in **ttk::button** a new option will be added:

 -justify: The justification of the string.

# Notes for future improvements

The underlying Tk text rendering engine supports rotated text, which would
make support on vertical progress bars possible. But control of the rotation
angle might be required \(according to whether the text is rotated left or
right, or stays unrotated\).

The most contrasting color of the text will depend where on the progress bar
it is placed. This is not an effect that is simply reproduced with the Tk
script level, but is easy to apply during rendering.

# Implementation

A patch implementing these changes and updating the documentation is available
in the fossil repository in the tip-442 branch.

Implementation is heavily borrowed from the **ttk::label** widget featuring
these same options. The names, meanings, and default values of the options are
the same as for **ttk::label**. The rendering and processing is the same as
for this latter widget.

# Example of use

	    package require Tk
	    proc moveit {} {
	      for {set i 0} {$i < 100} {incr i} {
	        .p step ; update ; after 100

	      }
	    }
	    pack [ttk::progressbar .p -value 0 -maximum 50 -orient horizontal -length 500]
	    .p configure -anchor c -foreground blue -justify right \
	            -text "-anchor c -foreground blue -justify right -wraplength 100" \
	            -wraplength 100
	    moveit
	    .p configure -anchor e -font {Arial 10 bold} -foreground green -justify center \
	            -text "-anchor e -font {Arial 10 bold} -foreground green -justify center -wraplength 250" \
	            -wraplength 250
	    moveit
	    .p configure -text "-anchor w -foreground red -justify left -wraplength 50" \
	            -anchor w -foreground red -justify left -wraplength 50
	    moveit
	    .p configure -orient vertical -text "Cannot be seen"
	    moveit

# Copyright

This document has been placed in the public domain.

Name change from tip/443.tip to tip/443.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128

TIP:            443
Title:          More Tag Configuration Options for the Text Widget
Version:        $Revision: 1.9 $
Author:         Fran�ois Vogel <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        09-Feb-2016
Post-History:   
Keywords:       Tk
Tcl-Version:    8.6.6

~ Abstract

Despite the '''text''' widget already has numerous configuration options, some
users need more refinements and have requested new tag configuration options.
This TIP proposes to add these options, when deemed relevant.

~ Rationale

Several users have reported they miss different tag configuration options in
the '''text''' widget, stating they cannot achieve the rendering they target.
Such RFE include:

 * RFE 1759972 [https://core.tcl.tk/tk/tktview/1759972], with Patch
   3469780 [https://core.tcl.tk/tk/tktview/3469780]

 * RFE 220889 [https://core.tcl.tk/tk/tktview/220889]

 * RFE 1754048 [https://core.tcl.tk/tk/tktview/1754048]

~ Proposed Change

It is proposed to add the following tag configuration options to the Tk
'''text''' widget:

'''-selectbackground''' ''color'':

 > Specifies the background color to use
   when displaying selected items. It may have any of the forms accepted by
   '''Tk_GetColor'''. If ''color'' has not been specified, or if it is
   specified as an empty string, then the color specified by the
   '''-background''' tag option is used.

 > Note regarding the particular case of the "sel"
 tag: Currently, the "sel" tag '''-background''' tag option is mirrored with
 the '''-selectbackground''' text widget option. This makes sense. It does not
 make real sense to have '''-selectbackground''' applied to the "sel" tag (it
 is more intuitive to use '''-background''' for the "sel" tag). However, if the
 "sel" tag receives non-empty '''-selectbackground''', then this tag option
 prevails on the '''-background''' tag option for mirroring, i.e. the
 '''-selectbackground''' tag option is mirrored with the
 '''-selectbackground''' widget option.

'''-selectforeground''' ''color'':

 > Specifies the foreground color to use
   when displaying selected items. It may have any of the forms accepted by
   '''Tk_GetColor'''. If ''color'' has not been specified, or if it is
   specified as an empty string, then the color specified by the
   '''-foreground''' tag option is used.

 > Note regarding the particular case of the "sel"
 tag: same principle as above for '''-selectbackground'''.

'''-underlinefg''' ''color'':

 > Specifies the color to use when displaying
   the underline. It may have any of the forms accepted by '''Tk_GetColor'''.
   If ''color'' has not been specified, or if it is specified as an empty
   string, then the color specified by the '''-foreground''' tag option is
   used (if there is one, otherwise the the color specified by the
   '''-foreground''' widget option is used).

'''-overstrikefg''' ''color'':

 > Specifies the color to use when
   displaying the overstrike. It may have any of the forms accepted by
   '''Tk_GetColor'''. If ''color'' has not been specified, or if it is
   specified as an empty string, then the color specified by the
   '''-foreground''' tag option is used (if there is one, otherwise the
   color specified by the '''-foreground''' widget option is used).

'''-lmargincolor''' ''color'':

 > ''Color'' specifies the background color
   to use in regions that do not contain characters because they are
   indented by '''-lmargin1''' or '''-lmargin2'''. It may have any of the
   forms accepted by '''Tk_GetColor'''. If ''color'' has not been specified,
   or if it is specified as an empty string, then the color specified by the
   '''-background''' widget option is used.

'''-rmargincolor''' ''color'':

 > ''Color'' specifies the background color
   to use in regions that do not contain characters because they are
   indented by '''-rmargin'''. It may have any of the forms accepted by
   '''Tk_GetColor'''. If ''color'' has not been specified, or if it is
   specified as an empty string, then the color specified by the
   '''-background''' widget option is used.

~ Rejected additional tag configuration options

RFE 1759972 [https://core.tcl.tk/tk/tktview/1759972] requested stippling
('''-selectbgstipple''', '''-selectfgstipple''') for selected text. Also RFE 1754048 [https://core.tcl.tk/tk/tktview/1754048] requested stippling in left and right margins of the text widget ('''-lmargin1stipple''', '''-lmargin2stipple''', '''rmarginstipple''').

Any new stippling options was rejected during the discussion about this
TIP. Reasons were as follows:

 * "Stippling and anything related to Tk Bitmaps should be considered
   obsolete. Their use should be phased out, not expanded. (Bitmaps have
   very limited support in Tk, and stippling is virtually never used in
   modern user interfaces. In fact, AFAIK current graphics stacks --
   cairo, Quartz, Direct3D, &c -- don't even support this operation."

 * "Stippling is something that is an artefact of a prior time, and it
   looks pretty bad now. These days, a way to specify an alpha value for
   the tag would be far more relevant, since then it would end up with a
   blend of the text foreground and background."

~ Reference Implementation

A reference implementation is available in branch tip-443 of the fossil
repository.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|

|

|

|

|

|

|

|
|
|
|
|
|
|
|

|

|

|

|

|

|
|
|
|
|

|

|

|
|

|

|

|
|

|

|

|

|
|

|

|

|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128

# TIP 443: More Tag Configuration Options for the Text Widget

	Author:         François Vogel <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        09-Feb-2016
	Post-History:   
	Keywords:       Tk
	Tcl-Version:    8.6.6
-----

# Abstract

Despite the **text** widget already has numerous configuration options, some
users need more refinements and have requested new tag configuration options.
This TIP proposes to add these options, when deemed relevant.

# Rationale

Several users have reported they miss different tag configuration options in
the **text** widget, stating they cannot achieve the rendering they target.
Such RFE include:

 * RFE 1759972 <https://core.tcl.tk/tk/tktview/1759972> , with Patch
   3469780 <https://core.tcl.tk/tk/tktview/3469780> 

 * RFE 220889 <https://core.tcl.tk/tk/tktview/220889> 

 * RFE 1754048 <https://core.tcl.tk/tk/tktview/1754048> 

# Proposed Change

It is proposed to add the following tag configuration options to the Tk
**text** widget:

**-selectbackground** _color_:

 > Specifies the background color to use
   when displaying selected items. It may have any of the forms accepted by
   **Tk\_GetColor**. If _color_ has not been specified, or if it is
   specified as an empty string, then the color specified by the
   **-background** tag option is used.

 > Note regarding the particular case of the "sel"
 tag: Currently, the "sel" tag **-background** tag option is mirrored with
 the **-selectbackground** text widget option. This makes sense. It does not
 make real sense to have **-selectbackground** applied to the "sel" tag \(it
 is more intuitive to use **-background** for the "sel" tag\). However, if the
 "sel" tag receives non-empty **-selectbackground**, then this tag option
 prevails on the **-background** tag option for mirroring, i.e. the
 **-selectbackground** tag option is mirrored with the
 **-selectbackground** widget option.

**-selectforeground** _color_:

 > Specifies the foreground color to use
   when displaying selected items. It may have any of the forms accepted by
   **Tk\_GetColor**. If _color_ has not been specified, or if it is
   specified as an empty string, then the color specified by the
   **-foreground** tag option is used.

 > Note regarding the particular case of the "sel"
 tag: same principle as above for **-selectbackground**.

**-underlinefg** _color_:

 > Specifies the color to use when displaying
   the underline. It may have any of the forms accepted by **Tk\_GetColor**.
   If _color_ has not been specified, or if it is specified as an empty
   string, then the color specified by the **-foreground** tag option is
   used \(if there is one, otherwise the the color specified by the
   **-foreground** widget option is used\).

**-overstrikefg** _color_:

 > Specifies the color to use when
   displaying the overstrike. It may have any of the forms accepted by
   **Tk\_GetColor**. If _color_ has not been specified, or if it is
   specified as an empty string, then the color specified by the
   **-foreground** tag option is used \(if there is one, otherwise the
   color specified by the **-foreground** widget option is used\).

**-lmargincolor** _color_:

 > _Color_ specifies the background color
   to use in regions that do not contain characters because they are
   indented by **-lmargin1** or **-lmargin2**. It may have any of the
   forms accepted by **Tk\_GetColor**. If _color_ has not been specified,
   or if it is specified as an empty string, then the color specified by the
   **-background** widget option is used.

**-rmargincolor** _color_:

 > _Color_ specifies the background color
   to use in regions that do not contain characters because they are
   indented by **-rmargin**. It may have any of the forms accepted by
   **Tk\_GetColor**. If _color_ has not been specified, or if it is
   specified as an empty string, then the color specified by the
   **-background** widget option is used.

# Rejected additional tag configuration options

RFE 1759972 <https://core.tcl.tk/tk/tktview/1759972>  requested stippling
\(**-selectbgstipple**, **-selectfgstipple**\) for selected text. Also RFE 1754048 <https://core.tcl.tk/tk/tktview/1754048>  requested stippling in left and right margins of the text widget \(**-lmargin1stipple**, **-lmargin2stipple**, **rmarginstipple**\).

Any new stippling options was rejected during the discussion about this
TIP. Reasons were as follows:

 * "Stippling and anything related to Tk Bitmaps should be considered
   obsolete. Their use should be phased out, not expanded. \(Bitmaps have
   very limited support in Tk, and stippling is virtually never used in
   modern user interfaces. In fact, AFAIK current graphics stacks --
   cairo, Quartz, Direct3D, &c -- don't even support this operation."

 * "Stippling is something that is an artefact of a prior time, and it
   looks pretty bad now. These days, a way to specify an alpha value for
   the tag would be far more relevant, since then it would end up with a
   blend of the text foreground and background."

# Reference Implementation

A reference implementation is available in branch tip-443 of the fossil
repository.

# Copyright

This document has been placed in the public domain.

Name change from tip/444.tip to tip/444.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45

TIP:            444
Title:          Add "weekdays" unit in clock add
Version:        $Revision: 1.6 $
Author:         Pietro Cerutti <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        23-Feb-2016
Post-History:   
Tcl-Version:    8.7

~ Abstract

This TIP proposes an enhancement to the '''clock add''' command to support
performing days arithmetic using weekdays only.

~ Rationale

The '''clock add''' command allows to perform time arithmetic using a variety
of time units, including days. However, it offers no easy way to skip
weekends. It is often desired to perform weekdays arithmetic that involves
adding or subtracting a number of non-weekend days to a certain date. This is
useful in example when computing delivery dates.

~ Proposal

The '''weekdays''' time-unit is added to the list accepted by the '''clock
add''' command. The '''count''' argument represents weekdays (Mon-Fri) to be
added (or subtracted in case of a negative value) to the date. The result of
adding weekdays to a date is never a weekend day, unless the starting day is
itself a weekend day and '''count''' is 0.

~ Reference Implementation

Available at http://core.tcl.tk/tcl/timeline?t=tip-444

~ Discussion

A point has been raised as to whether ''weekday'' is unambiguous enough. For instance, in Sweden there seems to be some disagreement on whether the translation ''vardag'' includes Saturdays. As an alternative, the term ''workday'' has been mentioned. This, however, has the downside of introducing the concept of working days vs. public holiday. Also, the working week is not Mon-Fri in all countries, see [https://en.wikipedia.org/wiki/Workweek_and_weekend#Around_the_world].

TIP does not try to accomodate locale-specific features and characteristics. For this reason, it seems best to stick to ''weekday'' as the name of the unit and specifically mention that (Mon-Fri) is intended in the documentation.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|
|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45

# TIP 444: Add "weekdays" unit in clock add

	Author:         Pietro Cerutti <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        23-Feb-2016
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes an enhancement to the **clock add** command to support
performing days arithmetic using weekdays only.

# Rationale

The **clock add** command allows to perform time arithmetic using a variety
of time units, including days. However, it offers no easy way to skip
weekends. It is often desired to perform weekdays arithmetic that involves
adding or subtracting a number of non-weekend days to a certain date. This is
useful in example when computing delivery dates.

# Proposal

The **weekdays** time-unit is added to the list accepted by the **clock
add** command. The **count** argument represents weekdays \(Mon-Fri\) to be
added \(or subtracted in case of a negative value\) to the date. The result of
adding weekdays to a date is never a weekend day, unless the starting day is
itself a weekend day and **count** is 0.

# Reference Implementation

Available at <http://core.tcl.tk/tcl/timeline?t=tip-444>

# Discussion

A point has been raised as to whether _weekday_ is unambiguous enough. For instance, in Sweden there seems to be some disagreement on whether the translation _vardag_ includes Saturdays. As an alternative, the term _workday_ has been mentioned. This, however, has the downside of introducing the concept of working days vs. public holiday. Also, the working week is not Mon-Fri in all countries, see <https://en.wikipedia.org/wiki/Workweek_and_weekend#Around_the_world> .

TIP does not try to accomodate locale-specific features and characteristics. For this reason, it seems best to stick to _weekday_ as the name of the unit and specifically mention that \(Mon-Fri\) is intended in the documentation.

# Copyright

This document has been placed in the public domain.

Name change from tip/445.tip to tip/445.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

TIP:            445
Title:          Tcl_ObjType Utility Routines
Version:        $Revision: 1.7 $
Author:         Don Porter <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        18-Mar-2016
Post-History:   
Tcl-Version:	8.7

~ Abstract

Proposes additional public routines useful for extensions that implement
custom '''Tcl_ObjType''s.

~ Background

When an extension creates a custom '''Tcl_ObjType''' it needs to operate on the fields of the '''Tcl_Obj''' and the '''Tcl_ObjType''' structs.

Almost all of these operations have been nicely encapsulated in utility routines, so for example, an extension calls '''Tcl_GetString''' to make sure a value is set for ''objPtr->bytes'', rather than worrying about the backing details of calling the routine ''objPtr->typePtr->updateStringProc'' (if present) for itself.  Likewise '''Tcl_DuplicateObj''' routes processing to type-specific routines as needed.

There are gaps in this interface.  Most glaring is the lack of any way to call the ''freeIntRepProc'' of an incumbent type other than directly through the ''typePtr'' field.  Another missing bit is an encapsulated way to set the string rep without direct manipulation of the ''bytes'' and ''length'' fields.  Within Tcl itself, there are internal utility macros '''TclFreeIntRep''' and '''TclInitStringRep''' for these tasks, but extensions have nothing.

Besides convenience, utility routines such as these improve chances for correctness, since they bring constraints into one place instead of many places.  For example, the requirement that when ''objPtr->typePtr'' is not NULL, it must be paired with an appropriate ''objPtr->internalRep''.  The '''TclFreeIntRep''' macro has a history of fixing such bugs.  A corresponding routine will offer the same benefit to extensions.

~ Proposal

Add to Tcl's stub table of public C routines a new routine

 > void '''Tcl_FreeIntRep'''(Tcl_Obj* ''objPtr'')

that performs precisely the same task as the existing internal
macro '''TclFreeIntRep'''.

Add to Tcl's stub table of public C routines a new routine 

 > char * '''Tcl_InitStringRep'''(Tcl_Obj* ''objPtr'', const char* ''bytes'', unsigned int ''numBytes'')

that performs the function of the existing internal
macro '''TclInitStringRep''', but is extended to return a pointer to the
string rep, and to accept NULL as a value for ''bytes''.  When ''bytes'' is
NULL and ''objPtr'' has no string rep, an uninitialzed buffer
of ''numBytes'' bytes is created for filling by the caller.
When ''bytes'' is NULL and ''objPtr'' has a string rep, the string rep will
be truncated to a length of ''numBytes'' bytes.  When ''numBytes'' is 
greater than zero, and the returned pointer is NULL, that indicates a
failure to allocate memory for the string representation.  The caller
may then choose whether to raise an error or panic.

Add to Tcl's stub table of public C routines a new routine 

 > int '''Tcl_HasStringRep'''(Tcl_Obj* ''objPtr'')

that returns a boolean indicating whether or not a string rep
is currently stored in ''objPtr''.  This is used when the caller
wants to act on ''objPtr'' differently depending on whether or
not it is a ''pure'' value.  Typically this only makes sense in
an extension if it is already known that ''objPtr'' possesses
an internal type that is managed by the extension.

Define a new public type

 > typedef union '''Tcl_ObjIntRep''' {...} '''Tcl_ObjIntRep'''

where the contents are exactly the existing contents of the union
in the ''internalRep'' field of the '''Tcl_Obj''' struct.  This definition
permits us to pass internal representations and pointers to them as
arguments and results in public routines.

Add to Tcl's stub table of public C routines a new routine 

 > void '''Tcl_StoreIntRep'''(Tcl_Obj* ''objPtr'', const Tcl_ObjType* ''typePtr'', const Tcl_ObjIntRep* ''irPtr'')

which stores in ''objPtr'' a copy of the internal representation pointed
to by ''irPtr'' and sets its type to ''typePtr''.  When ''irPtr'' is NULL,
this leaves ''objPtr'' without a representation for type ''typePtr''.

Add to Tcl's stub table of public C routines a new routine 

 > Tcl_ObjIntRep* '''Tcl_FetchIntRep'''(Tcl_Obj* ''objPtr'', const Tcl_ObjType* ''typePtr'')

which returns a pointer to the internal representation stored
in ''objPtr'' that matches the requested type ''typePtr''.

~ Compatibility

These are new routines, so they have no compatibility concerns in the sense of cause trouble for existing working code.

They do help set up an improved compatibility scenario for the future however.  Extensions that use these new routines to stop directly referring to the fields of the '''Tcl_Obj''' and '''Tcl_ObjType''' structs are prepared to support a source-compatible migration to a Tcl 9 that might then be free to make revisions to those structs.

~ Implementation

Taking shape on the tip-445 branch.

~ Rejected Alternatives

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|

|
|
|
|
|
|

|

|
|
|
|

|

|

|

|
|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

# TIP 445: Tcl_ObjType Utility Routines

	Author:         Don Porter <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        18-Mar-2016
	Post-History:   
	Tcl-Version:	8.7
-----

# Abstract

Proposes additional public routines useful for extensions that implement
custom **Tcl\_ObjType_s.

# Background

When an extension creates a custom **Tcl\_ObjType** it needs to operate on the fields of the **Tcl\_Obj** and the **Tcl\_ObjType** structs.

Almost all of these operations have been nicely encapsulated in utility routines, so for example, an extension calls **Tcl\_GetString** to make sure a value is set for _objPtr->bytes_, rather than worrying about the backing details of calling the routine _objPtr->typePtr->updateStringProc_ \(if present\) for itself.  Likewise **Tcl\_DuplicateObj** routes processing to type-specific routines as needed.

There are gaps in this interface.  Most glaring is the lack of any way to call the _freeIntRepProc_ of an incumbent type other than directly through the _typePtr_ field.  Another missing bit is an encapsulated way to set the string rep without direct manipulation of the _bytes_ and _length_ fields.  Within Tcl itself, there are internal utility macros **TclFreeIntRep** and **TclInitStringRep** for these tasks, but extensions have nothing.

Besides convenience, utility routines such as these improve chances for correctness, since they bring constraints into one place instead of many places.  For example, the requirement that when _objPtr->typePtr_ is not NULL, it must be paired with an appropriate _objPtr->internalRep_.  The **TclFreeIntRep** macro has a history of fixing such bugs.  A corresponding routine will offer the same benefit to extensions.

# Proposal

Add to Tcl's stub table of public C routines a new routine

 > void **Tcl\_FreeIntRep**\(Tcl\_Obj\* _objPtr_\)

that performs precisely the same task as the existing internal
macro **TclFreeIntRep**.

Add to Tcl's stub table of public C routines a new routine 

 > char \* **Tcl\_InitStringRep**\(Tcl\_Obj\* _objPtr_, const char\* _bytes_, unsigned int _numBytes_\)

that performs the function of the existing internal
macro **TclInitStringRep**, but is extended to return a pointer to the
string rep, and to accept NULL as a value for _bytes_.  When _bytes_ is
NULL and _objPtr_ has no string rep, an uninitialzed buffer
of _numBytes_ bytes is created for filling by the caller.
When _bytes_ is NULL and _objPtr_ has a string rep, the string rep will
be truncated to a length of _numBytes_ bytes.  When _numBytes_ is 
greater than zero, and the returned pointer is NULL, that indicates a
failure to allocate memory for the string representation.  The caller
may then choose whether to raise an error or panic.

Add to Tcl's stub table of public C routines a new routine 

 > int **Tcl\_HasStringRep**\(Tcl\_Obj\* _objPtr_\)

that returns a boolean indicating whether or not a string rep
is currently stored in _objPtr_.  This is used when the caller
wants to act on _objPtr_ differently depending on whether or
not it is a _pure_ value.  Typically this only makes sense in
an extension if it is already known that _objPtr_ possesses
an internal type that is managed by the extension.

Define a new public type

 > typedef union **Tcl\_ObjIntRep** \{...\} **Tcl\_ObjIntRep**

where the contents are exactly the existing contents of the union
in the _internalRep_ field of the **Tcl\_Obj** struct.  This definition
permits us to pass internal representations and pointers to them as
arguments and results in public routines.

Add to Tcl's stub table of public C routines a new routine 

 > void **Tcl\_StoreIntRep**\(Tcl\_Obj\* _objPtr_, const Tcl\_ObjType\* _typePtr_, const Tcl\_ObjIntRep\* _irPtr_\)

which stores in _objPtr_ a copy of the internal representation pointed
to by _irPtr_ and sets its type to _typePtr_.  When _irPtr_ is NULL,
this leaves _objPtr_ without a representation for type _typePtr_.

Add to Tcl's stub table of public C routines a new routine 

 > Tcl\_ObjIntRep\* **Tcl\_FetchIntRep**\(Tcl\_Obj\* _objPtr_, const Tcl\_ObjType\* _typePtr_\)

which returns a pointer to the internal representation stored
in _objPtr_ that matches the requested type _typePtr_.

# Compatibility

These are new routines, so they have no compatibility concerns in the sense of cause trouble for existing working code.

They do help set up an improved compatibility scenario for the future however.  Extensions that use these new routines to stop directly referring to the fields of the **Tcl\_Obj** and **Tcl\_ObjType** structs are prepared to support a source-compatible migration to a Tcl 9 that might then be free to make revisions to those structs.

# Implementation

Taking shape on the tip-445 branch.

# Rejected Alternatives

# Copyright

This document has been placed in the public domain.

Name change from tip/446.tip to tip/446.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111

112
113
114
115
116
117
118
119
120
121

122
123

124
125
126
127
128
129
130
131

132
133

134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150

TIP:            446
Title:          Introspect Undo/Redo Stack Depths
Version:        $Revision: 1.9 $
Author:         Fran�ois Vogel <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        05-Apr-2016
Post-History:   
Keywords:       Tk
Tcl-Version:    8.6.6

~ Abstract

Tk features a generic undo/redo mechanism (see [104]). This is used in
practice by the '''text''' widget, within the '''edit''' command. The present
TIP proposes to add two new subcommands to the '''edit''' command allowing the
user to know whether undo and redo is possible for a '''text''' widget.

~ Rationale

The undo/redo feature of Tk is handy and works very well. In modern GUIs,
there is usually a button in a menubar to call the "Undo" command, and
likewise for "Redo". It is good practice to enhance the user experience by
greying out or otherwise changing the button aspect when there is nothing to
undo (or redo). This cannot be achieved currently with the curent Tk
implementation because there is no way to know whether the undo and redo
stacks are empty or not.

This feature was requested for the text widget in RFE 1273358
[https://core.tcl.tk/tk/tktview/1273358]

~ Proposed Change

It is proposed to add the following subcommands to the '''text''' widget's
'''edit''' command:

'''canundo''':

 > Returns a boolean true if undo is possible, i.e. when the undo stack is
   not empty. Otherwise returns false.

'''canredo''':

 > Returns a boolean true if redo is possible, i.e. when the undo stack is
   not empty. Otherwise returns false.

When the '''-undo''' option of the text widget is false both subcommands return
false since indeed no undo nor redo action is possible in this case. The undo/redo
stacks are however not cleared, so that when -undo becomes eventually true again
the two new subcommands will return their value in accordance with the contents
of the stacks (this is current behavior of the undo/redo feature - the TIP does
not intend to change this).

This new capability will be implemented in the generic code for undo
(''generic/tkUndo.c''), so that any client of the generic undo/redo mechanism
can make use of it. Currently, only the '''text''' widget is concerned since
no other Tk widget is featuring undo/redo.

Besides the two new subcommands, it is also proposed to add a new virtual event
'''<<UndoStack>>''', that will trigger each time the undo or redo stack becomes
empty or unempty. When this condition is met, the event will trigger once for
each peer widget.

Despite this is a new feature, this TIP targets Tk 8.6.6 since no change of
any existing behavior is proposed: only new, additonal, features are proposed.
It is therefore believed there is no risk regarding backwards compatibility
in the 8.6.x series.

~ Example

|  package require Tk
|
|  pack [text .t -undo false -autoseparators false]
|
|  set nbUS 0
|  bind .t <<UndoStack>> {incr nbUS}
|
|  .t edit canundo    ; # 0
|  .t edit canredo    ; # 0
|
|  .t configure -undo true
|
|  .t edit canundo    ; # 0
|  .t edit canredo    ; # 0
|
|  .t insert end "ABC\n"
|  .t edit separator
|  .t insert end "DEF\n"
|  .t insert end "DEF again\n"
|  .t edit separator
|
|  .t edit canundo    ; # 1
|  .t edit canredo    ; # 0
|
|  .t edit undo
|
|  .t edit canundo    ; # 1
|  .t edit canredo    ; # 1
|
|  # A quick interactive testing environment...:
|  
|  pack [label .l]
|  
|  proc showit {} {
|    global nbUS
|    .l configure -text "Can undo: [.t edit canundo]\t\t\t \
|                        Can redo: [.t edit canredo]\t\t\t \
|                        <<UndoStack>> triggered: $nbUS"
|    after 200 showit
|  }

|  showit
|  
|  proc toggleautosep {} {
|    global autosepset
|    set autosepset [.t cget -autoseparators]
|    if {$autosepset} {
|      .t configure -autoseparators false
|    } else {
|      .t configure -autoseparators true
|    }

|    set autosepset [.t cget -autoseparators]
|  }

|  proc toggleundo {} {
|    global undoset
|    set undoset [.t cget -undo]
|    if {$undoset} {
|      .t configure -undo false
|    } else {
|      .t configure -undo true
|    }

|    set undoset [.t cget -undo]
|  }

|  button .b1 -text "Insert separator" -command {.t edit separator}
|  button .b2 -text "Undo" -command {.t edit undo}
|  button .b3 -text "Redo" -command {.t edit redo}
|  button .b32 -text "Reset" -command {.t edit reset}
|  checkbutton .b4 -text "-autoseparators" -command toggleautosep -variable autosepset
|  checkbutton .b5 -text "-undo" -command toggleundo -variable undoset ; .b5 select
|  
|  pack .b1 .b2 .b3 .b32 .b4 .b5 -side left -padx 10

~ Reference Implementation

A reference implementation is available in branch tip-446 of the fossil
repository. Credits for this implementation largely go to Neil Hodgson, the author of TkTextPlus (for canundo/canredo) and to Koen Danckaert (for <<UndoStack>>).

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|
|
|

|

|

|

|

|
|

|

|

|

|
|

|
|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
<
>
|
<
>
|
|
|
|
|
|
|
<
>
|
<
>
|
|
|
|
|
|
|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109

110
111
112
113
114
115
116
117
118
119

120
121

122
123
124
125
126
127
128
129

130
131

132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150

# TIP 446: Introspect Undo/Redo Stack Depths

	Author:         François Vogel <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        05-Apr-2016
	Post-History:   
	Keywords:       Tk
	Tcl-Version:    8.6.6
-----

# Abstract

Tk features a generic undo/redo mechanism \(see [[104]](104.md)\). This is used in
practice by the **text** widget, within the **edit** command. The present
TIP proposes to add two new subcommands to the **edit** command allowing the
user to know whether undo and redo is possible for a **text** widget.

# Rationale

The undo/redo feature of Tk is handy and works very well. In modern GUIs,
there is usually a button in a menubar to call the "Undo" command, and
likewise for "Redo". It is good practice to enhance the user experience by
greying out or otherwise changing the button aspect when there is nothing to
undo \(or redo\). This cannot be achieved currently with the curent Tk
implementation because there is no way to know whether the undo and redo
stacks are empty or not.

This feature was requested for the text widget in RFE 1273358
<https://core.tcl.tk/tk/tktview/1273358> 

# Proposed Change

It is proposed to add the following subcommands to the **text** widget's
**edit** command:

**canundo**:

 > Returns a boolean true if undo is possible, i.e. when the undo stack is
   not empty. Otherwise returns false.

**canredo**:

 > Returns a boolean true if redo is possible, i.e. when the undo stack is
   not empty. Otherwise returns false.

When the **-undo** option of the text widget is false both subcommands return
false since indeed no undo nor redo action is possible in this case. The undo/redo
stacks are however not cleared, so that when -undo becomes eventually true again
the two new subcommands will return their value in accordance with the contents
of the stacks \(this is current behavior of the undo/redo feature - the TIP does
not intend to change this\).

This new capability will be implemented in the generic code for undo
\(_generic/tkUndo.c_\), so that any client of the generic undo/redo mechanism
can make use of it. Currently, only the **text** widget is concerned since
no other Tk widget is featuring undo/redo.

Besides the two new subcommands, it is also proposed to add a new virtual event
**<<UndoStack>>**, that will trigger each time the undo or redo stack becomes
empty or unempty. When this condition is met, the event will trigger once for
each peer widget.

Despite this is a new feature, this TIP targets Tk 8.6.6 since no change of
any existing behavior is proposed: only new, additonal, features are proposed.
It is therefore believed there is no risk regarding backwards compatibility
in the 8.6.x series.

# Example

	  package require Tk

	  pack [text .t -undo false -autoseparators false]

	  set nbUS 0
	  bind .t <<UndoStack>> {incr nbUS}

	  .t edit canundo    ; # 0
	  .t edit canredo    ; # 0

	  .t configure -undo true

	  .t edit canundo    ; # 0
	  .t edit canredo    ; # 0

	  .t insert end "ABC\n"
	  .t edit separator
	  .t insert end "DEF\n"
	  .t insert end "DEF again\n"
	  .t edit separator

	  .t edit canundo    ; # 1
	  .t edit canredo    ; # 0

	  .t edit undo

	  .t edit canundo    ; # 1
	  .t edit canredo    ; # 1

	  # A quick interactive testing environment...:

	  pack [label .l]

	  proc showit {} {
	    global nbUS
	    .l configure -text "Can undo: [.t edit canundo]\t\t\t \
	                        Can redo: [.t edit canredo]\t\t\t \
	                        <<UndoStack>> triggered: $nbUS"
	    after 200 showit

	  }
	  showit

	  proc toggleautosep {} {
	    global autosepset
	    set autosepset [.t cget -autoseparators]
	    if {$autosepset} {
	      .t configure -autoseparators false
	    } else {
	      .t configure -autoseparators true

	    }
	    set autosepset [.t cget -autoseparators]

	  }
	  proc toggleundo {} {
	    global undoset
	    set undoset [.t cget -undo]
	    if {$undoset} {
	      .t configure -undo false
	    } else {
	      .t configure -undo true

	    }
	    set undoset [.t cget -undo]

	  }
	  button .b1 -text "Insert separator" -command {.t edit separator}
	  button .b2 -text "Undo" -command {.t edit undo}
	  button .b3 -text "Redo" -command {.t edit redo}
	  button .b32 -text "Reset" -command {.t edit reset}
	  checkbutton .b4 -text "-autoseparators" -command toggleautosep -variable autosepset
	  checkbutton .b5 -text "-undo" -command toggleundo -variable undoset ; .b5 select

	  pack .b1 .b2 .b3 .b32 .b4 .b5 -side left -padx 10

# Reference Implementation

A reference implementation is available in branch tip-446 of the fossil
repository. Credits for this implementation largely go to Neil Hodgson, the author of TkTextPlus \(for canundo/canredo\) and to Koen Danckaert \(for <<UndoStack>>\).

# Copyright

This document has been placed in the public domain.

Name change from tip/447.tip to tip/447.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84

85
86

87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117

TIP:            447
Title:          Execution Time Verbosity Levels in tcltest::configure
Version:        $Revision: 1.6 $
Author:         Pietro Cerutti <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        20-Apr-2016
Post-History:   
Keywords:       Tcl,tcltest
Tcl-Version:    8.7

~ Abstract

The '''-verbose''' option of the '''tcltest::configure''' command accepts a set of
verbosity levels to specify what pieces of information about tests the user
wants reported. This TIP proposes the addition of two new verbosity levels to
report information about the execution time of tests.

~ Rationale

When doing test-driven development, working on the refinement of a new
feature, or fixing a bug, it is very important to be able to measure the
effect of code changes on execution time.

The '''tcltest''' infrastructure is the testing framework used both by Tcl/Tk and
a number of extensions.  The '''tcltest''' infrastructure is highly configurable
and allows the user to choose which information on the tests being run are
reported. This can be done with the '''-verbose''' option of the
'''tcltest::configure''' command. Verbosity levels allow to report, e.g., when a
test passes, is skipped, or fails.

A proper way to measure the time spent running each test is currently missing.
Scope of this TIP is to address this issue by extending the set of verbosity
levels accepted.

~ Proposal

The '''-verbose''' option of the '''tcltest::configure''' command is modified to
accept the following new verbosity levels:

 usec (u): Report execution time of each test, in microseconds.

 msec (m): Report the execution time of each test, in milliseconds.

~ Example

This example demonstrates running a subset of the Tcl test suite with
verbosity level '''usec''' ('''u'''):

|$ make TESTFLAGS="-verbose u -match lsearch-1.*" test
|...
|lsearch.test
|++++ lsearch-1.1 took 521 us
|++++ lsearch-1.2 took 156 us
|++++ lsearch-1.3 took 187 us
|++++ lsearch-1.4 took 120 us

~ Discussion

~~ Additional Time Units

The implementation of additional verbosity levels to track execution times in
seconds, minutes, hours, and so on is trivial, but has been discarded as unit
or functional test are often meant to be fast. Another approach would be to
introduce a configurable verbosity level to carry information on the time unit
to be used. Examples could be '''time:usec''', '''time:msec''', and '''time:sec'''. This
option has also been discarded because it clashes with the current approach of
having verbosity level strings that can be shortened to a single character. It
is the author's opinion that milliseconds and microseconds should address most
use cases.

~~ When Timing Should be Displayed?

The current implementation dumps timing information before any reports of
success / failure. Example:

|---- tsv-lmdb-1.5 start
|++++ tsv-lmdb-1.5 took 828 us
|
|
|==== tsv-lmdb-1.5 tsv::exists - previously set exists FAILED
|---- Result was:
|1

|---- Result should have been (exact matching):
|0

|==== tsv-lmdb-1.5 FAILED

This allows for a simpler implementation that doesn't need to account for the
different code paths taken by '''tcltest''' when reporting success or failure. The
author doesn't have a strong opinion on this matter and is open to discussion,
should anybody have any counter-proposal.

~~ What to Print?

The current implementation decision to print "<testname> took <amount> <unit>"
is arbitrary. Again, the author has no strong opinion on the subject.

~~ On the Goodness of the Times Reported

FV, DGP, and and DKF have raised concerns on the mailing lists on the goodness
of the time values reported by the '''msec''' and '''usec''' verbosity levels.
In particular, the problem of repeatibility of the results has been mentioned,
and it has been noted that while the idea is good, this might not be the right
tool for (the|every) job.

My opinion is that the tool can be useful, given that its scope is made clear.
I followed DGP's suggestion and added [checkin 2b96ef] an explicit note in the
documentation about the "modest ambitions" of this enhancement.

~ Reference Implementation

In the gahr-tip-447 branch.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|
|

|
|

|

|

|

|

|

|

|
|
|
|
|
|
|

|

|

|

|

|
|
|
|
|
|
<
>
|
<
>
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82

83
84

85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117

# TIP 447: Execution Time Verbosity Levels in tcltest::configure

	Author:         Pietro Cerutti <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        20-Apr-2016
	Post-History:   
	Keywords:       Tcl,tcltest
	Tcl-Version:    8.7
-----

# Abstract

The **-verbose** option of the **tcltest::configure** command accepts a set of
verbosity levels to specify what pieces of information about tests the user
wants reported. This TIP proposes the addition of two new verbosity levels to
report information about the execution time of tests.

# Rationale

When doing test-driven development, working on the refinement of a new
feature, or fixing a bug, it is very important to be able to measure the
effect of code changes on execution time.

The **tcltest** infrastructure is the testing framework used both by Tcl/Tk and
a number of extensions.  The **tcltest** infrastructure is highly configurable
and allows the user to choose which information on the tests being run are
reported. This can be done with the **-verbose** option of the
**tcltest::configure** command. Verbosity levels allow to report, e.g., when a
test passes, is skipped, or fails.

A proper way to measure the time spent running each test is currently missing.
Scope of this TIP is to address this issue by extending the set of verbosity
levels accepted.

# Proposal

The **-verbose** option of the **tcltest::configure** command is modified to
accept the following new verbosity levels:

 usec \(u\): Report execution time of each test, in microseconds.

 msec \(m\): Report the execution time of each test, in milliseconds.

# Example

This example demonstrates running a subset of the Tcl test suite with
verbosity level **usec** \(**u**\):

	$ make TESTFLAGS="-verbose u -match lsearch-1.*" test
	...
	lsearch.test
	++++ lsearch-1.1 took 521 us
	++++ lsearch-1.2 took 156 us
	++++ lsearch-1.3 took 187 us
	++++ lsearch-1.4 took 120 us

# Discussion

## Additional Time Units

The implementation of additional verbosity levels to track execution times in
seconds, minutes, hours, and so on is trivial, but has been discarded as unit
or functional test are often meant to be fast. Another approach would be to
introduce a configurable verbosity level to carry information on the time unit
to be used. Examples could be **time:usec**, **time:msec**, and **time:sec**. This
option has also been discarded because it clashes with the current approach of
having verbosity level strings that can be shortened to a single character. It
is the author's opinion that milliseconds and microseconds should address most
use cases.

## When Timing Should be Displayed?

The current implementation dumps timing information before any reports of
success / failure. Example:

	---- tsv-lmdb-1.5 start
	++++ tsv-lmdb-1.5 took 828 us

	==== tsv-lmdb-1.5 tsv::exists - previously set exists FAILED
	---- Result was:

	1
	---- Result should have been (exact matching):

	0
	==== tsv-lmdb-1.5 FAILED

This allows for a simpler implementation that doesn't need to account for the
different code paths taken by **tcltest** when reporting success or failure. The
author doesn't have a strong opinion on this matter and is open to discussion,
should anybody have any counter-proposal.

## What to Print?

The current implementation decision to print "<testname> took <amount> <unit>"
is arbitrary. Again, the author has no strong opinion on the subject.

## On the Goodness of the Times Reported

FV, DGP, and and DKF have raised concerns on the mailing lists on the goodness
of the time values reported by the **msec** and **usec** verbosity levels.
In particular, the problem of repeatibility of the results has been mentioned,
and it has been noted that while the idea is good, this might not be the right
tool for \(the\|every\) job.

My opinion is that the tool can be useful, given that its scope is made clear.
I followed DGP's suggestion and added [checkin 2b96ef] an explicit note in the
documentation about the "modest ambitions" of this enhancement.

# Reference Implementation

In the gahr-tip-447 branch.

# Copyright

This document has been placed in the public domain.

Name change from tip/448.tip to tip/448.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61

TIP:		448
Title:		Update Tcl_SetNotifier to Reinitialize Event Loop
Version:	$Revision: 1.1 $
Author:		Jeff Rogers <[email protected]>
State:		Draft
Type:		Project
Tcl-Version:	8.7
Vote:		Pending
Created:	24-May-2016
Post-History:
Keywords: Tcl, C API

~ Abstract

Tcl_SetNotifier cannot be used in its current state to replace a notifier than
has been initialized because pointers to the old initialized value are kept in
the interp's private data.  This TIP proposes a way to change that.

~ Background

The '''Tcl_SetNotifier''' API was introduced to allow replacement of the
built-in notifier; it works by setting hooks for various notifier functions
that are called in place of the builtin functions.

It was expected that this function would only be called before the Tcl library
had been initialized and any state had been set up; however this prevents it
from being usable from a module loaded by a running interpreter.

This TIP proposes changing the behavior of this API to, in addition to setting
the function hooks will shut down a running notifier and restart the notifier
if it has been running previously.  This is the minimum change necessary to
allow a new notifier to be loaded from within a running interpreter.

It is probably not always possible to stop a running notifier, especially if
event sources have been created.  It is not necessarily easy to detect such
failure; the proposed implementation doesn't even try.  And there is no way to
report such failure if it was detected.  As such, this remains a somewhat
dangerous interface to use.

With this change in place, it will be possible to load a new notifier
implementation (e.g., a ''poll()'' or ''libevent'' based one) from a Tcl
program, provided it is the very first thing done.  An implementation of a
poll-based notifier that requires the functionality in this TIP can be found
at http://fossil.etoyoc.com/sandbox/tcllib/artifact/ad30080cdee762a3

~ Description

Change the implementation of '''Tcl_SetNotifier''' to check if a notifier is
currently running; if it is, the current notifier will be finalized before the
new hooks are swapped in, and afterwards the notifier will be re-initialized.
If the notifier is not already initialized, there is no change to
functionality.

~ Implementation

A patch implementing the proposed change can be found at
http://fossil.etoyoc.com/sandbox/tcllib/artifact/b2b272a285811272

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61

# TIP 448: Update Tcl_SetNotifier to Reinitialize Event Loop

	Author:		Jeff Rogers <[email protected]>
	State:		Draft
	Type:		Project
	Tcl-Version:	8.7
	Vote:		Pending
	Created:	24-May-2016
	Post-History:
	Keywords: Tcl, C API
-----

# Abstract

Tcl\_SetNotifier cannot be used in its current state to replace a notifier than
has been initialized because pointers to the old initialized value are kept in
the interp's private data.  This TIP proposes a way to change that.

# Background

The **Tcl\_SetNotifier** API was introduced to allow replacement of the
built-in notifier; it works by setting hooks for various notifier functions
that are called in place of the builtin functions.

It was expected that this function would only be called before the Tcl library
had been initialized and any state had been set up; however this prevents it
from being usable from a module loaded by a running interpreter.

This TIP proposes changing the behavior of this API to, in addition to setting
the function hooks will shut down a running notifier and restart the notifier
if it has been running previously.  This is the minimum change necessary to
allow a new notifier to be loaded from within a running interpreter.

It is probably not always possible to stop a running notifier, especially if
event sources have been created.  It is not necessarily easy to detect such
failure; the proposed implementation doesn't even try.  And there is no way to
report such failure if it was detected.  As such, this remains a somewhat
dangerous interface to use.

With this change in place, it will be possible to load a new notifier
implementation \(e.g., a _poll\(\)_ or _libevent_ based one\) from a Tcl
program, provided it is the very first thing done.  An implementation of a
poll-based notifier that requires the functionality in this TIP can be found
at <http://fossil.etoyoc.com/sandbox/tcllib/artifact/ad30080cdee762a3>

# Description

Change the implementation of **Tcl\_SetNotifier** to check if a notifier is
currently running; if it is, the current notifier will be finalized before the
new hooks are swapped in, and afterwards the notifier will be re-initialized.
If the notifier is not already initialized, there is no change to
functionality.

# Implementation

A patch implementing the proposed change can be found at
<http://fossil.etoyoc.com/sandbox/tcllib/artifact/b2b272a285811272>

# Copyright

This document has been placed in the public domain.

Name change from tip/449.tip to tip/449.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91

TIP:            449
Title:          [text] undo/redo to Return Range of Characters
Version:        $Revision: 1.12 $
Author:         Fran�ois Vogel <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        07-Jun-2016
Post-History:   
Keywords:       Tk
Tcl-Version:    8.7

~ Abstract

Tk features an undo/redo mechanism for the '''text''' widget. This TIP
proposes that the '''edit undo''' and '''edit redo''' commands of the text
widget return the ranges of characters impacted by the undo or redo operation.

~ Rationale

In some applications using the '''text''' widget, say a text editor, modern
practice is to show the text with highlighting, colorization, or any other
ways of improving readablity for the human user. When undoing/redoing changes
in the text, these applications need to know which characters were impacted by
the undo or redo operation. This can be done by comparing the '''text'''
widget contents before the change with its contents after the change, but this
is very far from optimal especially with large texts.

Therefore a better way to get this information is proposed in the present TIP.

This feature was requested for the text widget in RFE 1217222
[https://core.tcl.tk/tk/tktview/1217222]

~ Proposed Change

Currently, '''edit undo''' and '''edit redo''' commands return the empty
string.

It is proposed to change this return value and make these commands return the
ranges of indices that were impacted by the undo or redo operation.

This return value is a list of indices, with an even number of elements.
Indeed, there can be several edits (insertions and deletions) between two
separators in the undo stack and all such edits have to report which range of
text they changed.

The returned ranges are made of indices making sense at undo/redo return
time, i.e. they refer to the text widget content at the time '''edit undo'''
and '''edit redo''' return. Moreover, the returned list of indices is optimal,
in the sense that the ranges contained in that list are all disjoint
(overlapping ranges are merged, and ranges already contained in another range
are not returned).

~ Backwards Compatibility

The proposed implementation makes use of temporary marks, that only exist
''during'' an undo or redo operation (and never before or after these
operations). This could be seen as a concern regarding backwards
compatibility, however it is believed that the names of these marks has been
chosen such that this should not be an issue. The chosen name is
'''tk::undoMark<g><ID>''', where '''<g>''' is either '''L''' or '''R''', and
'''<ID>''' is an integer identifier. The risk of name collision is therefore
deemed very low due to prefixing by '''tk::'''. This naming convention follows
what is already existing in the text widget (internal anchors).

~ Example

|  package require Tk
|
|  pack [text .t -undo true]
|  .t insert end "Hello World.\n"
|  .t edit separator
|  .t insert end "Again hello.\n"
|
|  .t edit undo  ; # will now return {2.0 3.0}
|  .t edit redo  ; # will now return {2.0 3.0}
|

More examples can be found by studying the new tests in the feature branch
tip-449 hosting the development.

~ Reference Implementation

A reference implementation is available in branch tip-449 of the Tk fossil
repository.

This new capability is implemented entirely in the code of the text widget. The undo/redo generic code (in ''generic/tkUndo.c'') is untouched.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|

|

|

|

|

|

|
|

|
|

|

|
|

|
|
|
|

|

|
|
|
|
|
|
|
|
|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91

# TIP 449: [text] undo/redo to Return Range of Characters

	Author:         François Vogel <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        07-Jun-2016
	Post-History:   
	Keywords:       Tk
	Tcl-Version:    8.7
-----

# Abstract

Tk features an undo/redo mechanism for the **text** widget. This TIP
proposes that the **edit undo** and **edit redo** commands of the text
widget return the ranges of characters impacted by the undo or redo operation.

# Rationale

In some applications using the **text** widget, say a text editor, modern
practice is to show the text with highlighting, colorization, or any other
ways of improving readablity for the human user. When undoing/redoing changes
in the text, these applications need to know which characters were impacted by
the undo or redo operation. This can be done by comparing the **text**
widget contents before the change with its contents after the change, but this
is very far from optimal especially with large texts.

Therefore a better way to get this information is proposed in the present TIP.

This feature was requested for the text widget in RFE 1217222
<https://core.tcl.tk/tk/tktview/1217222> 

# Proposed Change

Currently, **edit undo** and **edit redo** commands return the empty
string.

It is proposed to change this return value and make these commands return the
ranges of indices that were impacted by the undo or redo operation.

This return value is a list of indices, with an even number of elements.
Indeed, there can be several edits \(insertions and deletions\) between two
separators in the undo stack and all such edits have to report which range of
text they changed.

The returned ranges are made of indices making sense at undo/redo return
time, i.e. they refer to the text widget content at the time **edit undo**
and **edit redo** return. Moreover, the returned list of indices is optimal,
in the sense that the ranges contained in that list are all disjoint
\(overlapping ranges are merged, and ranges already contained in another range
are not returned\).

# Backwards Compatibility

The proposed implementation makes use of temporary marks, that only exist
_during_ an undo or redo operation \(and never before or after these
operations\). This could be seen as a concern regarding backwards
compatibility, however it is believed that the names of these marks has been
chosen such that this should not be an issue. The chosen name is
**tk::undoMark<g><ID>**, where **<g>** is either **L** or **R**, and
**<ID>** is an integer identifier. The risk of name collision is therefore
deemed very low due to prefixing by **tk::**. This naming convention follows
what is already existing in the text widget \(internal anchors\).

# Example

	  package require Tk

	  pack [text .t -undo true]
	  .t insert end "Hello World.\n"
	  .t edit separator
	  .t insert end "Again hello.\n"

	  .t edit undo  ; # will now return {2.0 3.0}
	  .t edit redo  ; # will now return {2.0 3.0}

More examples can be found by studying the new tests in the feature branch
tip-449 hosting the development.

# Reference Implementation

A reference implementation is available in branch tip-449 of the Tk fossil
repository.

This new capability is implemented entirely in the code of the text widget. The undo/redo generic code \(in _generic/tkUndo.c_\) is untouched.

# Copyright

This document has been placed in the public domain.

Name change from tip/45.tip to tip/45.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90

TIP:            45
Title:          Empty index lists for [lindex] and [lset]
Version:        $Revision: 1.9 $
Author:         Kevin Kenny <[email protected]>
Author:         Don Porter <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        18-Jul-2001
Post-History:   
Discussions-To: news:comp.lang.tcl,mailto:[email protected]
Keywords:       lindex,lset,multiple arguments,sublists
Tcl-Version:    8.4b1

~ Abstract

TIP's #22 and #33 contain an oversight in specifying the behavior
of the multi-argument forms of ''lset'' and ''lindex'' when an empty
index list is specified.  The intended behavior is that an empty list
of indices designates the entire list.

~ Rationale

In the discussion of [33]
([http://www.geocrawler.com/archives/3/7375/2001/5/0/5784409/]), Jim
Ingham pointed out that the list of indices presented to the
multi-argument forms of ''lindex'' and ''lset'' is analogous to a
database cursor.  This cursor is conceptually navigating a tree
structure; the command:

|  lindex $list {1 2 3}

means, "extract the sublist at index 1 in $list, the sublist at
index 2 in that list, and the element at index 3 in that list".

When implementing this functionality, the author of this TIP
realized that [22] and [33] provide no way to
address the root of the tree -- the entire list being manipulated.
An empty list of indices is a convenient means of specifying the root.

~ Specification

 1. The specification of ''lindex'' in [22] shall be amended so that
    the forms:

|  lindex list

 >  and

|  lindex list {}

 >  will return the value of the entire list.  The ''list'' parameter
    is not required to be a well-formed Tcl list when
    this form is used.

 1. The specification of ''lset'' in [33] shall be amended so that
    the forms:

|  lset var value

 >  and

|  lset var {} value

 >  will simply store the supplied value into the variable named ''var''.
    Neither the old nor the new value of ''var'' is required to be
    a well-formed Tcl list when this form is used.  The return value of
    the operation, as with all other uses of ''lset'', is the
    new value of ''var''.

~ Reference implementation

Work progresses on implementing this functionality; the currently
committed version is on SourceForge in the branch labeled,
''kennykb-tcl-22-33''.

~ Discussion

Since this proposed change introduces syntax that is expressly forbidden in
[22] and [33], it does not have any impact on backward compatibility.
For the same reason, the author thought it unwise to proceed with its
implementation without a vote of the TCT.

~ See Also

[22], [33].

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|
|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90

# TIP 45: Empty index lists for [lindex] and [lset]

	Author:         Kevin Kenny <[email protected]>
	Author:         Don Porter <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        18-Jul-2001
	Post-History:   
	Discussions-To: news:comp.lang.tcl,mailto:[email protected]
	Keywords:       lindex,lset,multiple arguments,sublists
	Tcl-Version:    8.4b1
-----

# Abstract

TIP's \#22 and \#33 contain an oversight in specifying the behavior
of the multi-argument forms of _lset_ and _lindex_ when an empty
index list is specified.  The intended behavior is that an empty list
of indices designates the entire list.

# Rationale

In the discussion of [[33]](33.md)
\(<http://www.geocrawler.com/archives/3/7375/2001/5/0/5784409/> \), Jim
Ingham pointed out that the list of indices presented to the
multi-argument forms of _lindex_ and _lset_ is analogous to a
database cursor.  This cursor is conceptually navigating a tree
structure; the command:

	  lindex $list {1 2 3}

means, "extract the sublist at index 1 in $list, the sublist at
index 2 in that list, and the element at index 3 in that list".

When implementing this functionality, the author of this TIP
realized that [[22]](22.md) and [[33]](33.md) provide no way to
address the root of the tree -- the entire list being manipulated.
An empty list of indices is a convenient means of specifying the root.

# Specification

 1. The specification of _lindex_ in [[22]](22.md) shall be amended so that
    the forms:

		  lindex list

	 >  and

		  lindex list {}

	 >  will return the value of the entire list.  The _list_ parameter
    is not required to be a well-formed Tcl list when
    this form is used.

 1. The specification of _lset_ in [[33]](33.md) shall be amended so that
    the forms:

		  lset var value

	 >  and

		  lset var {} value

	 >  will simply store the supplied value into the variable named _var_.
    Neither the old nor the new value of _var_ is required to be
    a well-formed Tcl list when this form is used.  The return value of
    the operation, as with all other uses of _lset_, is the
    new value of _var_.

# Reference implementation

Work progresses on implementing this functionality; the currently
committed version is on SourceForge in the branch labeled,
_kennykb-tcl-22-33_.

# Discussion

Since this proposed change introduces syntax that is expressly forbidden in
[[22]](22.md) and [[33]](33.md), it does not have any impact on backward compatibility.
For the same reason, the author thought it unwise to proceed with its
implementation without a vote of the TCT.

# See Also

[[22]](22.md), [[33]](33.md).

# Copyright

This document has been placed in the public domain.

Name change from tip/450.tip to tip/450.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93

TIP:		450
Title:	Add [binary] subcommand "set" for in-place modification
Version:	$Revision: 1.1 $
Author:		Arjen Markus <[email protected]>
State:		Draft
Type:		Project
Vote:		Pending
Created:	18-Jul-2016
Post-History: 
Tcl-Version:	8.7
Keywords:	Tcl, binary data

~ Abstract

This TIP proposes a simpler extension of the [binary] command than the related
[418], namely just a subcommand '''set''' that updates an existing byte array
in a variable instead of creating a new one like '''binary format'''. It does
not propose an extension of the various formatting codes.

~ Rationale

As already argued in [418], the '''binary''' command is efficient in creating
new objects of binary data or in parsing existing objects with such data. It
is not currently efficient in ''updating'' existing objects. However, such
data objects are commonly used by compiled extensions.

As a consequence, if you want to manipulate such data objects from Tcl
directly, it is easier to parse the object into, say, a list of numbers, use
list commands like '''lset''' to replace individual values and pack it into a
new binary array before passing it to a compiled extension.

~ Specification

This TIP proposes to add a subcommand '''set''' to the '''binary''' command
with the following signature:

 > '''binary set''' ''varName formatString arg1 arg2 arg3 ...''

The effect of this subcommand is that the byte array data contained in the
variable "varName" is updated in a manner analogous to '''lset''', but using a
format string like '''binary format'''. It could be implemented in Tcl as:

 > set varName [binary format "a*$formatString" $varName $arg1 $arg2 $arg3 ...]

except that this allocates a new block of memory, sets that to null, copies
the contents of ''varName'' into that new block and then does the update.

The new command will have the effect that the first few steps are not
necessary anymore.

~ Implementation Notes

Besides the nominal case of a variable that contains a binary array that is to
be updated within the bounds of that array, three other cases exist and need
to be prepared for:

 * The variable varName does not exist yet

 * The variable varName does not contain a binary array

 * The updating would go past the memory allocated for the binary array

Each of these cases and perhaps others will have to be taken care of. The
first case might be treated as if '''binary format''' was meant. For the
second case the implementation can convert the current value.

The third case might either cause an error (we are updating an existing block
of memory after all) or silently extend the memory, effectively performing
what the Tcl implementation shown above would do. If an error is thrown, then
the first case should probably throw an error as well.

~ Reference Implementation

To be committed to a fossil branch.

A few remarks:

 * The third case is not completely treated yet (see "TODO" in the code)

 * The '''binary set''' command does not properly invalidate the string
   representation. The binary array is, however, updated properly - at least
   according to the very limited tests that were performed.

 * There are no proper test cases yet.

 * There is no proper performance measurement yet; this could be based
   on Wiki page http://wiki.tcl.tk/44363.

 * It does not deal with the possibility that the binary array is shared.

~ Copyright

This document is placed in public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|

|

|

|

|

|

|
|

|

|

|

|
|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93

# TIP 450: Add [binary] subcommand "set" for in-place modification

	Author:		Arjen Markus <[email protected]>
	State:		Draft
	Type:		Project
	Vote:		Pending
	Created:	18-Jul-2016
	Post-History: 
	Tcl-Version:	8.7
	Keywords:	Tcl, binary data
-----

# Abstract

This TIP proposes a simpler extension of the [binary] command than the related
[[418]](418.md), namely just a subcommand **set** that updates an existing byte array
in a variable instead of creating a new one like **binary format**. It does
not propose an extension of the various formatting codes.

# Rationale

As already argued in [[418]](418.md), the **binary** command is efficient in creating
new objects of binary data or in parsing existing objects with such data. It
is not currently efficient in _updating_ existing objects. However, such
data objects are commonly used by compiled extensions.

As a consequence, if you want to manipulate such data objects from Tcl
directly, it is easier to parse the object into, say, a list of numbers, use
list commands like **lset** to replace individual values and pack it into a
new binary array before passing it to a compiled extension.

# Specification

This TIP proposes to add a subcommand **set** to the **binary** command
with the following signature:

 > **binary set** _varName formatString arg1 arg2 arg3 ..._

The effect of this subcommand is that the byte array data contained in the
variable "varName" is updated in a manner analogous to **lset**, but using a
format string like **binary format**. It could be implemented in Tcl as:

 > set varName [binary format "a*$formatString" $varName $arg1 $arg2 $arg3 ...]

except that this allocates a new block of memory, sets that to null, copies
the contents of _varName_ into that new block and then does the update.

The new command will have the effect that the first few steps are not
necessary anymore.

# Implementation Notes

Besides the nominal case of a variable that contains a binary array that is to
be updated within the bounds of that array, three other cases exist and need
to be prepared for:

 * The variable varName does not exist yet

 * The variable varName does not contain a binary array

 * The updating would go past the memory allocated for the binary array

Each of these cases and perhaps others will have to be taken care of. The
first case might be treated as if **binary format** was meant. For the
second case the implementation can convert the current value.

The third case might either cause an error \(we are updating an existing block
of memory after all\) or silently extend the memory, effectively performing
what the Tcl implementation shown above would do. If an error is thrown, then
the first case should probably throw an error as well.

# Reference Implementation

To be committed to a fossil branch.

A few remarks:

 * The third case is not completely treated yet \(see "TODO" in the code\)

 * The **binary set** command does not properly invalidate the string
   representation. The binary array is, however, updated properly - at least
   according to the very limited tests that were performed.

 * There are no proper test cases yet.

 * There is no proper performance measurement yet; this could be based
   on Wiki page <http://wiki.tcl.tk/44363.>

 * It does not deal with the possibility that the binary array is shared.

# Copyright

This document is placed in public domain.

Name change from tip/451.tip to tip/451.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79

TIP:            451
Title:          Modify [update] to Give Full Script Access to Tcl_DoOneEvent
Version:        $Revision: 1.2 $
Author:         Colin McCormack <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        10-Aug-2016
Post-History:   
Keywords:       Tcl,event loop
Tcl-Version:    8.7

~ Abstract

This TIP add flags to '''update''' to represent all the flag values available
to the underlying API, ''Tcl_DoOneEvent()'', exposing them to script access.

~ Rationale

The '''update''' command provides a way into the event loop in addition to '''vwait'''.  Because the Tcl event system derived historically from Tk, '''update''' was written specifically to support Tk's window and idle events.
When Tcl adopted the event system for timers and file events, the '''update''' command was anomalously not updated to cater for these new event types.

The reason this anomaly is worth correcting is that Tcl_DoOneEvent is the actual entry into the event loop and there is no good or sufficient reason for '''update''' to limit the flags to those specific to Tk only.

While '''vwait''' provides a reasonable way into and out of the event loop, there's no good reason to impose that communication mechanism on the application when there are other approaches possible, arguably more useful, and amply provided for by the underlying C implementation.

~ Proposal

The '''update''' command should have the following flags added to it:

 * '''idletasks''' - process any pending window events or idle events, do not wait (this is as currently supported)

 * '''window''' - process window events

 * '''file''' - process file events

 * '''timer''' - process timer events

 * '''onlyidle''' - process only idle events

 * '''all''' - process all events

 * '''wait''' - wait until at least one event has been processed

 * '''nowait''' - return immediately if no events are pending.

 and they should be painted Pantone 13-1520

The somewhat klunky and denormalised logical form of these flags is imposed by the requirement to minimally disrupt existing '''update''' functionality.

~ Reference Implementation

A reference implementation is available in Tcl core fossil under tag
''updateextended''.

~ Use Case 1

Application of '''after idle''' constructs a collection of ''idle tasks.''  An ''idle task'' represents something to be performed when no events are pending.  This proposal exposes the idle state directly to a Tcl script and the application can do what it will in that state.

Use Case Summary: "I would like to handle ''idle tasks'' in some particular or specific order, at script level."

~ Use Case 2

A script may have accepted network file i/o for some time, and have a number of pending background (aka idle) tasks to perform as a result of that i/o.  It may wish to throttle acceptance of connections, or further read events, pending that background processing.

Use Case Summary: As a special case of "I would like direct control over my 'idle' time.", "I would like to throttle network input."

~ Rejected Alternatives

One could create a new command for this functionality, but the minimal impact
on most Tcl users doesn't seem to me to warrant additional population of the
:: command namespace.

One could move '''update''' out of Tcl altogether (to avoid its explicit dependency on Tk events) into Tk.  This would open up the :: namespace to a more general event-handling mechanism.

~ Copyright

This document has been placed in the public domain. In legislations where this
concept does not exist the CC0 license applies.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79

# TIP 451: Modify [update] to Give Full Script Access to Tcl_DoOneEvent

	Author:         Colin McCormack <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        10-Aug-2016
	Post-History:   
	Keywords:       Tcl,event loop
	Tcl-Version:    8.7
-----

# Abstract

This TIP add flags to **update** to represent all the flag values available
to the underlying API, _Tcl\_DoOneEvent\(\)_, exposing them to script access.

# Rationale

The **update** command provides a way into the event loop in addition to **vwait**.  Because the Tcl event system derived historically from Tk, **update** was written specifically to support Tk's window and idle events.
When Tcl adopted the event system for timers and file events, the **update** command was anomalously not updated to cater for these new event types.

The reason this anomaly is worth correcting is that Tcl\_DoOneEvent is the actual entry into the event loop and there is no good or sufficient reason for **update** to limit the flags to those specific to Tk only.

While **vwait** provides a reasonable way into and out of the event loop, there's no good reason to impose that communication mechanism on the application when there are other approaches possible, arguably more useful, and amply provided for by the underlying C implementation.

# Proposal

The **update** command should have the following flags added to it:

 * **idletasks** - process any pending window events or idle events, do not wait \(this is as currently supported\)

 * **window** - process window events

 * **file** - process file events

 * **timer** - process timer events

 * **onlyidle** - process only idle events

 * **all** - process all events

 * **wait** - wait until at least one event has been processed

 * **nowait** - return immediately if no events are pending.

 and they should be painted Pantone 13-1520

The somewhat klunky and denormalised logical form of these flags is imposed by the requirement to minimally disrupt existing **update** functionality.

# Reference Implementation

A reference implementation is available in Tcl core fossil under tag
_updateextended_.

# Use Case 1

Application of **after idle** constructs a collection of _idle tasks._  An _idle task_ represents something to be performed when no events are pending.  This proposal exposes the idle state directly to a Tcl script and the application can do what it will in that state.

Use Case Summary: "I would like to handle _idle tasks_ in some particular or specific order, at script level."

# Use Case 2

A script may have accepted network file i/o for some time, and have a number of pending background \(aka idle\) tasks to perform as a result of that i/o.  It may wish to throttle acceptance of connections, or further read events, pending that background processing.

Use Case Summary: As a special case of "I would like direct control over my 'idle' time.", "I would like to throttle network input."

# Rejected Alternatives

One could create a new command for this functionality, but the minimal impact
on most Tcl users doesn't seem to me to warrant additional population of the
:: command namespace.

One could move **update** out of Tcl altogether \(to avoid its explicit dependency on Tk events\) into Tk.  This would open up the :: namespace to a more general event-handling mechanism.

# Copyright

This document has been placed in the public domain. In legislations where this
concept does not exist the CC0 license applies.

Name change from tip/452.tip to tip/452.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80

TIP:            452
Title:          Add "stubs" Package to or Along Side of TclTest
Version:        $Revision: 1.8 $
Author:         Gerald Lester <[email protected]>
Author:         Gerald W. Lester <[email protected]>
Author:         Gerald W. Lester <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        10-Aug-2016
Post-History:   
Tcl-Version:    8.6

~ Abstract

This TIP proposes an enhancement to the '''tcltest''' package to add support
for easy creation of test stubs, mocks and seams.

~ Rationale

The '''tcltest''' package allows for automated testing of Tcl code. However,
doing proper automated unit testing requires that the unit under test (i.e.,
the method or procedure) not invoke the actual implementation of other units,
but rather should invoke stub or mock units that are under the control of the
test being performed as to the results they return and any exceptions they
raise.

This TIP adds support for building these mechanisms, making it significantly
easier to create isolated unit tests of Tcl code.

~ Proposal

That a framework to easily create test stubs/mocks of
Tcl commands be added to the '''tcltest'' package.  Additionally, to facilitate the creation of automated test for legacy Tcl code.  Commands supporting ''test
seam'' creation and specification would also be included proposed package.

~~ Description

It provides a fully functional implementation of the
following commands:

 * '''::tcltest::TestSetup''' - Defines which procedures/commands are stubbed out
   and how they should behave for each invocation. This should only be called
   once per test.

 * '''::tcltest::AddStub''' - Adds a procedures/commands to the list that are
   stubbed out.

 * '''::tcltest::SaveVars''' - Saves the values of variables to be restored later.
   This should only be called once per test.

 * '''::tcltest::AddVars''' - Add a variable to the list of variables to be
   restored later

 * '''::tcltest::CallCount''' - Returns a dictionary sorted list of the stubbed
   out procedures and how many times they were called.

 * '''::tcltest::TestCleanup''' - Restores saved variables and stubbed out
   procedures.

 * '''::tcltest::SortedArrayData''' - Return the values of an array as a list of
   key-value pairs sorted by the keys.

 * '''::tcltest::CallProc''' - Call the real implementation of a stubbed out
   procedure.

 * '''::tcltest::Seam''' - Test seam definition and injection (aka enabling).  This command is available without requiring the tcltest package. 

~ Reference Implementation

See the tip-452 branch (http://core.tcl.tk/tcl/timeline?n=100&r=tip-452), in particular, see http-tip-452.test for an example of using the procedures added by this package.

~~ Origins of Reference Implementation

The reference implementation was done at SAP Labs, LLC (a subsidiary of SAP
Americal, Inc.) and approved for release as Open Source under a BSD license.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|
|
|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80

# TIP 452: Add "stubs" Package to or Along Side of TclTest

	Author:         Gerald Lester <[email protected]>
	Author:         Gerald W. Lester <[email protected]>
	Author:         Gerald W. Lester <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        10-Aug-2016
	Post-History:   
	Tcl-Version:    8.6
-----

# Abstract

This TIP proposes an enhancement to the **tcltest** package to add support
for easy creation of test stubs, mocks and seams.

# Rationale

The **tcltest** package allows for automated testing of Tcl code. However,
doing proper automated unit testing requires that the unit under test \(i.e.,
the method or procedure\) not invoke the actual implementation of other units,
but rather should invoke stub or mock units that are under the control of the
test being performed as to the results they return and any exceptions they
raise.

This TIP adds support for building these mechanisms, making it significantly
easier to create isolated unit tests of Tcl code.

# Proposal

That a framework to easily create test stubs/mocks of
Tcl commands be added to the **tcltest_ package.  Additionally, to facilitate the creation of automated test for legacy Tcl code.  Commands supporting _test
seam_ creation and specification would also be included proposed package.

## Description

It provides a fully functional implementation of the
following commands:

 * **::tcltest::TestSetup** - Defines which procedures/commands are stubbed out
   and how they should behave for each invocation. This should only be called
   once per test.

 * **::tcltest::AddStub** - Adds a procedures/commands to the list that are
   stubbed out.

 * **::tcltest::SaveVars** - Saves the values of variables to be restored later.
   This should only be called once per test.

 * **::tcltest::AddVars** - Add a variable to the list of variables to be
   restored later

 * **::tcltest::CallCount** - Returns a dictionary sorted list of the stubbed
   out procedures and how many times they were called.

 * **::tcltest::TestCleanup** - Restores saved variables and stubbed out
   procedures.

 * **::tcltest::SortedArrayData** - Return the values of an array as a list of
   key-value pairs sorted by the keys.

 * **::tcltest::CallProc** - Call the real implementation of a stubbed out
   procedure.

 * **::tcltest::Seam** - Test seam definition and injection \(aka enabling\).  This command is available without requiring the tcltest package. 

# Reference Implementation

See the tip-452 branch \(<http://core.tcl.tk/tcl/timeline?n=100&r=tip-452\),> in particular, see http-tip-452.test for an example of using the procedures added by this package.

## Origins of Reference Implementation

The reference implementation was done at SAP Labs, LLC \(a subsidiary of SAP
Americal, Inc.\) and approved for release as Open Source under a BSD license.

# Copyright

This document has been placed in the public domain.

Name change from tip/453.tip to tip/453.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136

137
138
139
140
141
142
143
144
145
146
147
148

149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164

165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191

TIP:            453
Title:          Tcl Based Automation for tcl/pkgs
Version:        $Revision: 1.2 $
Author:         Sean Woods <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        13-Sep-2016
Post-History:   
Keywords:       Build tooling
Tcl-Version:    8.7

~ Abstract

This TIP proposes replacing the '''make package''' process currently employed
by the core with a Tcl-based build automation tool.

~ Background

[376] provides for the distribution of third party packages in the Tcl/Tk core
distributions. To support that TIP currently requires three separate build
automations: a Makefile based automation in Unix, and both an nmake and
Makefile based automation for Windows. These automation systems can get out of
sync and they assume that their job is to build dynamic libraries for local
installation.

~ The Pitch

By the time '''make packages''' has run, the local Tcl interpreter has been
built already. Rather than rely on delicate hacks and makefile tricks, core
distributed packages could be built and installed via exec commands inside a
Tcl script. In addition, this same automation could handle functions like
injecting a core distributed package into a virtual file system, as well as
bundling the Tcl/Tk library file system for [430].

~ The Implementation

For the ''practcl'' branch of tclconfig, I put together a 4000 line
self-contained package and kit building library of tools. This library is
TclOO based, and provides a rudimentary (but functional) system for templating
C code in Tcl, as well as a build system that is capable of nesting
sub-projects. It also steals the useful bits from the '''fileutil''' module of
tcllib, providing implementations for concatenating files, performing file
searches, and building a global package index from a soup of modules. The
library also has a wrapper to download external sources from fossil. It also
contains procs that can compile a static library, dynamic library, or self
contained shell directly from exec calls.

I would propose that this tool system (or a new creation by the community in a
similar spirit) be included in the library/ section of the tcl core. The
provisional name for this tool set would be '''practcl'''. A version of this
tool could also be provided in tcllib to allow 8.5 and 8.6 based cores to
continue to build extensions.

In the new scheme, '''make packages''' (in all its forms) would be replaced
with a call to "''$(TCLSH) ${srcdir}/pkgs/make.tcl build''". '''make
packages-install''' would be replaced by a call to "''$(TCLSH)
${srcdir}/pkgs/make.tcl install''". For advanced users, these toplevel commands
'''build''' and '''install''' will accept additional arguments. For instance,
to install the core distributed packages into the VFS of a kit: "''$(TCLSH)
${srcdir}/make.tcl install -destdir ${MyVFS}/lib''".

~ pkgs/make.tcl

'''make.tcl''' would be maintained as part of the core, and provide the
top-level control system to build, install, or repackage the core distributed
extensions. That script will also provide mechanisms to populate the pkgs file
system for developers who build the tcl core from fossil checkouts.

Commands:

 * '''basekit'''

 > Compile a ZipFs style basekit suitable for the '''wrap''' command.

 * '''build''' ?'''all'''? ?''package''? ?''package...''?

 > Compile the source code for core distributed packages into binary products
   (as applicable.) If '''all''' is given, an attempt is made to compile all
   packages under ''pkgs/''. Any other argument is interpreted to be the name
   of an individual package to be compiled.

 * '''install''' ?'''-destdir''' ''destinationpath''?

 > Install all core distributed packages locally. If '''-destdir''' is given,
   install the packages relative to ''destinationpath'' in the same way that
   "'''make DESTDIR=''' ''destdir''" would. If '''-destdir''' is not given, or
   is an empty string, perform an install relative to the '''exec_prefix''' in
   '''tclConfig.sh'''

 * '''wrap''' ''exename'' ''vfspath'' ?''dir...''?

 > Generate a self contained executable constructed from the virtual file
    system amalgamated from ''vfspath'' and any other directories given as
    arguments. This VFS will automatically be populated with the
    '''library/''' file system from Tcl.

 * '''distribution'''

> Download packages listed in the "packages.tcl" file, unpack their source 
    code in the /pkgs folder, and perform any steps required to prepare those
    extensions for inclusion in a core snapshot for distribution.

 * '''developer'''

> Download packages listed in the "packages.tcl" file, unpack their source 
    code in the parent folder to the one the core has been unpacked from,
    and perform any steps required to prepare those extensions to be compiled
    locally as part of a developer build. This is intended for developers who work
    from fossil checkouts of the Tcl core.

 * '''package-list'''

> Stream to stdout a list of all of the packages in the packages.tcl file, in a flat
    machine readable format.

~ Tk

This same mechanism will be adapted for Tk. Tk will be also provide a
'''pkgs/''' directory. Its base kits will be based on a modified ''wish''
instead of a modified ''tclsh''.

~ Normal Operation

During the build/install/etc phase each directory will be scanned for either
a "configure" file or a "prac.tcl" file. Standard TEA extensions will be detected
by the presence of a "configure" file. The prac.tcl file is a hint to the build
system that the package needs either additional instructions and guidance.
The contents of the file are interpreted by the object which is implementing the
extension's ambassador to the build system.

If one were to decide to bundle tcllib, and wished to exercise its SAK based installer
the prac.tcl file would read:

|# Implement the install routine for tcllib
|#

|oo::objdefine [self] {
|  method install DEST {
|    set pkg [my define get pkg_name [my define get name]]
|    my unpack
|    set prefix [string trimleft [my <project> define get prefix] /] 
|    set srcdir [my define get srcdir]
|    ::practcl::dotclexec [file join $srcdir installer.tcl] \
|      -pkg-path [file join $DEST $prefix lib $pkg] \ -no-examples -no-html -no-nroff \
|      -no-wait -no-gui -no-apps
|  }
|}

~ Maintaining the Package List

Each '''pkgs/''' file system for both Tcl and Tk will also contain a file
'''packages.tcl'''. This file will be human and machine readable. It contains
a description of every core distributed package, where the sources can be
found, as well as which fossil tags can be utilized as either development or
release with this particular version of the core.

'''packages.txt''' contains a series of keywords populating a data structure.

A simple example would by the tclconfig templates from TEA:

|EXTENSION tclconfig {
|   tag  trunk
|   fossil_url http://core.tcl.tk/tclconfig
|}

The EXTENSION keyword is intended to take the following arguments:

 > ''name'' ''key/value-configuration-dict''

~~Reserved keys

~~~tag
Source code management tag for the release bundled with this edition of the core

~~~fossil_url
If the extension is managed via fossil, a url that can be fed to "fossil clone". If no tag is specified "trunk" is assumed.

~~~git_url
If the extension is managed via git, a url that can be fed to "git clone". If no tag is specified "HEAD" is assumed

~~~file_url
If the extension is only available via source snapshot, a url where the file can be downloaded. 
Supported formats are .tar.gz and .zip.

The list is kept separate from the actual '''make.tcl''' so that users can simply
steal the list for making "batteries included" distributions. It also allows the package list
to remain distinct for each branch of the core.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|

|
|
|

|
|
|
|
|
|
|

|

|

|

|

|

|
|
|

|

|
|
|
|
|

|

|
|

|

|

|

|

|

|

|

|

|
|

|

|
<
>
|
|
|
|
|
|
|
|
|
<
<
|
>
>
|

|
|

|

|
|
|
<
>

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134

135
136
137
138
139
140
141
142
143
144

145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162

163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191

# TIP 453: Tcl Based Automation for tcl/pkgs

	Author:         Sean Woods <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        13-Sep-2016
	Post-History:   
	Keywords:       Build tooling
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes replacing the **make package** process currently employed
by the core with a Tcl-based build automation tool.

# Background

[[376]](376.md) provides for the distribution of third party packages in the Tcl/Tk core
distributions. To support that TIP currently requires three separate build
automations: a Makefile based automation in Unix, and both an nmake and
Makefile based automation for Windows. These automation systems can get out of
sync and they assume that their job is to build dynamic libraries for local
installation.

# The Pitch

By the time **make packages** has run, the local Tcl interpreter has been
built already. Rather than rely on delicate hacks and makefile tricks, core
distributed packages could be built and installed via exec commands inside a
Tcl script. In addition, this same automation could handle functions like
injecting a core distributed package into a virtual file system, as well as
bundling the Tcl/Tk library file system for [[430]](430.md).

# The Implementation

For the _practcl_ branch of tclconfig, I put together a 4000 line
self-contained package and kit building library of tools. This library is
TclOO based, and provides a rudimentary \(but functional\) system for templating
C code in Tcl, as well as a build system that is capable of nesting
sub-projects. It also steals the useful bits from the **fileutil** module of
tcllib, providing implementations for concatenating files, performing file
searches, and building a global package index from a soup of modules. The
library also has a wrapper to download external sources from fossil. It also
contains procs that can compile a static library, dynamic library, or self
contained shell directly from exec calls.

I would propose that this tool system \(or a new creation by the community in a
similar spirit\) be included in the library/ section of the tcl core. The
provisional name for this tool set would be **practcl**. A version of this
tool could also be provided in tcllib to allow 8.5 and 8.6 based cores to
continue to build extensions.

In the new scheme, **make packages** \(in all its forms\) would be replaced
with a call to "_$\(TCLSH\) $\{srcdir\}/pkgs/make.tcl build_". **make
packages-install** would be replaced by a call to "_$\(TCLSH\)
$\{srcdir\}/pkgs/make.tcl install_". For advanced users, these toplevel commands
**build** and **install** will accept additional arguments. For instance,
to install the core distributed packages into the VFS of a kit: "_$\(TCLSH\)
$\{srcdir\}/make.tcl install -destdir $\{MyVFS\}/lib_".

# pkgs/make.tcl

**make.tcl** would be maintained as part of the core, and provide the
top-level control system to build, install, or repackage the core distributed
extensions. That script will also provide mechanisms to populate the pkgs file
system for developers who build the tcl core from fossil checkouts.

Commands:

 * **basekit**

	 > Compile a ZipFs style basekit suitable for the **wrap** command.

 * **build** ?**all**? ?_package_? ?_package..._?

	 > Compile the source code for core distributed packages into binary products
   \(as applicable.\) If **all** is given, an attempt is made to compile all
   packages under _pkgs/_. Any other argument is interpreted to be the name
   of an individual package to be compiled.

 * **install** ?**-destdir** _destinationpath_?

	 > Install all core distributed packages locally. If **-destdir** is given,
   install the packages relative to _destinationpath_ in the same way that
   "**make DESTDIR=** _destdir_" would. If **-destdir** is not given, or
   is an empty string, perform an install relative to the **exec\_prefix** in
   **tclConfig.sh**

 * **wrap** _exename_ _vfspath_ ?_dir..._?

	 > Generate a self contained executable constructed from the virtual file
    system amalgamated from _vfspath_ and any other directories given as
    arguments. This VFS will automatically be populated with the
    **library/** file system from Tcl.

 * **distribution**

	> Download packages listed in the "packages.tcl" file, unpack their source 
    code in the /pkgs folder, and perform any steps required to prepare those
    extensions for inclusion in a core snapshot for distribution.

 * **developer**

	> Download packages listed in the "packages.tcl" file, unpack their source 
    code in the parent folder to the one the core has been unpacked from,
    and perform any steps required to prepare those extensions to be compiled
    locally as part of a developer build. This is intended for developers who work
    from fossil checkouts of the Tcl core.

 * **package-list**

	> Stream to stdout a list of all of the packages in the packages.tcl file, in a flat
    machine readable format.

# Tk

This same mechanism will be adapted for Tk. Tk will be also provide a
**pkgs/** directory. Its base kits will be based on a modified _wish_
instead of a modified _tclsh_.

# Normal Operation

During the build/install/etc phase each directory will be scanned for either
a "configure" file or a "prac.tcl" file. Standard TEA extensions will be detected
by the presence of a "configure" file. The prac.tcl file is a hint to the build
system that the package needs either additional instructions and guidance.
The contents of the file are interpreted by the object which is implementing the
extension's ambassador to the build system.

If one were to decide to bundle tcllib, and wished to exercise its SAK based installer
the prac.tcl file would read:

	# Implement the install routine for tcllib

	#
	oo::objdefine [self] {
	  method install DEST {
	    set pkg [my define get pkg_name [my define get name]]
	    my unpack
	    set prefix [string trimleft [my <project> define get prefix] /] 
	    set srcdir [my define get srcdir]
	    ::practcl::dotclexec [file join $srcdir installer.tcl] \
	      -pkg-path [file join $DEST $prefix lib $pkg] \ -no-examples -no-html -no-nroff \
	      -no-wait -no-gui -no-apps

	  }
	}

# Maintaining the Package List

Each **pkgs/** file system for both Tcl and Tk will also contain a file
**packages.tcl**. This file will be human and machine readable. It contains
a description of every core distributed package, where the sources can be
found, as well as which fossil tags can be utilized as either development or
release with this particular version of the core.

**packages.txt** contains a series of keywords populating a data structure.

A simple example would by the tclconfig templates from TEA:

	EXTENSION tclconfig {
	   tag  trunk
	   fossil_url http://core.tcl.tk/tclconfig

	}

The EXTENSION keyword is intended to take the following arguments:

 > _name_ _key/value-configuration-dict_

## Reserved keys

### tag
Source code management tag for the release bundled with this edition of the core

### fossil\_url
If the extension is managed via fossil, a url that can be fed to "fossil clone". If no tag is specified "trunk" is assumed.

### git\_url
If the extension is managed via git, a url that can be fed to "git clone". If no tag is specified "HEAD" is assumed

### file\_url
If the extension is only available via source snapshot, a url where the file can be downloaded. 
Supported formats are .tar.gz and .zip.

The list is kept separate from the actual **make.tcl** so that users can simply
steal the list for making "batteries included" distributions. It also allows the package list
to remain distinct for each branch of the core.

# Copyright

This document has been placed in the public domain.

Name change from tip/454.tip to tip/454.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48

49
50
51
52
53
54
55

56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

72
73
74
75
76
77
78
79

80
81
82
83
84
85
86
87
88
89
90
91

92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121

122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198

TIP:            454
Title:          Automatically Resize Frames After Last Child Removed
Version:        $Revision: 1.19 $
Author:         Harald Oehlmann <[email protected]>
Author:         Harald Oehlmann <[email protected]>
Author:         Francois Vogel <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        21-Sep-2016
Post-History:   
Keywords:       Tk
Tcl-Version:    8.6.6

~ Abstract

A '''frame'''-like widget has 1x1 required size if created.
If children are added by pack/grid and the last children is unpacked/grid, the frame-like widget does not return to the 1x1 required size.
Instead, it keeps the size of the last packed item.
It should automatically or under control resize to the initial requested size of 1x1.

~ Rationale

A '''frame''' keeping a size without reason just feels like a bug and mostly leads to unwanted results.

Mostly, it looks just ugly, but there are critical use-cases, specially in scrolled frames.

When the BWidget autoscroll package is used, which displays scrollbars on demand, the scrollbars do not disappear if the contents is gone.
And there is nothing, a programmer can do, as the Configure event does not fire on the scrolled frame widget.

Another example is the scrolledwindow example by Emiliano in ticket 2863003fff [https://core.tcl.tk/tk/info/12006979562649c9], where the solution 2 specific part may be removed (or is ignored).

A typical workaround is to configure the width/height manually after the last children was unmapped.
Unfortunately, this fact may not be determined for example by scrolling widgets etc. An eventual Configure binding is not firing.

~ Example

Here is an example to ilustrate the issue.
It consisting of a simple scrolling megawidget.
The megawidget exposes a frame where a user may pack or grid other widgets and the scrollbar is adjusted following the changing content.
This works well when widgets are added or removed. Only removing the last client will not update the scrollbar. With the proposed patch applied, it will update the scrollbar also when the last user widget is removed.

Please paste the code below to a wish console or execute it.
On startup it shows on the console:
|requested frame height: 1

Then press the "+" button to add a user widget. The console output is:
|+

|requested frame height: 100
Technically, the frame ".c.f.i1" was packed into the client frame ".c.f".
The client frame ".c.f" changes its requested size to hold the new child, which invokes the Convigure event and adjustes the scrolling region of the canvas.
The new scrolling region is shown graphically by the scrollbar.

Then press the "-" button to remove the user widget. The console output is:
|-

So, the child widget ".c.f.i1" is destroyed, but the frame ".c.f" does not rechange its requested size to 1x1 (initial value) but stays at 100x100 showing an empty plane.
The scrollbar is not updated and the megawidget has no possibility to adjust that (expect additional user action to inform that the last child was removed).

One may also try to add two childs and to remove them. It gets clear, that the widget is resized on removel if it is not the last widget.

With the proposed patch applied, the removal of the last widget would restore the initial frame size of 1x1 which would invoke the Configure event and the scrollbar would be adjusted.

|wm geometry . 90x90
|# Button to add box on scrolling canvas
|set itemNo 0
|pack [button .b1 -command newBox -text +] -side left -fill y
|proc newBox {} {
|    puts +
|    incr ::itemNo
|    pack  [frame .c.f.i$::itemNo -borderwidth 4 -relief raised -bg red -width 100 -height 100] -side top
|}

|# Button to remove box on scrolling canvas
|pack [button .b2 -command removeBox -text -] -side left -fill y
|proc removeBox {} {
|    puts -
|    if {$::itemNo == 0} {return}
|    destroy .c.f.i$::itemNo
|    incr ::itemNo -1
|}

|
|# This is the scrolling megawidget which exposes frame .c.f for users to pack or grid clients
|# It has no knowledge, when the user adds or removes clients, e.g. when +/- is pressed
|pack [scrollbar .s -command {.c yview}] -side right -fill y
|pack [canvas .c -borderwidth 0 -yscrollcommand {.s set}] -side left -fill both
|frame .c.f
|.c create window 0 0 -window .c.f -anchor nw -tags win
|proc frameConfigure {} {
|    set y [winfo reqheight .c.f]
|    puts "requested frame height: $y"
|    .c configure -scrollregion [list 0 0 100 $y]
|}

|frameConfigure
|bind .c.f <Configure> frameConfigure

~ Proposal

The proposal is to change request size of the frame automatically to 1x1 if the last children is unpacked/ungridded/destroyed.

Koen Dankart has passed a patch for this solution containing 3 additional lines of C code.
It is available in branch bug-d6b95ce492-alt: [http://core.tcl.tk/tk/timeline?r=bug-d6b95ce492-alt&nd&c=2016-09-22+09%3A16%3A21&n=200].

It just solves the issue by restoring initial size if the last children is unpacked/ungridded.

This is not backward compatible.
But that is a side effect of fixing this bug.

~ Rejected Proposal

Another proposal is to invoke the virtual event <<GeometryManager>> when the last children is unpacked/ungridded/destroyed.

Emiliano has provided a ticket 2863003fff [https://core.tcl.tk/tk/info/2863003fff] with the implementation in branch bug-d6b95ce492 [https://core.tcl.tk/tk/timeline?r=bug-d6b95ce492&nd&c=2016-09-21+06%3A32%3A55&n=200]:

The virtual event '''<<GeometryManager>>''' is defined which informs the master (a frame-like widget) that it has no child widget any more and that its size is not managed any more by grid/pack.

The program may bind to this event and resize to size 1x1:

|   bind .c <<GeometryManager>> "resizeFrame %W"
|   proc resizeFrame w {
|      $w configure -height 1 -width 1
|      $w configure -height 0 -width 0
|   }

Discussion:

   *   Backward compatible, no visual change to present code

   *   Requires additional code to fix the bug (auto-resizing to 1x1)

   *   The information of the last size is preserved and may be queried by the script (winfo reqheight)

   *   Other sizes may be set on this event

   *   May extend the first solution, maybe in another TIP.

   *   May be emulated by a Configure event and some code, if the first solution is used.

   *   Feels like a workaround to me.

~ Voted as yes and withdrawn due to implementation and compatibility issues

The TIP was voted yes. (only this section was added after voting)

Nevertheless, it was withdrawn due to the following compatibility issues:

   *   flickering

Moen wrote on the core list on 2016-10-26 18:53:

I have to say though that I'm getting less sure about this TIP. I found some comments in the code indicating that the old behaviour was not so much a design choice, but rather an implementation issue. Specifically, this comment in tkGrid.c:1735.

|    /*
|     * If the master has no slaves anymore, then don't do anything at all:
|     * just leave the master's size as-is. Otherwise there is no way to
|     * "relinquish" control over the master so another geometry manager can
|     * take over.
|     */

The current patch for TIP 454 bypasses this by doing the Tk_GeometryRequest() immediately instead of at idle time. The result is that another geometry manager can still take over, but it introduces some flickering (collapse + expand):

|  pack [text .t]
|  pack forget .t; grid .t
|  grid forget .t; pack .t

   *   Additional Configure

The patch introduces an additional Configure event where applications may not be aware of.
Brian worte on 2016-10-26 19:46:

It turns out that this "flicker" is also what is causing our tests to hang.  Our UI is a complex set of nested and tabbed panes where the implementation that manages the panes relies on a complex dance of <Configure>, <Map> and <Unmap> events to make the right things happen.  The consequence of adding one more <Configure> event is causing a hang at a [tkwait visibility $win] where $win never appears.  $win is a single child in a frame that is unmanaged and (re)managed based on various conditions.

The hang is easily solved, but that means that the behavior is different.  (the difference is not right or wrong, just different*)  This difference is also demonstrated in the textWind failures.  It could be asserted that these tk tests are confirming that the comment in the tkGrid.c has been faithfully implemented.

I cannot write just one piece of code that works in both 8.6.6 and 8.6.7.  I contend that this sort of thing is forbidden in a minor patch release, and questionable in a major patch relase (e.g. 8.7).  I'm struggling with this, but I think this kind of change might have to be deferred to 9.0.

*: the change in our code means removing some code.  From a perspective of "less is more", the patch is "better".

Due to those issues, the TIP is withdrawn.
A way to solve the issue in a compatible and working way is to use Emilianos additional virtual event, as described in the section 'rejected alternatives'.

~ Additional information and examples

   *   frame wiki page http://wiki.tcl.tk/frame

   *   Tk bug ticket [https://core.tcl.tk/tk/info/d6b95ce49207c823]

   *   Discussion on the core list [http://code.activestate.com/lists/tcl-core/16363/]

~ Compatibility

Fixing the issue breaks visual compatibility.  Nevertheless, as it is seen as a bug, this is OK.

~ Reference Implementation

Reference implementations are mentioned in the solution sections.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|
<
>
|

<
>
|
|

|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|

|

|

|

|

|

|
|
|
|
<
>

|

|

|

|

|
|
|
|
|
|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46

47
48
49
50
51
52
53

54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69

70
71
72
73
74
75
76
77

78
79
80
81
82
83
84
85
86
87
88
89

90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119

120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198

# TIP 454: Automatically Resize Frames After Last Child Removed

	Author:         Harald Oehlmann <[email protected]>
	Author:         Harald Oehlmann <[email protected]>
	Author:         Francois Vogel <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        21-Sep-2016
	Post-History:   
	Keywords:       Tk
	Tcl-Version:    8.6.6
-----

# Abstract

A **frame**-like widget has 1x1 required size if created.
If children are added by pack/grid and the last children is unpacked/grid, the frame-like widget does not return to the 1x1 required size.
Instead, it keeps the size of the last packed item.
It should automatically or under control resize to the initial requested size of 1x1.

# Rationale

A **frame** keeping a size without reason just feels like a bug and mostly leads to unwanted results.

Mostly, it looks just ugly, but there are critical use-cases, specially in scrolled frames.

When the BWidget autoscroll package is used, which displays scrollbars on demand, the scrollbars do not disappear if the contents is gone.
And there is nothing, a programmer can do, as the Configure event does not fire on the scrolled frame widget.

Another example is the scrolledwindow example by Emiliano in ticket 2863003fff <https://core.tcl.tk/tk/info/12006979562649c9> , where the solution 2 specific part may be removed \(or is ignored\).

A typical workaround is to configure the width/height manually after the last children was unmapped.
Unfortunately, this fact may not be determined for example by scrolling widgets etc. An eventual Configure binding is not firing.

# Example

Here is an example to ilustrate the issue.
It consisting of a simple scrolling megawidget.
The megawidget exposes a frame where a user may pack or grid other widgets and the scrollbar is adjusted following the changing content.
This works well when widgets are added or removed. Only removing the last client will not update the scrollbar. With the proposed patch applied, it will update the scrollbar also when the last user widget is removed.

Please paste the code below to a wish console or execute it.
On startup it shows on the console:
	requested frame height: 1

Then press the "\+" button to add a user widget. The console output is:

	+
	requested frame height: 100
Technically, the frame ".c.f.i1" was packed into the client frame ".c.f".
The client frame ".c.f" changes its requested size to hold the new child, which invokes the Convigure event and adjustes the scrolling region of the canvas.
The new scrolling region is shown graphically by the scrollbar.

Then press the "-" button to remove the user widget. The console output is:

	-
So, the child widget ".c.f.i1" is destroyed, but the frame ".c.f" does not rechange its requested size to 1x1 \(initial value\) but stays at 100x100 showing an empty plane.
The scrollbar is not updated and the megawidget has no possibility to adjust that \(expect additional user action to inform that the last child was removed\).

One may also try to add two childs and to remove them. It gets clear, that the widget is resized on removel if it is not the last widget.

With the proposed patch applied, the removal of the last widget would restore the initial frame size of 1x1 which would invoke the Configure event and the scrollbar would be adjusted.

	wm geometry . 90x90
	# Button to add box on scrolling canvas
	set itemNo 0
	pack [button .b1 -command newBox -text +] -side left -fill y
	proc newBox {} {
	    puts +
	    incr ::itemNo
	    pack  [frame .c.f.i$::itemNo -borderwidth 4 -relief raised -bg red -width 100 -height 100] -side top

	}
	# Button to remove box on scrolling canvas
	pack [button .b2 -command removeBox -text -] -side left -fill y
	proc removeBox {} {
	    puts -
	    if {$::itemNo == 0} {return}
	    destroy .c.f.i$::itemNo
	    incr ::itemNo -1

	}

	# This is the scrolling megawidget which exposes frame .c.f for users to pack or grid clients
	# It has no knowledge, when the user adds or removes clients, e.g. when +/- is pressed
	pack [scrollbar .s -command {.c yview}] -side right -fill y
	pack [canvas .c -borderwidth 0 -yscrollcommand {.s set}] -side left -fill both
	frame .c.f
	.c create window 0 0 -window .c.f -anchor nw -tags win
	proc frameConfigure {} {
	    set y [winfo reqheight .c.f]
	    puts "requested frame height: $y"
	    .c configure -scrollregion [list 0 0 100 $y]

	}
	frameConfigure
	bind .c.f <Configure> frameConfigure

# Proposal

The proposal is to change request size of the frame automatically to 1x1 if the last children is unpacked/ungridded/destroyed.

Koen Dankart has passed a patch for this solution containing 3 additional lines of C code.
It is available in branch bug-d6b95ce492-alt: <http://core.tcl.tk/tk/timeline?r=bug-d6b95ce492-alt&nd&c=2016-09-22+09%3A16%3A21&n=200> .

It just solves the issue by restoring initial size if the last children is unpacked/ungridded.

This is not backward compatible.
But that is a side effect of fixing this bug.

# Rejected Proposal

Another proposal is to invoke the virtual event <<GeometryManager>> when the last children is unpacked/ungridded/destroyed.

Emiliano has provided a ticket 2863003fff <https://core.tcl.tk/tk/info/2863003fff>  with the implementation in branch bug-d6b95ce492 <https://core.tcl.tk/tk/timeline?r=bug-d6b95ce492&nd&c=2016-09-21+06%3A32%3A55&n=200> :

The virtual event **<<GeometryManager>>** is defined which informs the master \(a frame-like widget\) that it has no child widget any more and that its size is not managed any more by grid/pack.

The program may bind to this event and resize to size 1x1:

	   bind .c <<GeometryManager>> "resizeFrame %W"
	   proc resizeFrame w {
	      $w configure -height 1 -width 1
	      $w configure -height 0 -width 0

	   }

Discussion:

   *   Backward compatible, no visual change to present code

   *   Requires additional code to fix the bug \(auto-resizing to 1x1\)

   *   The information of the last size is preserved and may be queried by the script \(winfo reqheight\)

   *   Other sizes may be set on this event

   *   May extend the first solution, maybe in another TIP.

   *   May be emulated by a Configure event and some code, if the first solution is used.

   *   Feels like a workaround to me.

# Voted as yes and withdrawn due to implementation and compatibility issues

The TIP was voted yes. \(only this section was added after voting\)

Nevertheless, it was withdrawn due to the following compatibility issues:

   *   flickering

Moen wrote on the core list on 2016-10-26 18:53:

I have to say though that I'm getting less sure about this TIP. I found some comments in the code indicating that the old behaviour was not so much a design choice, but rather an implementation issue. Specifically, this comment in tkGrid.c:1735.

	    /*
	     * If the master has no slaves anymore, then don't do anything at all:
	     * just leave the master's size as-is. Otherwise there is no way to
	     * "relinquish" control over the master so another geometry manager can
	     * take over.
	     */

The current patch for TIP 454 bypasses this by doing the Tk\_GeometryRequest\(\) immediately instead of at idle time. The result is that another geometry manager can still take over, but it introduces some flickering \(collapse \+ expand\):

	  pack [text .t]
	  pack forget .t; grid .t
	  grid forget .t; pack .t

   *   Additional Configure

The patch introduces an additional Configure event where applications may not be aware of.
Brian worte on 2016-10-26 19:46:

It turns out that this "flicker" is also what is causing our tests to hang.  Our UI is a complex set of nested and tabbed panes where the implementation that manages the panes relies on a complex dance of <Configure>, <Map> and <Unmap> events to make the right things happen.  The consequence of adding one more <Configure> event is causing a hang at a [tkwait visibility $win] where $win never appears.  $win is a single child in a frame that is unmanaged and \(re\)managed based on various conditions.

The hang is easily solved, but that means that the behavior is different.  \(the difference is not right or wrong, just different\*\)  This difference is also demonstrated in the textWind failures.  It could be asserted that these tk tests are confirming that the comment in the tkGrid.c has been faithfully implemented.

I cannot write just one piece of code that works in both 8.6.6 and 8.6.7.  I contend that this sort of thing is forbidden in a minor patch release, and questionable in a major patch relase \(e.g. 8.7\).  I'm struggling with this, but I think this kind of change might have to be deferred to 9.0.

*: the change in our code means removing some code.  From a perspective of "less is more", the patch is "better".

Due to those issues, the TIP is withdrawn.
A way to solve the issue in a compatible and working way is to use Emilianos additional virtual event, as described in the section 'rejected alternatives'.

# Additional information and examples

   *   frame wiki page <http://wiki.tcl.tk/frame>

   *   Tk bug ticket <https://core.tcl.tk/tk/info/d6b95ce49207c823> 

   *   Discussion on the core list <http://code.activestate.com/lists/tcl-core/16363/> 

# Compatibility

Fixing the issue breaks visual compatibility.  Nevertheless, as it is seen as a bug, this is OK.

# Reference Implementation

Reference implementations are mentioned in the solution sections.

# Copyright

This document has been placed in the public domain.

Name change from tip/455.tip to tip/455.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112

TIP:	455
Title:	 Extensions to [vwait]: Variable Sets and Scripted Access to Tcl_DoOneEvent
Version:	$Revision: 1.2 $
Author:	Christian Werner <[email protected]>
State:	Draft
Type:	Project
Tcl-Version:	8.7
Vote:	Pending
Created:	07-Oct-2016
Keywords:	Tcl, event loop
Post-History: 

~ Abstract

This TIP generalizes the '''vwait''' command to allow waiting on zero, one, or
more variable changes, on zero or more file events, to time-limit the wait,
and to control on which kinds of events is to be waited by partially exposing
the underlying API, namely ''Tcl_DoOneEvent()''.

~ Rationale

One remarkable property of Tcl is the ability to add traces, i.e., the
execution of callback functions, on certain operations affecting variables.
The '''vwait''' command combines the variable trace facility with the (a)
event loop, i.e., the (a) program main loop waiting and processing various
types of events generated by I/O channels, time, and internal activities of
the running Tcl program.

The current implementation of '''vwait''' allows to open an unconstrained
event loop which is terminated by writing or unsetting exactly one global
variable given as argument to '''vwait'''.

This proposal extends '''vwait''' to terminate its event loop on occurence of

 1. zero or more variable modifications

 2. readability/writability of zero or more file channels (usually sockets)

 3. an optional timeout specified as milliseconds as in the '''after'''
    command.

Additional flags to '''vwait''' control which types of events are to be dealt
with in its event loop, i.e. the underlying ''Tcl_DoOneEvent()'' API.

When more than one kind of active input (both, variables and file channels) is
involved in an instance of '''vwait''', another flag controls if any one of
the inputs or all inputs must have been activated in order to terminate the
event loop. This allows for scenarios, where '''vwait''' can be used as a kind
of "barrier" getting broken if all required conditions are fulfilled, i.e.,
axe, saw, spade, fire accelerant, water, in order to demolish, burn down,
extinct the glow, and finally bury the trellis-work fence.

However, in contrast to the illustrious demolition job, the order of occurence
of events breaking that "barrier" is indeterminate.

If the '''vwait''' is constrained by a timeout, and the time limit is reached,
its event loop terminates early and '''vwait''' indicates that timeout by a
negative integer result. Otherwise, the return value is the remaining number
of milliseconds (positive integer which can be zero) of the timeout
constraint. This property combined with [302] allows to implement (soft
real-time) control loops.

~ Proposal

The '''vwait''' command shall have the following signature:

 > '''vwait''' ''variable-name'' - well known and implemented behaviour

 > '''vwait''' ''options'' ?''variable-names''? - all available enhanced
    features; more than one variable name may be given, in which case the wait
    will terminate when any of the variables are written to (unless the
    '''-all''' option below is given)

where ''options'' are:

  --:                              indicates end of options

  -all:                            all (except timeout) conditions must be met

  -nofileevents:                   don't consider file events

  -noidleevents:                   don't consider idle events

  -notimerevents:                  don't consider timer events

  -nowindowevents:                 don't consider window system events

  -readable <chan>:                ''<chan>'' becomes readable

  -timeout <ms>: timeout in milliseconds; return the estimated number of
    milliseconds remaining in the wait.

  -writable <chan>:                ''<chan>'' becomes writable

The return value of '''vwait''' shall be the empty string except when the
'''-timeout''' option is in effect (see above).

Where the combination of options doesn't make sense, or even conflicts, an
appropriate error message shall be thrown, e.g., '''-timeout''' and
'''-notimerevents''' can't be specified at the same time.

If all event types except idle events are excluded, the event loop
(''Tcl_DoOneEvent'') shall be constrained by '''TCL_DONT_WAIT'''.

Interesting analogies to the '''update''' command can be infered: '''vwait
--''' is equivalent to '''update''', '''vwait -nofileevents -notimerevents
-nowindoevents''' is equivalent to '''update idletasks'''.

~ Copyright

This document has been placed in the public domain. In legislations where this
concept does not exist the CC0 license applies.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|

|

|

|

|

|
|

|
|

|

|
|

|
|
|

|

|

|

|

|
|

|

|

|

|

|
|

|
|

|

|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112

# TIP 455: Extensions to [vwait]: Variable Sets and Scripted Access to Tcl_DoOneEvent

	Author:	Christian Werner <[email protected]>
	State:	Draft
	Type:	Project
	Tcl-Version:	8.7
	Vote:	Pending
	Created:	07-Oct-2016
	Keywords:	Tcl, event loop
	Post-History: 
-----

# Abstract

This TIP generalizes the **vwait** command to allow waiting on zero, one, or
more variable changes, on zero or more file events, to time-limit the wait,
and to control on which kinds of events is to be waited by partially exposing
the underlying API, namely _Tcl\_DoOneEvent\(\)_.

# Rationale

One remarkable property of Tcl is the ability to add traces, i.e., the
execution of callback functions, on certain operations affecting variables.
The **vwait** command combines the variable trace facility with the \(a\)
event loop, i.e., the \(a\) program main loop waiting and processing various
types of events generated by I/O channels, time, and internal activities of
the running Tcl program.

The current implementation of **vwait** allows to open an unconstrained
event loop which is terminated by writing or unsetting exactly one global
variable given as argument to **vwait**.

This proposal extends **vwait** to terminate its event loop on occurence of

 1. zero or more variable modifications

 2. readability/writability of zero or more file channels \(usually sockets\)

 3. an optional timeout specified as milliseconds as in the **after**
    command.

Additional flags to **vwait** control which types of events are to be dealt
with in its event loop, i.e. the underlying _Tcl\_DoOneEvent\(\)_ API.

When more than one kind of active input \(both, variables and file channels\) is
involved in an instance of **vwait**, another flag controls if any one of
the inputs or all inputs must have been activated in order to terminate the
event loop. This allows for scenarios, where **vwait** can be used as a kind
of "barrier" getting broken if all required conditions are fulfilled, i.e.,
axe, saw, spade, fire accelerant, water, in order to demolish, burn down,
extinct the glow, and finally bury the trellis-work fence.

However, in contrast to the illustrious demolition job, the order of occurence
of events breaking that "barrier" is indeterminate.

If the **vwait** is constrained by a timeout, and the time limit is reached,
its event loop terminates early and **vwait** indicates that timeout by a
negative integer result. Otherwise, the return value is the remaining number
of milliseconds \(positive integer which can be zero\) of the timeout
constraint. This property combined with [[302]](302.md) allows to implement \(soft
real-time\) control loops.

# Proposal

The **vwait** command shall have the following signature:

 > **vwait** _variable-name_ - well known and implemented behaviour

 > **vwait** _options_ ?_variable-names_? - all available enhanced
    features; more than one variable name may be given, in which case the wait
    will terminate when any of the variables are written to \(unless the
    **-all** option below is given\)

where _options_ are:

  --:                              indicates end of options

  -all:                            all \(except timeout\) conditions must be met

  -nofileevents:                   don't consider file events

  -noidleevents:                   don't consider idle events

  -notimerevents:                  don't consider timer events

  -nowindowevents:                 don't consider window system events

  -readable <chan>:                _<chan>_ becomes readable

  -timeout <ms>: timeout in milliseconds; return the estimated number of
    milliseconds remaining in the wait.

  -writable <chan>:                _<chan>_ becomes writable

The return value of **vwait** shall be the empty string except when the
**-timeout** option is in effect \(see above\).

Where the combination of options doesn't make sense, or even conflicts, an
appropriate error message shall be thrown, e.g., **-timeout** and
**-notimerevents** can't be specified at the same time.

If all event types except idle events are excluded, the event loop
\(_Tcl\_DoOneEvent_\) shall be constrained by **TCL\_DONT\_WAIT**.

Interesting analogies to the **update** command can be infered: **vwait
--** is equivalent to **update**, **vwait -nofileevents -notimerevents
-nowindoevents** is equivalent to **update idletasks**.

# Copyright

This document has been placed in the public domain. In legislations where this
concept does not exist the CC0 license applies.

Name change from tip/456.tip to tip/456.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88

TIP:            456
Title:          Extend the C API to Support Passing Options to TCP Server Creation
Version:        $Revision: 1.6 $
Author:         LemonBoy <[email protected]>
Author:         lime boy <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        18-Nov-2016
Post-History:   
Keywords:       Tcl,socket,SO_REUSEPORT,SO_REUSEADDR
Tcl-Version:    8.7

~ Abstract

The '''Tcl_OpenTcpServer''' interface doesn't provide enough flexibility as
experienced during the implementation of the scaffolding necessary to support
the '''SO_REUSEPORT''' flag for sockets. This TIP adds that capability through
a new API function, '''Tcl_OpenTcpServerEx''', that takes extra options.

~ Rationale

Currently there's no way to pass extra informations to '''Tcl_OpenTcpServer'''
which is the function that does the heavy lifting by wrapping the socket
creation and connection phase.

For example, during the implementation of a '''-reuseport''' option for the
'''socket''' command, a roadblock was hit since informing
'''Tcl_OpenTcpServer''' about the presence of the new flag was only possible
via hacks such as exploiting the upper unused bits of the port parameter or
its sign bit.

A clean solution that also paves the way to the implementation of other
switches (such as one for the SO_REUSEADDR flag) is to introduce another
function named '''Tcl_OpenTcpServerEx''' whose signature closely matches the
'''Tcl_OpenTcpServer''' but allows passing a set of flags to customize its
behaviour.

Following the aforementioned changes to the C API the '''socket''' command is
enhanced with two new options allowing the user to take advantage of the newly
introduced flags.

~ Specification

A '''Tcl_OpenTcpServerEx''' function is introduced with the following
signature:

 > Tcl_Channel '''Tcl_OpenTcpServerEx'''(Tcl_Interp *''interp'', const char *
    ''service'', const char *''myHost'', unsigned int ''flags'', 
    Tcl_TcpAcceptProc *''acceptProc'', ClientData ''acceptProcData'')

Most arguments are identical to '''Tcl_OpenTcpServer''' with the exception of
the ''port'' parameter being replaced by the ''service'' one taking a string
instead of an integer.  Two entries for the ''flags'' bitset are defined by this 
TIP:

 * '''TCL_TCPSERVER_REUSEADDR''' - indicate that the socket flag SO_REUSEADDR (or
   equivalent) should be set.

 * '''TCL_TCPSERVER_REUSEPORT''' - indicate that the socket flag SO_REUSEPORT (or
   equivalent) should be set.

The '''Tcl_OpenTcpServer''' function is then rewritten to be an alias of
'''Tcl_OpenTcpServerEx''' with the '''flags''' parameter set by default to
TCL_TCPSERVER_REUSEADDR so that we keep the API and behaviour compatible with the
previous Tcl versions.

As for the Tcl side, the '''socket''' command gains two new optional switches
that are only valid for server sockets: '''?-reuseaddr boolean?''' and
'''?-reuseport boolean?''', both accepting a boolean argument to either turn off
or on the selected behaviour.

~ Reference Implementation

Please refer to the ''tip-456'' branch of the core Tcl repository.

~ Backwards Compatibility

Since '''Tcl_OpenTcpServer''' can be easily re-implemented in terms of 
'''Tcl_OpenTcpServerEx''' the old behaviour is retained.

The '''socket''' command defaults to '''-reuseaddr''' set to ''yes'' as it was
already doing before, the user is now able to turn off this behaviour by using
'''-reuseaddr no'''.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|
|

|

|

|
|
|

|
|
|

|

|

|

|
|
|

|
|
|

|
|

|
|

|
|
|

|
|
|

|

|

|

|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88

# TIP 456: Extend the C API to Support Passing Options to TCP Server Creation

	Author:         LemonBoy <[email protected]>
	Author:         lime boy <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        18-Nov-2016
	Post-History:   
	Keywords:       Tcl,socket,SO_REUSEPORT,SO_REUSEADDR
	Tcl-Version:    8.7
-----

# Abstract

The **Tcl\_OpenTcpServer** interface doesn't provide enough flexibility as
experienced during the implementation of the scaffolding necessary to support
the **SO\_REUSEPORT** flag for sockets. This TIP adds that capability through
a new API function, **Tcl\_OpenTcpServerEx**, that takes extra options.

# Rationale

Currently there's no way to pass extra informations to **Tcl\_OpenTcpServer**
which is the function that does the heavy lifting by wrapping the socket
creation and connection phase.

For example, during the implementation of a **-reuseport** option for the
**socket** command, a roadblock was hit since informing
**Tcl\_OpenTcpServer** about the presence of the new flag was only possible
via hacks such as exploiting the upper unused bits of the port parameter or
its sign bit.

A clean solution that also paves the way to the implementation of other
switches \(such as one for the SO\_REUSEADDR flag\) is to introduce another
function named **Tcl\_OpenTcpServerEx** whose signature closely matches the
**Tcl\_OpenTcpServer** but allows passing a set of flags to customize its
behaviour.

Following the aforementioned changes to the C API the **socket** command is
enhanced with two new options allowing the user to take advantage of the newly
introduced flags.

# Specification

A **Tcl\_OpenTcpServerEx** function is introduced with the following
signature:

 > Tcl\_Channel **Tcl\_OpenTcpServerEx**\(Tcl\_Interp \*_interp_, const char \*
    _service_, const char \*_myHost_, unsigned int _flags_, 
    Tcl\_TcpAcceptProc \*_acceptProc_, ClientData _acceptProcData_\)

Most arguments are identical to **Tcl\_OpenTcpServer** with the exception of
the _port_ parameter being replaced by the _service_ one taking a string
instead of an integer.  Two entries for the _flags_ bitset are defined by this 
TIP:

 * **TCL\_TCPSERVER\_REUSEADDR** - indicate that the socket flag SO\_REUSEADDR \(or
   equivalent\) should be set.

 * **TCL\_TCPSERVER\_REUSEPORT** - indicate that the socket flag SO\_REUSEPORT \(or
   equivalent\) should be set.

The **Tcl\_OpenTcpServer** function is then rewritten to be an alias of
**Tcl\_OpenTcpServerEx** with the **flags** parameter set by default to
TCL\_TCPSERVER\_REUSEADDR so that we keep the API and behaviour compatible with the
previous Tcl versions.

As for the Tcl side, the **socket** command gains two new optional switches
that are only valid for server sockets: **?-reuseaddr boolean?** and
**?-reuseport boolean?**, both accepting a boolean argument to either turn off
or on the selected behaviour.

# Reference Implementation

Please refer to the _tip-456_ branch of the core Tcl repository.

# Backwards Compatibility

Since **Tcl\_OpenTcpServer** can be easily re-implemented in terms of 
**Tcl\_OpenTcpServerEx** the old behaviour is retained.

The **socket** command defaults to **-reuseaddr** set to _yes_ as it was
already doing before, the user is now able to turn off this behaviour by using
**-reuseaddr no**.

# Copyright

This document has been placed in the public domain.

Name change from tip/457.tip to tip/457.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115

116
117
118
119
120
121
122
123
124
125
126
127
128
129

130
131

132
133

134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172

173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396

TIP:            457
Title:          Add Support for Named Arguments
Version:        $Revision: 1.22 $
Author:         Mathieu Lafon <[email protected]>
Author:         Andreas Leitgeb <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        21-Nov-2016
Post-History:   
Keywords:       Tcl,procedure,argument handling
Tcl-Version:    8.7

~ Abstract

This TIP proposes an enhancement of the Tcl language to support named
arguments and additional features when calling a procedure.

~ Rationale

The naming of arguments to procedures is a computer language feature which
allow developers to specify the name of an argument when calling a function.
This is especially useful when dealing with arguments with default values, as
this does not require to specify all previous arguments when only one argument
is required to be specified.

As such, this is a commonly requested feature by Tcl developers, who have
created various code snippets [http://wiki.tcl.tk/10702] to simulate it. These
snippets have drawbacks: not intuitive for new users, require to add extra
code at the start of each procedure, no standard on the format to use, few
errors handling, etc.

After discussing various possibilities with the community, it has been
decided to extend the argument specification of the ''proc'' command
and allow users to define options on arguments. This can be used to
support named arguments but also add additional enhancements:
flag arguments, pass-by-name (''upvar'') arguments, non-required
arguments, ...

The others possibilities discussed are detailed in the ''Discussion''
section at the end of the document.

~ Specification

The ''proc'' documentation currently define argument specifiers as a list
of one or two fields where the first field is the name of the argument and
the optional second field is its default value.

The proposed modification is to support an alternate specifier format where
the first field is also the name of the argument, followed by a paired list
of options and their values. This format does not prevent the original format
to be used as they can be easily distinguished: the new format uses an
odd size list with a minimal size of three fields.

~~ Available argument specifiers

The following argument specifiers are defined in this TIP:

 * '''-default VALUE''' defines the default value for the argument.
   It is ignored if ''-required 1'' is also used.

|% proc p { { a -default {} } } { list a $a }
|% p
|a {}
|% p foo
|a foo

 * '''-name NAME''' defines the argument to be a named argument.
   NAME defines the name of the argument when it is  defined as a
   single string. If NAME is a list of strings, it is the list of
   names that can be used to refer to the argument (i.e. aliases).
   On the call-site, the name of the argument is prefixed by a single dash
   and followed by the value.

|% proc p1 { { v -name val } } { list v $v }
|% p1 -val 1
|v 1

|% proc p2 { { v -name {v val value} } } { list v $v }
|% p2 -value 2
|v 2
|% p2 -v 2
|v 2

 * '''-switch SWITCHES''' defines that the argument is defined on the
   call-site as a flag-only/switch parameter. SWITCHES is a list of
   possible switches. Each switch is defined either as a single string
   (switch name) or as a list of two entries (switch name and related
   value). On the call-site, the name of the switch is prefixed by a
   single dash and is not followed by any value. The value assigned to
   the argument is either the switch name or the related value depending
   on how it was defined.

|% proc p { { dbg -default 0 -switch debug } } { list dbg $dbg }
|% p
|dbg 0
|% p -debug
|dbg debug

|% proc p { { level -switch {{quiet 0} {verbose 9}} } { list level $level }
|% p -quiet
|level 0
|% p -verbose
|level 9

 * '''-required BOOLEAN''' defines that the value is required to be set.
   If set to true, the argument is required and any default  value is
   ignored. It is the default handling for non-named argument without a
   default value. If set to false, the argument is not required to be set
   and the related argument will be left unset if there is no default value.
   It is the default  handling for named argument.  

|% proc p { { v -required 0 }  } {
|    if {[info exist v]} {list v $v} {return "v is unset"}
|  }

|% p 5
|v 5
|% p
|v is unset

 * '''-upvar LEVEL''' defines, that the local argument will become an
   alias to the variable in the frame at level LEVEL corresponding to
   the parameter value. This is similar to what is achieved when using
   the ''upvar'' command.
   This  specifier is incompatible with the ''-switch'' specifier.

|% proc p { { v -upvar 1 } } { incr v }
|% set a 2
|2

|% p a
|3

|% set a
|3

Further argument specifiers may be added in future TIP. Examples of
new argument specifiers which may be added in the future:

 * type assertion ('''-assume TYPE''')

 * argument documentation ('''-docstring DOC''')

 * ...

~~ Named arguments

The following rules define how named arguments are expected to be specified
on the call-site:

 * Named arguments must always be specified using their name, they can't be
   specified as positional arguments.

|% proc p { {a -name A} } { list a $a }
|% p aa
|wrong # args: should be "p |-A a|"
|% p -A aa
|a aa

 * When several names (using ''-name'' or ''-switch'' options) are
   specified for the same argument, only one is required to be used on
   the call-site, unless a default value is also specified. If more than
   one is used, the latest value/switch is kept.

|% proc p { { v -name {v val} } } { list v $v }
|% p -v 6 -val 8
|v 8

 * Both ''-name'' and ''-switch'' specifiers can be used on the same
   argument.

|% proc p { { level -name level -switch {{quiet 0} {verbose 9}} } {
|    list level $level
|  }

|% p -level 4
|level 4
|% p -verbose
|level 9

 * A group of contiguous named arguments are handled together and are not
   required to be specified in the same order as defined.

|% proc p { {a -name A} {b -name B} } { list a $a b $b }
|% p -B bb -A aa
|% a aa b bb

 * The handling of a group of contiguous named arguments (which can be
   only one argument) is ended on the first argument which is either
   a parameter not starting with a dash or the special ''--'' end-of-options
   marker. Remaining arguments will then be assigned to following positional
   arguments.

|% proc p { {o -name opt} args } { list o $o args $args }
|% p -opt O 5
|o O args 5
|% p -opt O -1 0
|wrong # args: should be "p |-opt o| ?arg ...?"
|% p -opt O -- -1 0
|o O args {-1 0}

 * If there is a fixed number of non-optional positional arguments and no
   special ''args'' variable after the named group, the handling of a named
   group will also be ended when the remaining arguments to assign
   will be equal to the number of positional arguments after the group.

|% proc p { {o -name opt} posarg } { list o $o posarg $posarg }
|% p -opt O -1
|o O posarg -1

~~ Generated usage description

The error message, automatically generated when the input arguments are
invalid, is updated regarding new options:

 * Pass-by-name arguments (specified using ''-upvar level'' option) are
   surrounded by the '&' character.

|% proc p { { v -upvar 1 } } { }
|% p
|wrong # args: should be "p &v&"

 * Named arguments are showed how they should be called and surounded
   by the '|' character. If several names have been specified,
   they are grouped together.

|% proc p { { l -name level -switch {high low} -required 1} } {}
|% p
|wrong # args: should be "p |-level l|-high|-low|"

 * When an argument is optional, '?' is used.

|% proc p { { v -name var } a } {}
|% p
|wrong # args: should be "p ?|-var v|? a"

~~ Introspection

The ''info argspec proc'' command is added to get the argument specification
of all arguments or of a specific argument.

|% proc p { a { b 1 } { c -name c } } {}
|% info argspec proc p
|a { b -default 1 } { c -name c }
|% info argspec proc p c
|-name c

Similar ''info argspec'' subcommands are also added for lambda, object
method and object constructor.

The ''info argspec specifiers'' command is added to get the specifiers 
supported by the current interpreter.

|% info argspec specifiers
|-default -name -required -switch -upvar

~~ Other use cases

Extended argument specifiers can also be used with other ''proc''-like
functions. The following functions are supported and can use extended
argument specifiers:

 * anonymous functions (lambda), used with ''apply'' command ;

 * TclOO constructor or methods.

~~ Performance

The proposed modification has no significant performance impact:

 * existing code (and code not using extended argspec) is not impacted
   by the change as the current initialisation code is still available
   and used ;

 * code using extended argspec ''may'' be impacted because the
   initialisation code is different and is required to loop on each
   argument, but initial testing does not show a significant slowdown.

When using named arguments specifiers to replace a similar handling done
in Tcl-pure code, there is however a significant increase in performance.

See [https://gist.github.com/mlafon/70480877a28f3571e0377eabc0cee7be]
for details on performance testing done on the proposed implementation.

~ Implementation

This document proposes the following changes to the Tcl core:

 1. Add ExtendedArgSpec structure which is linked from CompiledLocal
    and contains information about extended argument specification;

 2. Add a flags field in the Proc structure to later identify a proc
    with at least one argument defined with an extended argument
    specification (PROC_HAS_EXT_ARG_SPEC);

 3. Update proc creation to handle the extended argument specification
    and fill the ExtendedArgSpec structure;

 4. Update InitArgsAndLocals to initialize the compiled locals using
    a dedicated function if the PROC_HAS_EXT_ARG_SPEC flag has been
    set on the proc. If unset, the original initialization code is
    still used.

 5. Update ProcWrongNumArgs to generate an appropriate error message
    when an argument has been defined using an extended argument
    specification;

 6. Add ''info argspec'' command;

 7. Update documentation in doc/proc.n and doc/info.n;

 8. Update impacted tests and add dedicated tests in tests/proc-enh.test.

~~ Reference Implementation

The reference implementation is available in the tip-457
[http://core.tcl.tk/tcl/timeline?r=tip-457] branch.

The code is licensed under the BSD license.

~ Discussion

This section details some of the alternate solutions for this feature or
specific comments about it.

Initial approaches that tried to work with unmodified procedures are
not detailed here for clarity.

~~ Dedicated builtin command

A dedicated command can be used to handle the named arguments, using an
''-option value'' syntax, before calling the target procedures with all
arguments correctly prepared.

|% call -opts myproc -optC foo -optB {5 5} -- "some pos arg"

An implementation of this proposal is available at
[https://github.com/mlafon/tcl/tree/457-CALL-CMD]. This proposal was
abandoned as it was not enough intuitive for users.

~~ Modification in how proc are defined

Tcl-pure procedures can be defined in a way which state that the procedure
will automatically handle ''-option value'' arguments.

|% proc -np myproc { varA { optB defB } { optC defC } { optD defD } args } { .. }
|% myproc -optC foo -optB {5 5} -- "some pos arg"

An other possibility is to support options on arguments and allow name
specification:

|% proc myproc { varA { optB -default defB -name B } args } { .. }
|% myproc a -B b zz

This is the currently proposed solution in this TIP. It requires the
procedures to be modified but allow additional features.

Some people have expressed concern about the modification of the ''proc''
command, which is a core command of Tcl. A particular attention has been paid
to ensure that existing code will not be impacted and that future usage could
be later added by adding new specifiers.

~~ Argument Parsing command

Cyan Ogilvie's paper from Tcl2016
[https://www.tcl.tk/community/tcl2016/assets/talk33/parse_args-paper.pdf]
describes a C extension to provide core-like argument parsing at speed
comparable to ''proc'' argument handling, in a terse and self-documenting
way.

Alexandre Ferrieux has proposed
[http://code.activestate.com/lists/tcl-core/18447/] to use the same
argument specifiers than this proposal, but with a dedicated command which
can be called from the proc body. This has the advantage to not alter the
''proc'' command and could be located in an extension.

Although the ''proc'' usage will not be modified, this new command will
probably have to access or modify internal proc structures, for example
to support introspection.

Having to declare final local variables in the body, also seems confusing
for users.

~~ Preventing Data-dependent bugs

It has been proposed by Christian Gollwitzer
[http://code.activestate.com/lists/tcl-core/18457/] to make the special '--'
end-of-options marker mandatory when the number of positional arguments after
the named group is not fixed. This would suppress any potential Data-Dependent
bugs related to the search of the initial dash and remove any unwanted object
stringification, at the expense of forcing the user to explicitely use
the end-of-option marker.

This proposal is currently not implemented but the documentation has been
modified to list the cases for which '--' should be use.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|
|

|
|
|
|
|

|

|

|
|
|

|
|
|
|
|

|

|
|

|
|
|
|
|

|
|
|
|
|

|

|
|
<
>
|
|
|
|

|

|
|

|
|
<
>
|
<
>
|
<
>

|

|

|

|
|
|
|
|

|

|
|
|

|

|
|
<
>
|
|
|
|

|
|
|

|
|
|

|
|
|
|
|
|
|

|

|
|
|

|

|

|
|
|

|

|
|
|

|
|
|

|

|

|
|
|
|
|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|
|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113

114
115
116
117
118
119
120
121
122
123
124
125
126
127

128
129

130
131

132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170

171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396

# TIP 457: Add Support for Named Arguments

	Author:         Mathieu Lafon <[email protected]>
	Author:         Andreas Leitgeb <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        21-Nov-2016
	Post-History:   
	Keywords:       Tcl,procedure,argument handling
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes an enhancement of the Tcl language to support named
arguments and additional features when calling a procedure.

# Rationale

The naming of arguments to procedures is a computer language feature which
allow developers to specify the name of an argument when calling a function.
This is especially useful when dealing with arguments with default values, as
this does not require to specify all previous arguments when only one argument
is required to be specified.

As such, this is a commonly requested feature by Tcl developers, who have
created various code snippets <http://wiki.tcl.tk/10702>  to simulate it. These
snippets have drawbacks: not intuitive for new users, require to add extra
code at the start of each procedure, no standard on the format to use, few
errors handling, etc.

After discussing various possibilities with the community, it has been
decided to extend the argument specification of the _proc_ command
and allow users to define options on arguments. This can be used to
support named arguments but also add additional enhancements:
flag arguments, pass-by-name \(_upvar_\) arguments, non-required
arguments, ...

The others possibilities discussed are detailed in the _Discussion_
section at the end of the document.

# Specification

The _proc_ documentation currently define argument specifiers as a list
of one or two fields where the first field is the name of the argument and
the optional second field is its default value.

The proposed modification is to support an alternate specifier format where
the first field is also the name of the argument, followed by a paired list
of options and their values. This format does not prevent the original format
to be used as they can be easily distinguished: the new format uses an
odd size list with a minimal size of three fields.

## Available argument specifiers

The following argument specifiers are defined in this TIP:

 * **-default VALUE** defines the default value for the argument.
   It is ignored if _-required 1_ is also used.

		% proc p { { a -default {} } } { list a $a }
		% p
		a {}
		% p foo
		a foo

 * **-name NAME** defines the argument to be a named argument.
   NAME defines the name of the argument when it is  defined as a
   single string. If NAME is a list of strings, it is the list of
   names that can be used to refer to the argument \(i.e. aliases\).
   On the call-site, the name of the argument is prefixed by a single dash
   and followed by the value.

		% proc p1 { { v -name val } } { list v $v }
		% p1 -val 1
		v 1

		% proc p2 { { v -name {v val value} } } { list v $v }
		% p2 -value 2
		v 2
		% p2 -v 2
		v 2

 * **-switch SWITCHES** defines that the argument is defined on the
   call-site as a flag-only/switch parameter. SWITCHES is a list of
   possible switches. Each switch is defined either as a single string
   \(switch name\) or as a list of two entries \(switch name and related
   value\). On the call-site, the name of the switch is prefixed by a
   single dash and is not followed by any value. The value assigned to
   the argument is either the switch name or the related value depending
   on how it was defined.

		% proc p { { dbg -default 0 -switch debug } } { list dbg $dbg }
		% p
		dbg 0
		% p -debug
		dbg debug

		% proc p { { level -switch {{quiet 0} {verbose 9}} } { list level $level }
		% p -quiet
		level 0
		% p -verbose
		level 9

 * **-required BOOLEAN** defines that the value is required to be set.
   If set to true, the argument is required and any default  value is
   ignored. It is the default handling for non-named argument without a
   default value. If set to false, the argument is not required to be set
   and the related argument will be left unset if there is no default value.
   It is the default  handling for named argument.  

		% proc p { { v -required 0 }  } {
		    if {[info exist v]} {list v $v} {return "v is unset"}

		  }
		% p 5
		v 5
		% p
		v is unset

 * **-upvar LEVEL** defines, that the local argument will become an
   alias to the variable in the frame at level LEVEL corresponding to
   the parameter value. This is similar to what is achieved when using
   the _upvar_ command.
   This  specifier is incompatible with the _-switch_ specifier.

		% proc p { { v -upvar 1 } } { incr v }
		% set a 2

		2
		% p a

		3
		% set a

		3

Further argument specifiers may be added in future TIP. Examples of
new argument specifiers which may be added in the future:

 * type assertion \(**-assume TYPE**\)

 * argument documentation \(**-docstring DOC**\)

 * ...

## Named arguments

The following rules define how named arguments are expected to be specified
on the call-site:

 * Named arguments must always be specified using their name, they can't be
   specified as positional arguments.

		% proc p { {a -name A} } { list a $a }
		% p aa
		wrong # args: should be "p |-A a|"
		% p -A aa
		a aa

 * When several names \(using _-name_ or _-switch_ options\) are
   specified for the same argument, only one is required to be used on
   the call-site, unless a default value is also specified. If more than
   one is used, the latest value/switch is kept.

		% proc p { { v -name {v val} } } { list v $v }
		% p -v 6 -val 8
		v 8

 * Both _-name_ and _-switch_ specifiers can be used on the same
   argument.

		% proc p { { level -name level -switch {{quiet 0} {verbose 9}} } {
		    list level $level

		  }
		% p -level 4
		level 4
		% p -verbose
		level 9

 * A group of contiguous named arguments are handled together and are not
   required to be specified in the same order as defined.

		% proc p { {a -name A} {b -name B} } { list a $a b $b }
		% p -B bb -A aa
		% a aa b bb

 * The handling of a group of contiguous named arguments \(which can be
   only one argument\) is ended on the first argument which is either
   a parameter not starting with a dash or the special _--_ end-of-options
   marker. Remaining arguments will then be assigned to following positional
   arguments.

		% proc p { {o -name opt} args } { list o $o args $args }
		% p -opt O 5
		o O args 5
		% p -opt O -1 0
		wrong # args: should be "p |-opt o| ?arg ...?"
		% p -opt O -- -1 0
		o O args {-1 0}

 * If there is a fixed number of non-optional positional arguments and no
   special _args_ variable after the named group, the handling of a named
   group will also be ended when the remaining arguments to assign
   will be equal to the number of positional arguments after the group.

		% proc p { {o -name opt} posarg } { list o $o posarg $posarg }
		% p -opt O -1
		o O posarg -1

## Generated usage description

The error message, automatically generated when the input arguments are
invalid, is updated regarding new options:

 * Pass-by-name arguments \(specified using _-upvar level_ option\) are
   surrounded by the '&' character.

		% proc p { { v -upvar 1 } } { }
		% p
		wrong # args: should be "p &v&"

 * Named arguments are showed how they should be called and surounded
   by the '\|' character. If several names have been specified,
   they are grouped together.

		% proc p { { l -name level -switch {high low} -required 1} } {}
		% p
		wrong # args: should be "p |-level l|-high|-low|"

 * When an argument is optional, '?' is used.

		% proc p { { v -name var } a } {}
		% p
		wrong # args: should be "p ?|-var v|? a"

## Introspection

The _info argspec proc_ command is added to get the argument specification
of all arguments or of a specific argument.

	% proc p { a { b 1 } { c -name c } } {}
	% info argspec proc p
	a { b -default 1 } { c -name c }
	% info argspec proc p c
	-name c

Similar _info argspec_ subcommands are also added for lambda, object
method and object constructor.

The _info argspec specifiers_ command is added to get the specifiers 
supported by the current interpreter.

	% info argspec specifiers
	-default -name -required -switch -upvar

## Other use cases

Extended argument specifiers can also be used with other _proc_-like
functions. The following functions are supported and can use extended
argument specifiers:

 * anonymous functions \(lambda\), used with _apply_ command ;

 * TclOO constructor or methods.

## Performance

The proposed modification has no significant performance impact:

 * existing code \(and code not using extended argspec\) is not impacted
   by the change as the current initialisation code is still available
   and used ;

 * code using extended argspec _may_ be impacted because the
   initialisation code is different and is required to loop on each
   argument, but initial testing does not show a significant slowdown.

When using named arguments specifiers to replace a similar handling done
in Tcl-pure code, there is however a significant increase in performance.

See <https://gist.github.com/mlafon/70480877a28f3571e0377eabc0cee7be> 
for details on performance testing done on the proposed implementation.

# Implementation

This document proposes the following changes to the Tcl core:

 1. Add ExtendedArgSpec structure which is linked from CompiledLocal
    and contains information about extended argument specification;

 2. Add a flags field in the Proc structure to later identify a proc
    with at least one argument defined with an extended argument
    specification \(PROC\_HAS\_EXT\_ARG\_SPEC\);

 3. Update proc creation to handle the extended argument specification
    and fill the ExtendedArgSpec structure;

 4. Update InitArgsAndLocals to initialize the compiled locals using
    a dedicated function if the PROC\_HAS\_EXT\_ARG\_SPEC flag has been
    set on the proc. If unset, the original initialization code is
    still used.

 5. Update ProcWrongNumArgs to generate an appropriate error message
    when an argument has been defined using an extended argument
    specification;

 6. Add _info argspec_ command;

 7. Update documentation in doc/proc.n and doc/info.n;

 8. Update impacted tests and add dedicated tests in tests/proc-enh.test.

## Reference Implementation

The reference implementation is available in the tip-457
<http://core.tcl.tk/tcl/timeline?r=tip-457>  branch.

The code is licensed under the BSD license.

# Discussion

This section details some of the alternate solutions for this feature or
specific comments about it.

Initial approaches that tried to work with unmodified procedures are
not detailed here for clarity.

## Dedicated builtin command

A dedicated command can be used to handle the named arguments, using an
_-option value_ syntax, before calling the target procedures with all
arguments correctly prepared.

	% call -opts myproc -optC foo -optB {5 5} -- "some pos arg"

An implementation of this proposal is available at
<https://github.com/mlafon/tcl/tree/457-CALL-CMD> . This proposal was
abandoned as it was not enough intuitive for users.

## Modification in how proc are defined

Tcl-pure procedures can be defined in a way which state that the procedure
will automatically handle _-option value_ arguments.

	% proc -np myproc { varA { optB defB } { optC defC } { optD defD } args } { .. }
	% myproc -optC foo -optB {5 5} -- "some pos arg"

An other possibility is to support options on arguments and allow name
specification:

	% proc myproc { varA { optB -default defB -name B } args } { .. }
	% myproc a -B b zz

This is the currently proposed solution in this TIP. It requires the
procedures to be modified but allow additional features.

Some people have expressed concern about the modification of the _proc_
command, which is a core command of Tcl. A particular attention has been paid
to ensure that existing code will not be impacted and that future usage could
be later added by adding new specifiers.

## Argument Parsing command

Cyan Ogilvie's paper from Tcl2016
<https://www.tcl.tk/community/tcl2016/assets/talk33/parse_args-paper.pdf> 
describes a C extension to provide core-like argument parsing at speed
comparable to _proc_ argument handling, in a terse and self-documenting
way.

Alexandre Ferrieux has proposed
<http://code.activestate.com/lists/tcl-core/18447/>  to use the same
argument specifiers than this proposal, but with a dedicated command which
can be called from the proc body. This has the advantage to not alter the
_proc_ command and could be located in an extension.

Although the _proc_ usage will not be modified, this new command will
probably have to access or modify internal proc structures, for example
to support introspection.

Having to declare final local variables in the body, also seems confusing
for users.

## Preventing Data-dependent bugs

It has been proposed by Christian Gollwitzer
<http://code.activestate.com/lists/tcl-core/18457/>  to make the special '--'
end-of-options marker mandatory when the number of positional arguments after
the named group is not fixed. This would suppress any potential Data-Dependent
bugs related to the search of the initial dash and remove any unwanted object
stringification, at the expense of forcing the user to explicitely use
the end-of-option marker.

This proposal is currently not implemented but the documentation has been
modified to list the cases for which '--' should be use.

# Copyright

This document has been placed in the public domain.

Name change from tip/458.tip to tip/458.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83

TIP:            458
Title:          Add Support for epoll() and kqueue() in the Notifier
Version:        $Revision: 1.6 $
Author:         Lucio Andr�s Illanes Albornoz <[email protected]>
Author:         Lucio Andr�s Illanes Albornoz <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        24-Nov-2016
Post-History:   
Keywords:       event loop,scalability
Tcl-Version:    8.7

~ Abstract

This TIP proposes to replace ''select''(2) in the notifier implementation with ''epoll''(7) and ''kqueue''(2) on Linux and DragonFly-, Free-, Net-, and OpenBSD respectively. This is to remove a major bottleneck in the ability of Tcl to scale up to thousands and tens of thousands of sockets (aka '''C10K''').
Furthermore, this should also provide sufficient infrastructure in order to permit adding support for other platform-specific event mechanisms in the future, such as IOCPs on Solaris and Windows.

~ Rationale

The drawbacks associated with ''poll''(2) and ''select''(2) and the tremendously improved ability to scale of ''epoll''(7) and ''kqueue''(2) are well-known [https://en.wikipedia.org/wiki/C10k_problem]; a previous attempt at implementing this feature elaborates on this subject and can be found at [https://sourceforge.net/p/tcl/mailman/tcl-core/?viewmonth=200909&viewday=10].

Initially, the notifier thread was retained to provide for event notification and inter-thread IPC. This eventually proved unnecessary and thus the ''epoll''(7)/''kqueue''(2) source modules now no longer contain the notifier thread and its infrastructure, particularly as this also reduces code size and complexity.

Threads that intend to wait on one or more file descriptors they own will now directly call ''epoll_wait''(2)/''kevent''(2) themselves during ''Tcl_WaitForEvent''().  Inter-thread IPC is provided for by a per-thread trigger pipe, analogous to the trigger pipe of the notifier thread. On Linux, an ''eventfd''(2) is used instead, which only requires one single fd. Furthermore, events for regular files are not processed via ''epoll''(7), as it does not support them at present. Instead, events for regular files are immediately returned by the notifier when waiting for events.

The new implementation of the notifier only has two minor drawbacks:

 1. Each thread that has called ''Tcl_WaitForEvent''() at least once will create an ''epoll''(7)/''kqueue''(2) file descriptor.

 2. All threads create two ''pipe''(2) file descriptors for inter-thread IPC; on Linux, one single ''eventfd''(2) is created and used.

Therefore, threads that have waited on events at least once now own an additional amount of three/two file descriptors. Whether this could prove to be a problem remains a point of contention that should be subject to further discussion.

As far as the notifier implementation is concerned, threads do not share data structures or file descriptors; IPC is provided for explicitly. However, a thread may queue events to and then alert another thread in order to allow for less primitive forms of IPC. Therefore, the previously static mutex protecting the notifier list has become a per-thread mutex. Instead of protecting the notifier list, it protects per-thread event queues from event queue/unqueue race conditions. This only applies to the ''epoll''(7)/''kqueue''(2)-based notifier implementations.

The majority of Tcl code should be unable to observe any difference at the script level.

~ Specification

At present, code changes are almost entirely constrained to either ''unix/tclEpollNotfy.c'' wherever ''epoll''(7) is supported or ''unix/tclKqueueNotfy.c'' wherever ''kqueue''(2) is supported. The original ''select''(2)-based notifier implementation now lives in ''unix/tclSelectNotfy.c''.

Subroutines shared between ''unix/tcl{Epoll,Kqueue}Notfy.c'' have been moved to ''unix/tclUnixNotfy.c'', which is '''#include'''d by the former. As explained in the last section of this document, the previously static mutex in ''generic/tclNotify.c'' has become a per-thread mutex.

The new code associates the newly introduced (but private) ''PlatformEventData'' structure with each file descriptor to wait on and its corresponding ''FileHandler'' struct. ''PlatformEventData'' contains:

 1. A pointer to the ''FileHandler'' the file descriptor belongs to. This specifically facilitates updating the platform-specific mask of new events for the file descriptor of a ''FileHandler'' after returning from ''epoll_wait''(2)/''kevent''(2) in ''NotifierThreadProc''().

 2. A pointer to the ''ThreadSpecificData'' of the thread to whom the ''FileHandler'' belongs. This specifically facilitates alerting threads waiting on one or more ''FileHandlers'' in ''NotifierThreadProc''().

The core implementation is found in a set of six (6) newly introduced static subroutines in ''unix/tcl{Epoll,Kqueue}Notfy.c'':

 1. ''PlatformEventsControl''() - abstracts ''epoll_ctl''(2)/''kevent''(2). Called by ''Tcl_{Create,Delete}FileHandler''() to add/update event masks for a new or an old ''FileHandler'' and ''NotifierThreadProc''() in order to include the ''receivePipe'' fd when waiting for and processing events.

 2. ''PlatformEventsFinalize''() - abstracts ''close''(2) and ''ckfree''(). Called by ''Tcl_FinalizeNotifier''().

 3. ''PlatformEventsGet''() - abstracts iterating over an array of events. Called by ''NotifierThreadProc''().

 4. ''PlatformEventsInit''() - abstracts ''epoll_create''(2)/''kqueue''(2). Called by ''PlatformEvents{Control,Wait}''() and ''Tcl_WaitForEvent''().

 5. ''PlatformEventsTranslate''() - translates platform-specific event masks to '''TCL_{READABLE,WRITABLE,EXCEPTION}''' bits. Called by ''Tcl_WaitForEvent''().

 6. ''PlatformEventsWait''() - abstracts ''epoll_wait''(2)/''kevent''(2). Called by ''Tcl_WaitForEvent''() and ''NotifierThreadProc''(). 

Two additional subroutine are used in all three code paths (''epoll'', ''kqueue'', ''select'') to reduce code redundancy:

 1. ''AlertSingleThread''() - notify a single thread that is waiting on I/O. Called by ''NotifierThreadProc''().

 2. ''TclUnixWaitForFile() - reimplemented via ''poll''(2) instead of ''select''(2), as ''poll''(2) does not suffer the '''FD_SETSIZE''' limit on file descriptors that can be passed to ''select''(2) and is available on a sufficiently large number of platforms. Most importantly, this code would not benefit from using ''epoll''(7) or ''kqueue''(2) as this subroutine only waits on one single file descriptor at a time.

''PlatformEventsInit''() currently defaults to allocating space for 128 array members of ''struct epoll_event/kevent''. This could preferably be handled through e.g. ''fconfigure''.

Originally, a mutex used to protect the ''epoll''(7)/''kqueue''(2) file descriptor and the above mentioned array. This proved to be redundant as ''epoll_ctl''(2) can be called whilst blocking on ''epoll_wait''(2) on Linux and as ''kevent''(2) can be called whilst blocking on ''kevent''(2) on FreeBSD.

Lastly, the ''configure'' script is updated to define '''HAVE_EPOLL''' or '''HAVE_KQUEUE''' as appropriate.

~ Reference implementation

Please refer to the ''tip-458'' branch. The code is licensed under the BSD license.

~ Copyright

This document has been placed in the public domain. In legislations where this concept does not exist the BSD license applies.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83

# TIP 458: Add Support for epoll() and kqueue() in the Notifier

	Author:         Lucio Andrés Illanes Albornoz <[email protected]>
	Author:         Lucio Andrés Illanes Albornoz <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        24-Nov-2016
	Post-History:   
	Keywords:       event loop,scalability
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes to replace _select_\(2\) in the notifier implementation with _epoll_\(7\) and _kqueue_\(2\) on Linux and DragonFly-, Free-, Net-, and OpenBSD respectively. This is to remove a major bottleneck in the ability of Tcl to scale up to thousands and tens of thousands of sockets \(aka **C10K**\).
Furthermore, this should also provide sufficient infrastructure in order to permit adding support for other platform-specific event mechanisms in the future, such as IOCPs on Solaris and Windows.

# Rationale

The drawbacks associated with _poll_\(2\) and _select_\(2\) and the tremendously improved ability to scale of _epoll_\(7\) and _kqueue_\(2\) are well-known <https://en.wikipedia.org/wiki/C10k_problem> ; a previous attempt at implementing this feature elaborates on this subject and can be found at <https://sourceforge.net/p/tcl/mailman/tcl-core/?viewmonth=200909&viewday=10> .

Initially, the notifier thread was retained to provide for event notification and inter-thread IPC. This eventually proved unnecessary and thus the _epoll_\(7\)/_kqueue_\(2\) source modules now no longer contain the notifier thread and its infrastructure, particularly as this also reduces code size and complexity.

Threads that intend to wait on one or more file descriptors they own will now directly call _epoll\_wait_\(2\)/_kevent_\(2\) themselves during _Tcl\_WaitForEvent_\(\).  Inter-thread IPC is provided for by a per-thread trigger pipe, analogous to the trigger pipe of the notifier thread. On Linux, an _eventfd_\(2\) is used instead, which only requires one single fd. Furthermore, events for regular files are not processed via _epoll_\(7\), as it does not support them at present. Instead, events for regular files are immediately returned by the notifier when waiting for events.

The new implementation of the notifier only has two minor drawbacks:

 1. Each thread that has called _Tcl\_WaitForEvent_\(\) at least once will create an _epoll_\(7\)/_kqueue_\(2\) file descriptor.

 2. All threads create two _pipe_\(2\) file descriptors for inter-thread IPC; on Linux, one single _eventfd_\(2\) is created and used.

Therefore, threads that have waited on events at least once now own an additional amount of three/two file descriptors. Whether this could prove to be a problem remains a point of contention that should be subject to further discussion.

As far as the notifier implementation is concerned, threads do not share data structures or file descriptors; IPC is provided for explicitly. However, a thread may queue events to and then alert another thread in order to allow for less primitive forms of IPC. Therefore, the previously static mutex protecting the notifier list has become a per-thread mutex. Instead of protecting the notifier list, it protects per-thread event queues from event queue/unqueue race conditions. This only applies to the _epoll_\(7\)/_kqueue_\(2\)-based notifier implementations.

The majority of Tcl code should be unable to observe any difference at the script level.

# Specification

At present, code changes are almost entirely constrained to either _unix/tclEpollNotfy.c_ wherever _epoll_\(7\) is supported or _unix/tclKqueueNotfy.c_ wherever _kqueue_\(2\) is supported. The original _select_\(2\)-based notifier implementation now lives in _unix/tclSelectNotfy.c_.

Subroutines shared between _unix/tcl\{Epoll,Kqueue\}Notfy.c_ have been moved to _unix/tclUnixNotfy.c_, which is **\#include**d by the former. As explained in the last section of this document, the previously static mutex in _generic/tclNotify.c_ has become a per-thread mutex.

The new code associates the newly introduced \(but private\) _PlatformEventData_ structure with each file descriptor to wait on and its corresponding _FileHandler_ struct. _PlatformEventData_ contains:

 1. A pointer to the _FileHandler_ the file descriptor belongs to. This specifically facilitates updating the platform-specific mask of new events for the file descriptor of a _FileHandler_ after returning from _epoll\_wait_\(2\)/_kevent_\(2\) in _NotifierThreadProc_\(\).

 2. A pointer to the _ThreadSpecificData_ of the thread to whom the _FileHandler_ belongs. This specifically facilitates alerting threads waiting on one or more _FileHandlers_ in _NotifierThreadProc_\(\).

The core implementation is found in a set of six \(6\) newly introduced static subroutines in _unix/tcl\{Epoll,Kqueue\}Notfy.c_:

 1. _PlatformEventsControl_\(\) - abstracts _epoll\_ctl_\(2\)/_kevent_\(2\). Called by _Tcl\_\{Create,Delete\}FileHandler_\(\) to add/update event masks for a new or an old _FileHandler_ and _NotifierThreadProc_\(\) in order to include the _receivePipe_ fd when waiting for and processing events.

 2. _PlatformEventsFinalize_\(\) - abstracts _close_\(2\) and _ckfree_\(\). Called by _Tcl\_FinalizeNotifier_\(\).

 3. _PlatformEventsGet_\(\) - abstracts iterating over an array of events. Called by _NotifierThreadProc_\(\).

 4. _PlatformEventsInit_\(\) - abstracts _epoll\_create_\(2\)/_kqueue_\(2\). Called by _PlatformEvents\{Control,Wait\}_\(\) and _Tcl\_WaitForEvent_\(\).

 5. _PlatformEventsTranslate_\(\) - translates platform-specific event masks to **TCL\_\{READABLE,WRITABLE,EXCEPTION\}** bits. Called by _Tcl\_WaitForEvent_\(\).

 6. _PlatformEventsWait_\(\) - abstracts _epoll\_wait_\(2\)/_kevent_\(2\). Called by _Tcl\_WaitForEvent_\(\) and _NotifierThreadProc_\(\). 

Two additional subroutine are used in all three code paths \(_epoll_, _kqueue_, _select_\) to reduce code redundancy:

 1. _AlertSingleThread_\(\) - notify a single thread that is waiting on I/O. Called by _NotifierThreadProc_\(\).

 2. _TclUnixWaitForFile\(\) - reimplemented via _poll_\(2\) instead of _select_\(2\), as _poll_\(2\) does not suffer the **FD\_SETSIZE** limit on file descriptors that can be passed to _select_\(2\) and is available on a sufficiently large number of platforms. Most importantly, this code would not benefit from using _epoll_\(7\) or _kqueue_\(2\) as this subroutine only waits on one single file descriptor at a time.

_PlatformEventsInit_\(\) currently defaults to allocating space for 128 array members of _struct epoll\_event/kevent_. This could preferably be handled through e.g. _fconfigure_.

Originally, a mutex used to protect the _epoll_\(7\)/_kqueue_\(2\) file descriptor and the above mentioned array. This proved to be redundant as _epoll\_ctl_\(2\) can be called whilst blocking on _epoll\_wait_\(2\) on Linux and as _kevent_\(2\) can be called whilst blocking on _kevent_\(2\) on FreeBSD.

Lastly, the _configure_ script is updated to define **HAVE\_EPOLL** or **HAVE\_KQUEUE** as appropriate.

# Reference implementation

Please refer to the _tip-458_ branch. The code is licensed under the BSD license.

# Copyright

This document has been placed in the public domain. In legislations where this concept does not exist the BSD license applies.

Name change from tip/459.tip to tip/459.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

TIP:            459
Title:          Tcl Package Introspection Improvements
Version:        $Revision: 1.6 $
Author:         Jan Nijtmans <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        08-Dec-2016
Post-History:   
Keywords:       Tcl,package
Tcl-Version:    8.7

~ Abstract

This TIP proposes to improve package introspection by providing a new command
'''package files'''.

~ Rationale

This TIP is inspired by a request from FlightAware to improve Tcl's package
introspection possibilities. Although only a '''package files''' command was
requested, extending '''info loaded''' gives the possibility to find a shared
library contained in a package more easily than searching a list.

~ Specification of the proposed Change

Two new additions are proposed, to the '''package''' and '''info''' commands.

 1. '''package files''' ''name''

  > This command returns a list of filenames which were sourced during the
    initialization of package ''name''. More specific, the files that were
    sourced during running the script registered using '''package ifneeded'''.
    Left out are Tcl's own tclIndex and pkgIndex.tcl files, which might have been
    accessed due to dependancy searches, otherwise this would give very
    misleading results.

 1. '''info loaded''' ?''interpreter''? ?''name''?

  > The '''info loaded''' command already exists, it gives a list of package
    names with corresponding shared library names which were actually loaded
    in the give interpreter. The additional ''name'' argument restricts the
    result to the filename of the loaded library only.

Tcl packages don't have to do anything special in order to be introspected
correctly, just note that files containing auto_loaded commands cannot be
introspected because they are not sourced during package initialization.

~ Rejected alternatives

  > Use of ''source -nopkg'' in tclIndex files. Even though this addition in the
  earlier TIP was explicitly undocumented, it lead to the misunderstanding
  that other Tcl extensions should do the same.

  > Earlier implementation of this TIP didn't handle the second argument of
  '''info loaded''' correctly in all cases, and the handling in safe
  interpreters was not complete. This is all corrected in the current implementation.

  > All filenames should be converted to absolute. This is rejected for performance
  and for practical reasons. It could be quite expensive to calculate because
  the disk has to be accessed for possible hyper-links. Second, the package
  mechanism is already designed such that all sourced paths are absolute (see
  below example). Extensions using the '''source''' command with relative
  paths are in danger already, this should be fixed in the extension in
  stead of being masked in the '''package files''' command.

  > Additional information about the sourced files (like mtime or checksum) was
  suggested to be part of the introspection information, but this has been
  rejected as overkill. It is much more than requested in the Tcl-bounty, and
  it is difficult to imagine what actual use this would bring.

~ Reference Implementation

This is available in the ''package_files'' branch
[http://core.tcl.tk/tcl/timeline?r=package_files].

~ Examples

|$ tclsh8.7
|% package files Tcl
|/usr/lib/tcl8.7/init.tcl
|% package require Tk
|8.7a0
|% package files msgcat
|/usr/lib/tcl8/8.5/msgcat-1.6.0.tm
|% package files Tk
|/usr/lib/tk8.7/tk.tcl /usr/lib/tk8.7/msgs/en.msg /usr/lib/tk8.7/icons.tcl ...
|% info loaded {} Tk
|/usr/lib/libtk8.7.so

Note that '''package require Tk''' has the side-effect of loading the ''msgcat'' package, which is required by Tk.

~ Copyright

This document has been placed in the public domain.

Please note that any correspondence to the author concerning this TIP is
considered in the public domain unless otherwise specifically requested by the
individual(s) authoring said correspondence. This is to allow information
about the TIP to be placed in a public forum for discussion.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|
|

|

|

|

|
|
|

|

|

|

|

|

|

|

|
|

|

|

|

|
|

|

|
|
|
|
|
|
|
|
|
|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

# TIP 459: Tcl Package Introspection Improvements

	Author:         Jan Nijtmans <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        08-Dec-2016
	Post-History:   
	Keywords:       Tcl,package
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes to improve package introspection by providing a new command
**package files**.

# Rationale

This TIP is inspired by a request from FlightAware to improve Tcl's package
introspection possibilities. Although only a **package files** command was
requested, extending **info loaded** gives the possibility to find a shared
library contained in a package more easily than searching a list.

# Specification of the proposed Change

Two new additions are proposed, to the **package** and **info** commands.

 1. **package files** _name_

	  > This command returns a list of filenames which were sourced during the
    initialization of package _name_. More specific, the files that were
    sourced during running the script registered using **package ifneeded**.
    Left out are Tcl's own tclIndex and pkgIndex.tcl files, which might have been
    accessed due to dependancy searches, otherwise this would give very
    misleading results.

 1. **info loaded** ?_interpreter_? ?_name_?

	  > The **info loaded** command already exists, it gives a list of package
    names with corresponding shared library names which were actually loaded
    in the give interpreter. The additional _name_ argument restricts the
    result to the filename of the loaded library only.

Tcl packages don't have to do anything special in order to be introspected
correctly, just note that files containing auto\_loaded commands cannot be
introspected because they are not sourced during package initialization.

# Rejected alternatives

  > Use of _source -nopkg_ in tclIndex files. Even though this addition in the
  earlier TIP was explicitly undocumented, it lead to the misunderstanding
  that other Tcl extensions should do the same.

  > Earlier implementation of this TIP didn't handle the second argument of
  **info loaded** correctly in all cases, and the handling in safe
  interpreters was not complete. This is all corrected in the current implementation.

  > All filenames should be converted to absolute. This is rejected for performance
  and for practical reasons. It could be quite expensive to calculate because
  the disk has to be accessed for possible hyper-links. Second, the package
  mechanism is already designed such that all sourced paths are absolute \(see
  below example\). Extensions using the **source** command with relative
  paths are in danger already, this should be fixed in the extension in
  stead of being masked in the **package files** command.

  > Additional information about the sourced files \(like mtime or checksum\) was
  suggested to be part of the introspection information, but this has been
  rejected as overkill. It is much more than requested in the Tcl-bounty, and
  it is difficult to imagine what actual use this would bring.

# Reference Implementation

This is available in the _package\_files_ branch
<http://core.tcl.tk/tcl/timeline?r=package_files> .

# Examples

	$ tclsh8.7
	% package files Tcl
	/usr/lib/tcl8.7/init.tcl
	% package require Tk
	8.7a0
	% package files msgcat
	/usr/lib/tcl8/8.5/msgcat-1.6.0.tm
	% package files Tk
	/usr/lib/tk8.7/tk.tcl /usr/lib/tk8.7/msgs/en.msg /usr/lib/tk8.7/icons.tcl ...
	% info loaded {} Tk
	/usr/lib/libtk8.7.so

Note that **package require Tk** has the side-effect of loading the _msgcat_ package, which is required by Tk.

# Copyright

This document has been placed in the public domain.

Please note that any correspondence to the author concerning this TIP is
considered in the public domain unless otherwise specifically requested by the
individual\(s\) authoring said correspondence. This is to allow information
about the TIP to be placed in a public forum for discussion.

Name change from tip/46.tip to tip/46.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69

TIP:		46
Title:          Consistent Overlap Behavior of Area-Defining Canvas Items 
Version:        $Revision: 1.5 $
Author:         Gerhard Hintermayer <[email protected]>
State:          Withdrawn
Type:           Project
Vote:           Pending
Created:        18-Jul-2001
Post-History:   
Tcl-Version:	8.5

~ Abstract

This document proposes that all canvas items that define an area
should behave the same in terms of interior points, i.e. points that
return the enclosing object id when submitted to
''[$canvas find overlapping]''.  Currently polygons behave differently
from the rest (rectangle, arc, oval).

~ Rationale

As long as these area-defining canvas items are filled, there's no problem.
The interior points belong to the object. But when the object is not filled
(i.e. -fill "" is used), only polygons consider inside points as overlapping.
For the rest of the area-defining canvas items, an interior point is ''not''
considered to overlap the object.  This makes it impossible to

   * define invisible or not filled mouse sensitive areas other than polygons
     because moving the pointer inside of an arc/oval/rectangle creates both
     an ''<Enter>'' and a ''<Leave>'' event, even though the pointer is still
     inside the item.

   * do object-oriented selection on a canvas. Consider you want to select
     a (not filled) oval, you ''have to'' click on the vertice, or else you
     won't find a overlapping item.

Well, I see the point, that this proposal might break existing code, but
from the number of replies to my postings at news:comp.lang.tcl ,
''[$canvas find overlapping]'' is not used very often.

One possibility to fix the backward compatibility is to introduce 2
different fill colors for the 2 cases - object either hollow or solid
but not filled. Then inside points would not overlap hollow objects, but
would overlap solid objects.

~ Proposal

We should either choose a wire frame model or an object-oriented model
for canvas objects. To my mind an object-oriented approach is better.
Right now we have a mixture of both. Polygons are objects, arcs, ovals and
rectangles are wire frames.

What I'd like is: all points which are inside of an area object should return
the enclosing object when passed to find overlap, ''regardless'' of the fill
color of the item.

~ Notice of Withdrawal

This TIP was Withdrawn by the TIP Editor following discussion on the
tcl-core mailing list.  The following is a summary of reasons for
withdrawal:

 > This would remove useful behaviour that is used rather more often
   than people think.  If people want unfilled polygons with the other
   style of overlap behaviour, they should use lines.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|
|

|

|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69

# TIP 46: Consistent Overlap Behavior of Area-Defining Canvas Items

	Author:         Gerhard Hintermayer <[email protected]>
	State:          Withdrawn
	Type:           Project
	Vote:           Pending
	Created:        18-Jul-2001
	Post-History:   
	Tcl-Version:	8.5
-----

# Abstract

This document proposes that all canvas items that define an area
should behave the same in terms of interior points, i.e. points that
return the enclosing object id when submitted to
_[$canvas find overlapping]_.  Currently polygons behave differently
from the rest \(rectangle, arc, oval\).

# Rationale

As long as these area-defining canvas items are filled, there's no problem.
The interior points belong to the object. But when the object is not filled
\(i.e. -fill "" is used\), only polygons consider inside points as overlapping.
For the rest of the area-defining canvas items, an interior point is _not_
considered to overlap the object.  This makes it impossible to

   * define invisible or not filled mouse sensitive areas other than polygons
     because moving the pointer inside of an arc/oval/rectangle creates both
     an _<Enter>_ and a _<Leave>_ event, even though the pointer is still
     inside the item.

   * do object-oriented selection on a canvas. Consider you want to select
     a \(not filled\) oval, you _have to_ click on the vertice, or else you
     won't find a overlapping item.

Well, I see the point, that this proposal might break existing code, but
from the number of replies to my postings at news:comp.lang.tcl ,
_[$canvas find overlapping]_ is not used very often.

One possibility to fix the backward compatibility is to introduce 2
different fill colors for the 2 cases - object either hollow or solid
but not filled. Then inside points would not overlap hollow objects, but
would overlap solid objects.

# Proposal

We should either choose a wire frame model or an object-oriented model
for canvas objects. To my mind an object-oriented approach is better.
Right now we have a mixture of both. Polygons are objects, arcs, ovals and
rectangles are wire frames.

What I'd like is: all points which are inside of an area object should return
the enclosing object when passed to find overlap, _regardless_ of the fill
color of the item.

# Notice of Withdrawal

This TIP was Withdrawn by the TIP Editor following discussion on the
tcl-core mailing list.  The following is a summary of reasons for
withdrawal:

 > This would remove useful behaviour that is used rather more often
   than people think.  If people want unfilled polygons with the other
   style of overlap behaviour, they should use lines.

# Copyright

This document has been placed in the public domain.

Name change from tip/460.tip to tip/460.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61

62
63
64
65
66

67
68
69
70

71
72
73
74
75
76
77
78

79
80
81
82

83
84
85

86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141

142
143
144
145
146
147
148
149
150
151
152
153

154
155
156
157
158

159
160
161
162
163
164
165
166
167
168
169
170
171
172

173
174
175
176
177
178

179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232

TIP:            460
Title:          An Alternative to Upvar
Version:        $Revision: 1.5 $
Author:         Don Hathway <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        08-Dec-2016
Post-History:   
Keywords:       Tcl,variable,link,upvar
Tcl-Version:    9.0

~ Abstract

Variable linking with the ''upvar'' command is not as intuitive or effecient
as it should be. This TIP proposes an alternative through automatic variable
linking.

~Rationale

The current strategy used to link a variable in a called procedure to the
caller, is to pass the name of the variable to the procedure, and use the
''upvar'' command to create a new variable, which is then linked to the
original. Thus linking to a variable requires two components; the variable
name and a newly created variable.

It is possible to instruct Tcl to do this linking automatically in an idiomatic way
and dispense with the ''upvar'' command call.

Also, the requirement (by ''upvar'') that the name of the new link variable be a 
different name from the original is arguably considered counter-intuitive.

Benefits to this TIP as proposed:

 1. '''No code to perform explicit linking within the procedure's body.'''
    Unlike ''upvar'', this method requires no additional code to be entered in
    the body of the procedure. Less code, less bugs, easier to use! It has
    been said that Tcl'ers should make more use of variable linking in their
    code. Making it easier for them should have an encouraging effect, similar
    to how most Tcl'ers prefer ''$var'' over ''set var''.

 2. '''Clearly defines links in the procedure's parameter list.''' Readers
    should instantly know what the links are.  Clarity is important,
    especially for people that read code all day. There are no special project
    naming conventions to follow. A reader doesn't have to rely on docs or
    assume that a parameter name of "varName", "_var", or "varLnk" is to be
    linked by an upvar call, of which may be pushed down in the procedure's
    body by comments or other code.

 3. '''Alleviates arguably messy ''upvar'' chain linking.'''

~~ Upvar Chaining Example

Below are three ''upvar''s with the same arguments. As you can see, there is
quite a bit of arguably unnecessary code duplication, and that is bug prone.

| proc foo {a} {
|    upvar 1 $a la
|    maybe do something with la
|    bar la
| }

| proc bar {a} {
|    upvar 1 $a la
|    maybe do something with la
|    baz la
| }

| proc baz {a} {
|    upvar 1 $a la
|    maybe do something with la
| }

| foo begin

This could be written more succinctly:

| proc foo {*a} {
|    maybe do something with a
|    bar a
| }

| proc bar {*a} {
|    maybe do something with a
|    baz a
| }

| proc baz {*a} {
|    maybe do something with a
| }

| foo begin

~ Specification

Add support to procedure handling to allow for a parametric hint to procedure
definitions with respect to the intent to link variables accordingly. We use
the asterisk character "'''*'''" as the symbol to declare this intent; which
shall prefix the parameter's name. Consequently, the "'''*'''" character becomes 
special, but only inside the procedure parameter list. A procedure definition using 
this facility would then have the signature:

| proc foo {*a *b} {...}

Where '''*a''' and '''*b''' are the procedure's parameters to be linked to the
caller's arguments. New variables are then created for the future linking. In this example
'''*a''' creates a new link variable named '''a''', and likewise done for '''*b'''.
'''*a''' and '''*b''' holds the values passed in by the caller.

The formal parameter's shall retain the same values provided by the caller.

The link variable's name shall always have one '''*''' symbol less than its counterpart parameter, for the sake of consistency.  In example, a parameter named '''***a''' shall have a counterpart link variable named '''**a'''. Similarily '''**a''' shall have a counterpart link named '''*a'''.

Where there are duplicate link parameter names (i.e. proc P {*a *a}) the behavior shall be the same as if there were duplicate '''upvar''' statements.

It is legal to have empty link variable names. It shall be possible with a single '''*''' in
the procedure's parameter list (i.e. proc P {*} {incr ""}). The same duplicate name rule applies.

If the variable to be linked does not exist, it shall be created, if necessary. It shall have the same behavior as '''upvar 1''' in such instances.

When a link's construction fails, the behavior shall be the same as if '''upvar''' had failed, the procedure will
return with an error before any other commands (with exception to any commands involved in the link's construction) in its body are executed.

It is illegal for a link parameter to have a default value. It shall invoke an error during procedure
creation time and result in failed procedure creation with the error code:

| Tcl_SetErrorCode(interp, "TCL", "OPERATION", "PROC","FORMALARGUMENTFORMAT", NULL);

An example of such an error for:
| proc P {{*a foo}} {...}
Would be: "procedure "P": formal parameter "*a"  is to be linked and must not have a default value"

In that example, proc '''P''' is never created, the attempt failed due to the error.

It is the caller's responsibility to provide the names of variables to be linked. This 
constraint exists in the spirit of promoting good coding practices and to help avoid 
obscure and subtle bugs. For the same reasons, this TIP only searches one level up.
Therefore, It shall have the same behavior as '''upvar 1'''.

'''*args''' is a valid parameter name. For example, '''args''' is simply a
link in:

| proc foo {a *args} {
|     incr args
| }

Note that as of this TIP ''proc foo {args args} {...}'' is legal Tcl. In this

instance only the first ''scalar'' '''args''' is usable by the procedure. The
rest of the arguments are inaccessible by the script. They're not internally
lost, but Tcl's variable lookup mechanics will choose whichever is found first
when a script references it. This behavior is inherited for ''proc foo {*args
args} {...}''. Where '''args''' will be a link.

To further illustrate this proposal with an example:

| proc foo {*a *b} {
|   bar a b
| }

| proc bar {*a *b} {
|   incr a
|   incr b
| }

| set v1 0
| set v2 1
| foo v1 v2
| puts $v1
| # prints 1
| puts $v2
| # prints 2

| # Version of foo using upvar:
| proc foo {a b} {
|    # Note, upvar $a a would be an error.
|    upvar 1 $a la $b lb
|    bar la lb
| }

| proc bar {a b} {
|    upvar 1 $a la $b lb
|    incr la
|    incr lb
| }

The "'''*'''" character was chosen primarily because it
resembles a star or a snowflake and has a pleasantry to it. It is one of the few
ascii characters that '''sticks out''' from its surrounding text. 

It is also familiar to users of other languages where the same symbol exhibits similar
semantics (to wit: a link in Tcl acts as a reference to another variable and doesn't perform 
a copy when the reference is written to, as it would if it weren't a link). However, 
unlike other languages, the Tcl core does not expose operations to user scripts that work directly on 
memory, so the "'''*'''" character should not be mistaken to behave the same or suffer from the same 
pitfalls as it does in C, C++, Golang, etc. The '''*''' symbol simply instructs Tcl to 
create a link if it is able to do so.

~~ Consequences

 1. Breaks scripts using the special "'''*'''" as the first character in their
    procedure's parameters (i.e. '''*var''').

  > The impact of this should be minimal because these variable names require
    the user to wrap it in curly braces (i.e. '''${*var}''') to fetch their
    values, unless they're using the less common form of '''set varname'''.

~Reference Implementation

See branch ''dah-proc-arg-upvar''

~~Implementation Notes

 tclInt.h: Add a new field named ''numArgsCompiledLocals'' to the Proc struct.
  The new field holds the number of parameters along with any other relevant
  local variables which follow immediately after the parameters. For this TIP,
  these additional locals are variables with the VAR_LINK flag and to be resolved 
  as links to the values of arguments they've been configured to link with.

  The additional field was a hard choice, but is necessary because ''TclProcCompileProc'' 
  enforces ''procPtr->numCompiledLocals'' to be the same value as ''procPtr->numArgs''. 
  The local variable table is evidently not growable until later.

 tclProc.c: Modify ''InitArgsAndLocals'' to do the automatic linking. Note
  that this is a ''very hot'' function and that was kept in mind while making
  the necessary adjustments. There are two additional branches in the function
  (the second only visited when an error happens). The first to check if the
  command has any parameters that need linking and if so, process them with
  link support handling code. The second branch is to simply check if the link
  handling code set an error when an error occurs, so this branch should not
  be a concern as to performance impact. Due to branch prediction and this
  function being so hot, there should be virtually nil of a performance impact 
  on any code which doesn't make use of the new automatic linking facility.

 tclProc.c: Modify ''TclCreateProc'' to add additional locals after the list
 of parameter locals (if any) when there are parameters flagged for auto linking.

~Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
<
>
|

|
|
|
<
>
|
|
|
<
>
|
|
<
>
|

|

|
|

|

|

|
|

|

|

|
|

|

|
|

|

|
|

|

|

|

|
|
<
|
|
>
|

|
|

|
|
<
|
>
|
|
|
<
|
>
|
|
|
|
|
|
|

|
|
|
|
|
<
>
|
|
|
|
<
|
>
|

|

|
|

|
|

|

|
|

|
|
|

|

|

|

|

|

|
|

|
|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59

60
61
62
63
64

65
66
67
68

69
70
71
72
73
74
75
76

77
78
79
80

81
82
83

84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137

138
139
140
141
142
143
144
145
146
147
148
149
150

151
152
153
154
155

156
157
158
159
160
161
162
163
164
165
166
167
168
169
170

171
172
173
174
175

176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232

# TIP 460: An Alternative to Upvar

	Author:         Don Hathway <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        08-Dec-2016
	Post-History:   
	Keywords:       Tcl,variable,link,upvar
	Tcl-Version:    9.0
-----

# Abstract

Variable linking with the _upvar_ command is not as intuitive or effecient
as it should be. This TIP proposes an alternative through automatic variable
linking.

# Rationale

The current strategy used to link a variable in a called procedure to the
caller, is to pass the name of the variable to the procedure, and use the
_upvar_ command to create a new variable, which is then linked to the
original. Thus linking to a variable requires two components; the variable
name and a newly created variable.

It is possible to instruct Tcl to do this linking automatically in an idiomatic way
and dispense with the _upvar_ command call.

Also, the requirement \(by _upvar_\) that the name of the new link variable be a 
different name from the original is arguably considered counter-intuitive.

Benefits to this TIP as proposed:

 1. **No code to perform explicit linking within the procedure's body.**
    Unlike _upvar_, this method requires no additional code to be entered in
    the body of the procedure. Less code, less bugs, easier to use! It has
    been said that Tcl'ers should make more use of variable linking in their
    code. Making it easier for them should have an encouraging effect, similar
    to how most Tcl'ers prefer _$var_ over _set var_.

 2. **Clearly defines links in the procedure's parameter list.** Readers
    should instantly know what the links are.  Clarity is important,
    especially for people that read code all day. There are no special project
    naming conventions to follow. A reader doesn't have to rely on docs or
    assume that a parameter name of "varName", "\_var", or "varLnk" is to be
    linked by an upvar call, of which may be pushed down in the procedure's
    body by comments or other code.

 3. **Alleviates arguably messy _upvar_ chain linking.**

## Upvar Chaining Example

Below are three _upvar_s with the same arguments. As you can see, there is
quite a bit of arguably unnecessary code duplication, and that is bug prone.

	 proc foo {a} {
	    upvar 1 $a la
	    maybe do something with la
	    bar la

	 }
	 proc bar {a} {
	    upvar 1 $a la
	    maybe do something with la
	    baz la

	 }
	 proc baz {a} {
	    upvar 1 $a la
	    maybe do something with la

	 }
	 foo begin

This could be written more succinctly:

	 proc foo {*a} {
	    maybe do something with a
	    bar a

	 }
	 proc bar {*a} {
	    maybe do something with a
	    baz a

	 }
	 proc baz {*a} {
	    maybe do something with a

	 }
	 foo begin

# Specification

Add support to procedure handling to allow for a parametric hint to procedure
definitions with respect to the intent to link variables accordingly. We use
the asterisk character "**\***" as the symbol to declare this intent; which
shall prefix the parameter's name. Consequently, the "**\***" character becomes 
special, but only inside the procedure parameter list. A procedure definition using 
this facility would then have the signature:

	 proc foo {*a *b} {...}

Where **\*a** and **\*b** are the procedure's parameters to be linked to the
caller's arguments. New variables are then created for the future linking. In this example
**\*a** creates a new link variable named **a**, and likewise done for **\*b**.
**\*a** and **\*b** holds the values passed in by the caller.

The formal parameter's shall retain the same values provided by the caller.

The link variable's name shall always have one **\*** symbol less than its counterpart parameter, for the sake of consistency.  In example, a parameter named **\*\*\*a** shall have a counterpart link variable named **\*\*a**. Similarily **\*\*a** shall have a counterpart link named **\*a**.

Where there are duplicate link parameter names \(i.e. proc P \{\*a \*a\}\) the behavior shall be the same as if there were duplicate **upvar** statements.

It is legal to have empty link variable names. It shall be possible with a single **\*** in
the procedure's parameter list \(i.e. proc P \{\*\} \{incr ""\}\). The same duplicate name rule applies.

If the variable to be linked does not exist, it shall be created, if necessary. It shall have the same behavior as **upvar 1** in such instances.

When a link's construction fails, the behavior shall be the same as if **upvar** had failed, the procedure will
return with an error before any other commands \(with exception to any commands involved in the link's construction\) in its body are executed.

It is illegal for a link parameter to have a default value. It shall invoke an error during procedure
creation time and result in failed procedure creation with the error code:

	 Tcl_SetErrorCode(interp, "TCL", "OPERATION", "PROC","FORMALARGUMENTFORMAT", NULL);

An example of such an error for:
	 proc P {{*a foo}} {...}
Would be: "procedure "P": formal parameter "\*a"  is to be linked and must not have a default value"

In that example, proc **P** is never created, the attempt failed due to the error.

It is the caller's responsibility to provide the names of variables to be linked. This 
constraint exists in the spirit of promoting good coding practices and to help avoid 
obscure and subtle bugs. For the same reasons, this TIP only searches one level up.
Therefore, It shall have the same behavior as **upvar 1**.

**\*args** is a valid parameter name. For example, **args** is simply a
link in:

	 proc foo {a *args} {
	     incr args

	 }

Note that as of this TIP _proc foo \{args args\} \{...\}_ is legal Tcl. In this
instance only the first _scalar_ **args** is usable by the procedure. The
rest of the arguments are inaccessible by the script. They're not internally
lost, but Tcl's variable lookup mechanics will choose whichever is found first
when a script references it. This behavior is inherited for _proc foo \{\*args
args\} \{...\}_. Where **args** will be a link.

To further illustrate this proposal with an example:

	 proc foo {*a *b} {
	   bar a b

	 }

	 proc bar {*a *b} {
	   incr a
	   incr b

	 }

	 set v1 0
	 set v2 1
	 foo v1 v2
	 puts $v1
	 # prints 1
	 puts $v2
	 # prints 2

	 # Version of foo using upvar:
	 proc foo {a b} {
	    # Note, upvar $a a would be an error.
	    upvar 1 $a la $b lb
	    bar la lb

	 }
	 proc bar {a b} {
	    upvar 1 $a la $b lb
	    incr la
	    incr lb

	 }

The "**\***" character was chosen primarily because it
resembles a star or a snowflake and has a pleasantry to it. It is one of the few
ascii characters that **sticks out** from its surrounding text. 

It is also familiar to users of other languages where the same symbol exhibits similar
semantics \(to wit: a link in Tcl acts as a reference to another variable and doesn't perform 
a copy when the reference is written to, as it would if it weren't a link\). However, 
unlike other languages, the Tcl core does not expose operations to user scripts that work directly on 
memory, so the "**\***" character should not be mistaken to behave the same or suffer from the same 
pitfalls as it does in C, C\+\+, Golang, etc. The **\*** symbol simply instructs Tcl to 
create a link if it is able to do so.

## Consequences

 1. Breaks scripts using the special "**\***" as the first character in their
    procedure's parameters \(i.e. **\*var**\).

	  > The impact of this should be minimal because these variable names require
    the user to wrap it in curly braces \(i.e. **$\{\*var\}**\) to fetch their
    values, unless they're using the less common form of **set varname**.

# Reference Implementation

See branch _dah-proc-arg-upvar_

## Implementation Notes

 tclInt.h: Add a new field named _numArgsCompiledLocals_ to the Proc struct.
  The new field holds the number of parameters along with any other relevant
  local variables which follow immediately after the parameters. For this TIP,
  these additional locals are variables with the VAR\_LINK flag and to be resolved 
  as links to the values of arguments they've been configured to link with.

  The additional field was a hard choice, but is necessary because _TclProcCompileProc_ 
  enforces _procPtr->numCompiledLocals_ to be the same value as _procPtr->numArgs_. 
  The local variable table is evidently not growable until later.

 tclProc.c: Modify _InitArgsAndLocals_ to do the automatic linking. Note
  that this is a _very hot_ function and that was kept in mind while making
  the necessary adjustments. There are two additional branches in the function
  \(the second only visited when an error happens\). The first to check if the
  command has any parameters that need linking and if so, process them with
  link support handling code. The second branch is to simply check if the link
  handling code set an error when an error occurs, so this branch should not
  be a concern as to performance impact. Due to branch prediction and this
  function being so hot, there should be virtually nil of a performance impact 
  on any code which doesn't make use of the new automatic linking facility.

 tclProc.c: Modify _TclCreateProc_ to add additional locals after the list
 of parameter locals \(if any\) when there are parameters flagged for auto linking.

# Copyright

This document has been placed in the public domain.

Name change from tip/461.tip to tip/461.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185

186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222

TIP:            461
Title:          Separate Numeric and String Comparison Operators
Version:        $Revision: 1.6 $
Author:         Kevin B Kenny <[email protected]>
Author:         Kevin B Kenny <[email protected]>
Author:         Kevin Kenny <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        24-Jan-2017
Post-History:   
Keywords:       Tcl,expression
Tcl-Version:    8.7

~ Abstract

This TIP proposes to complete the separation between string and numeric
comparison operations in [[expr]] and related commands ([[for]], [[if]],
[[while]], etc.). It introduces new comparison operators '''ge''', '''gt''',
'''le''', and '''lt''', (along with the corresponding commands in the
'''::tcl::mathop''' namespace), and encourages programmers to restrict the six operators '''==''', '''>=''', '''>''', '''<=''', '''<''' and '''!=''' to comparisons of numeric
values.

~ Rationale

Tcl throughout its history has had comparison operators that freely compare
numeric and string values. These operators behave as expected if both their
arguments are numeric: they compare values on the real number line. Hence, 15
< 0x10 < 0b10001. Similarly, if presented with non-numeric strings, they
compare the strings in lexicographic order, as a programmer might expect:
"bambam" < "barney" < "betty" < "fred".

Trouble arises, however, when numeric and non-numeric strings are compared.
The rule for comparison is that mixed-type comparisons like this are treated
as string comparisons. The result is that '''<''' does not induce an order.
There are inconsistent comparison results, rendering '''<''' and friends
worthless for sorting. 0x10 < 0y < 1 < 0x10.

The problems with this inconsistency prompted changes in May of 2000,
introducing '''eq''' and '''ne''' operators that always perform string
comparison. For whatever reason, the four inequality operations never
followed. This leads to pitfalls for the unwary. It's fairly well entrenched
in the Tcl folklore that comparisons other than '''eq''' and '''ne''' should
be reserved for numeric arguments only, and experienced Tcl programmers know
to write:

| if {[string compare $x $y] < 0} { ... }

in place of 

| if {$x < $y} { ... }

~ Proposal

Four new bareword operators, '''ge''', '''gt''', '''le''' and
'''lt''' shall be added to the expression parser and to the
'''::tcl::mathop''' command set. They will have precedence identical to
the existing operators '''>=''', '''>''', '''<=''' and '''<'''. They
will accept string values, and return 0 or 1 according to lexicographic
string comparison of their operators. This change is entirely backward
compatible (it uses syntax that would previously have been erroneous),
and should go in as soon as possible - no later than the next point
release, but ideally even in a patchlevel - so that programmers can
begin conversion as soon as possible. Use of the '''==''', '''>=''',
'''>''', '''<=''', '''<''', and '''!=''' for comparing non-numeric
values shall immediately be deprecated.

The six string compare operators shall be declared to function so that
their results are the same as the results of [[string compare]]:

|    {$a lt $b}  <=> {[string compare $a $b] <  0}
|    {$a le $b}  <=> {[string compare $a $b] <= 0}
|    {$a eq $b}  <=> {[string compare $a $b] == 0}
|    {$a ne $b}  <=> {[string compare $a $b] != 0}
|    {$a gt $b}  <=> {[string compare $a $b] >  0}
|    {$a ge $b}  <=> {[string compare $a $b] >= 0}

It is also intended that any future changes to [[string compare]]
(for example, a hypothetical change to make it follow Unicode collation
semantics) will have the corresponding effect on these six operators.

Unlike what was specified in an earlier version of this TIP, no
changes are to  be made to the semantics of the comparison operators
 '''==''', '''>=''', '''>''', '''<=''', '''<''', and '''!='''.

~ Discussion

~~ Forcing typed comparisons in Tcl

Programmers who wish to insure string semantics should restrict their
comparisons to the '''lt''', '''le''', '''eq''', '''ne''', '''gt'''
and '''ge''' operators.

Use of the '''<''', '''<=''', '''==''', '''!=''', '''>''' and '''>='''
operators with operands that might be non-numeric shall be regarded
as poor programming style. Unless operands are constant, unary '''+'''
should be used to force them to be numeric. Thus,

| if {$x < $y} { ... }

should be relaced with

| if {+$x < +$y} { ... }

The second comparison will have the effect of forcing both operands to be
numeric.

~~ Rejected alternatives

Earlier, the radical suggestion of ''requiring'' the '''<''',
'''<=''', '''==''', '''!=''', '''>''' and '''>=''' operators to have
numeric arguments had been read into this TIP. It appears that there
is far too much outstanding code that is written like:

    if {$x == "somestring"} { ... }

to have the more radical option be viable.

One possible alternative to excluding non-numeric arguments from the
comparison operators is to change their semantics so that all non-numeric
strings are greater than all numbers. This change would at least yield a
consistent ordering. The ordering that it yields would, however, be somewhat
surprising, and not terribly useful. (It would at least be compatible with
today's scheme for numeric comparisons.)

~~ Objections (and rebuttals)

In out-of-band discussions, several objections were raised. This section
attempts to address them.

   1. ''Tcl's expression parser has a hard limit of 64 different binary
      operators. This proposal consumes four of them, leaving only 28. There
      is a concern that this is a less-than-effective use of a limited
      resource.''

    > The limit is self-imposed, in an effort to make the nodes of an
      expression parse tree fit in exactly 16 bytes (or four int's). It is far
      from obvious that this pretty size is actually useful. Few expressions
      are more than a few dozen parse nodes, and typical expressions are not
      parsed multiple times. It appears that neither the speed of the parse
      nor the size of the tree will be critical issues in most applications.
      In any case, we still have nearly half the operators left.

   2. ''There is some concern that using barewords for operators was a bad
      idea in the first place.'' The fact that

| expr {"foo"}

    > and

| set x foo; expr {$x}

     > both work, while

| expr {foo}

     > is an invalid bareword is arguably surprising.

    > Nevertheless, we have committed to the approach with the '''eq''',
      '''ne''', '''in''' and '''ni''' operators. These are unlikely to go
      away. Adding '''lt''', '''le''', '''gt''' and '''ge''' will make this
      problem no better nor worse.

    > Moreover, the language of [[expr]] is not the same as Tcl. It does not
      strip comments, parse into words, and apply Tcl's precise substitution
      rules - and it would be surprising if it did!  There are other "little
      languages" throughout Tcl - regular expressions, glob patterns, assembly
      code, and so on. [[expr]] is one among many.

   3. ''There is concern that [[expr]], which was originally intended almost
      exclusively for numeric calculations, is being abused with string
      arguments and possibly string results.''

    > The author of this TIP contends that we introduced string values to
      [[expr]] a long time ago, certainly by the time that the '''eq''',
      '''ne''', '''in''' and '''ni''' operations were introduced.  It is true
      that the use of numeric conversions in [[expr]] is incoherent, as seen
      in:

|   % proc tcl::mathfunc::cat {args} { join $args {} }
|   % expr {cat(0x1,0x2,"a")}
|   0x10x2a
|   % expr {cat(0x1)}
|   1

    > (Bug [[e7c21ed678]] is another manifestation of this general
      problem.) Once again, adding additional string operations
      that behave, with respect to data types, exactly the same
      as ones that are already there will neither fix nor exacerbate
      the general problem.

   4. ''Because [[expr]] has no interpreted form, the operations must have
      bytecode representations. The space of available bytecodes is under even
      more pressure than the space of available operators, and must not be
      squandered on operations that are duplicative of already-available
      functionality such as [[string compare]].''

    > The obvious rebuttal is that [[string compare]] is already bytecoded.
      There are no new operations required, merely a compiler that is smart
      enough to emit a short codeburst rather than a single bytecode. As an
      example, the code for the expression

|   {$x lt $y}

    > could
      be:

|   (0) loadScalar1 %v0        # var "x"
|   (2) loadScalar1 %v1        # var "y"
|   (4) strcmp 
|   (5) push1 0        # "0"
|   (7) lt 

    > For the other string operators, only the last bytecode in the burst
      would change.  No new bytecode operations are needed. In fact, this
      codeburst is identical code to that generated for the expression

|   {[string compare $x $y] < 0}

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|
|
|
|

|

|
|

|

|

|

|

|

|
|
|
|

|

|
|

|

|
|
|
|
|
|

|
|
|

|

|

|

|
|

|

|

|

|

|

|
|

|

|
|

|

|

|

|
|

|
|

|

|

|

|

|

|

|
|
|

|

|

|

|

|
|
|
|

|
|
|
|
<
|
>
|
|

|

|

|

|

|

|
|
|
|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182

183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222

# TIP 461: Separate Numeric and String Comparison Operators

	Author:         Kevin B Kenny <[email protected]>
	Author:         Kevin B Kenny <[email protected]>
	Author:         Kevin Kenny <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        24-Jan-2017
	Post-History:   
	Keywords:       Tcl,expression
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes to complete the separation between string and numeric
comparison operations in [expr] and related commands \([for], [if],
[while], etc.\). It introduces new comparison operators **ge**, **gt**,
**le**, and **lt**, \(along with the corresponding commands in the
**::tcl::mathop** namespace\), and encourages programmers to restrict the six operators **==**, **>=**, **>**, **<=**, **<** and **!=** to comparisons of numeric
values.

# Rationale

Tcl throughout its history has had comparison operators that freely compare
numeric and string values. These operators behave as expected if both their
arguments are numeric: they compare values on the real number line. Hence, 15
< 0x10 < 0b10001. Similarly, if presented with non-numeric strings, they
compare the strings in lexicographic order, as a programmer might expect:
"bambam" < "barney" < "betty" < "fred".

Trouble arises, however, when numeric and non-numeric strings are compared.
The rule for comparison is that mixed-type comparisons like this are treated
as string comparisons. The result is that **<** does not induce an order.
There are inconsistent comparison results, rendering **<** and friends
worthless for sorting. 0x10 < 0y < 1 < 0x10.

The problems with this inconsistency prompted changes in May of 2000,
introducing **eq** and **ne** operators that always perform string
comparison. For whatever reason, the four inequality operations never
followed. This leads to pitfalls for the unwary. It's fairly well entrenched
in the Tcl folklore that comparisons other than **eq** and **ne** should
be reserved for numeric arguments only, and experienced Tcl programmers know
to write:

	 if {[string compare $x $y] < 0} { ... }

in place of 

	 if {$x < $y} { ... }

# Proposal

Four new bareword operators, **ge**, **gt**, **le** and
**lt** shall be added to the expression parser and to the
**::tcl::mathop** command set. They will have precedence identical to
the existing operators **>=**, **>**, **<=** and **<**. They
will accept string values, and return 0 or 1 according to lexicographic
string comparison of their operators. This change is entirely backward
compatible \(it uses syntax that would previously have been erroneous\),
and should go in as soon as possible - no later than the next point
release, but ideally even in a patchlevel - so that programmers can
begin conversion as soon as possible. Use of the **==**, **>=**,
**>**, **<=**, **<**, and **!=** for comparing non-numeric
values shall immediately be deprecated.

The six string compare operators shall be declared to function so that
their results are the same as the results of [string compare]:

	    {$a lt $b}  <=> {[string compare $a $b] <  0}
	    {$a le $b}  <=> {[string compare $a $b] <= 0}
	    {$a eq $b}  <=> {[string compare $a $b] == 0}
	    {$a ne $b}  <=> {[string compare $a $b] != 0}
	    {$a gt $b}  <=> {[string compare $a $b] >  0}
	    {$a ge $b}  <=> {[string compare $a $b] >= 0}

It is also intended that any future changes to [string compare]
\(for example, a hypothetical change to make it follow Unicode collation
semantics\) will have the corresponding effect on these six operators.

Unlike what was specified in an earlier version of this TIP, no
changes are to  be made to the semantics of the comparison operators
 **==**, **>=**, **>**, **<=**, **<**, and **!=**.

# Discussion

## Forcing typed comparisons in Tcl

Programmers who wish to insure string semantics should restrict their
comparisons to the **lt**, **le**, **eq**, **ne**, **gt**
and **ge** operators.

Use of the **<**, **<=**, **==**, **!=**, **>** and **>=**
operators with operands that might be non-numeric shall be regarded
as poor programming style. Unless operands are constant, unary **\+**
should be used to force them to be numeric. Thus,

	 if {$x < $y} { ... }

should be relaced with

	 if {+$x < +$y} { ... }

The second comparison will have the effect of forcing both operands to be
numeric.

## Rejected alternatives

Earlier, the radical suggestion of _requiring_ the **<**,
**<=**, **==**, **!=**, **>** and **>=** operators to have
numeric arguments had been read into this TIP. It appears that there
is far too much outstanding code that is written like:

    if \{$x == "somestring"\} \{ ... \}

to have the more radical option be viable.

One possible alternative to excluding non-numeric arguments from the
comparison operators is to change their semantics so that all non-numeric
strings are greater than all numbers. This change would at least yield a
consistent ordering. The ordering that it yields would, however, be somewhat
surprising, and not terribly useful. \(It would at least be compatible with
today's scheme for numeric comparisons.\)

## Objections \(and rebuttals\)

In out-of-band discussions, several objections were raised. This section
attempts to address them.

   1. _Tcl's expression parser has a hard limit of 64 different binary
      operators. This proposal consumes four of them, leaving only 28. There
      is a concern that this is a less-than-effective use of a limited
      resource._

	    > The limit is self-imposed, in an effort to make the nodes of an
      expression parse tree fit in exactly 16 bytes \(or four int's\). It is far
      from obvious that this pretty size is actually useful. Few expressions
      are more than a few dozen parse nodes, and typical expressions are not
      parsed multiple times. It appears that neither the speed of the parse
      nor the size of the tree will be critical issues in most applications.
      In any case, we still have nearly half the operators left.

   2. _There is some concern that using barewords for operators was a bad
      idea in the first place._ The fact that

		 expr {"foo"}

	    > and

		 set x foo; expr {$x}

	     > both work, while

		 expr {foo}

	     > is an invalid bareword is arguably surprising.

	    > Nevertheless, we have committed to the approach with the **eq**,
      **ne**, **in** and **ni** operators. These are unlikely to go
      away. Adding **lt**, **le**, **gt** and **ge** will make this
      problem no better nor worse.

	    > Moreover, the language of [expr] is not the same as Tcl. It does not
      strip comments, parse into words, and apply Tcl's precise substitution
      rules - and it would be surprising if it did!  There are other "little
      languages" throughout Tcl - regular expressions, glob patterns, assembly
      code, and so on. [expr] is one among many.

   3. _There is concern that [expr], which was originally intended almost
      exclusively for numeric calculations, is being abused with string
      arguments and possibly string results._

	    > The author of this TIP contends that we introduced string values to
      [expr] a long time ago, certainly by the time that the **eq**,
      **ne**, **in** and **ni** operations were introduced.  It is true
      that the use of numeric conversions in [expr] is incoherent, as seen
      in:

		   % proc tcl::mathfunc::cat {args} { join $args {} }
		   % expr {cat(0x1,0x2,"a")}
		   0x10x2a
		   % expr {cat(0x1)}

		   1

	    > \(Bug [e7c21ed678] is another manifestation of this general
      problem.\) Once again, adding additional string operations
      that behave, with respect to data types, exactly the same
      as ones that are already there will neither fix nor exacerbate
      the general problem.

   4. _Because [expr] has no interpreted form, the operations must have
      bytecode representations. The space of available bytecodes is under even
      more pressure than the space of available operators, and must not be
      squandered on operations that are duplicative of already-available
      functionality such as [string compare]._

	    > The obvious rebuttal is that [string compare] is already bytecoded.
      There are no new operations required, merely a compiler that is smart
      enough to emit a short codeburst rather than a single bytecode. As an
      example, the code for the expression

		   {$x lt $y}

	    > could
      be:

		   (0) loadScalar1 %v0        # var "x"
		   (2) loadScalar1 %v1        # var "y"
		   (4) strcmp 
		   (5) push1 0        # "0"
		   (7) lt 

	    > For the other string operators, only the last bytecode in the burst
      would change.  No new bytecode operations are needed. In fact, this
      codeburst is identical code to that generated for the expression

		   {[string compare $x $y] < 0}

# Copyright

This document has been placed in the public domain.

Name change from tip/462.tip to tip/462.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145

TIP:            462
Title:          Add New [info ps] Ensemble for Subprocess Management
Version:        $Revision: 1.4 $
Author:         Fr�d�ric Bonnet <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        23-Jan-2017
Post-History:   
Tcl-Version:    8.7

~ Abstract

This TIP proposes to improve Tcl's handling of subprocesses created by the 
'''exec''' and '''open''' commands by adding a new '''::tcl::process''' ensemble.

~ Rationale

This TIP is inspired by a https://github.com/flightaware/Tcl-bounties#stop-tcl-from-eating-child-process-exit-status-gratuitously%|%request from FlightAware%|% to fix the way Tcl currently
handles child process exit status. 

Subprocess creation can be either synchronous or asynchronous. In either case, 
a children with a non-zero return value indicates an error condition that is
bubbled up to the Tcl error handling mechanism. 

~~ Synchronous subprocesses

Synchronous subprocesses are created using the '''exec''' command with no '''&'''
terminal argument. Errors are raised synchronously as well.

~~ Asynchronous subprocesses

Asynchronous subprocesses can be created using two distinct methods:

   * '''exec''' command with a '''&''' terminal argument. In this case the command returns immediately with a list of the PIDs for all the subprocesses in the pipeline.

   * '''open "| command"'''. In this case the command returns immediately with the channel id of the pipe (hereafter '''$ch'''). The subprocess IDs are given by the '''[pid $ch]''' command. The subprocess status code and error conditions are processed upon channel closure with the '''[close $ch]'''.

~~ Error handling and status code

Errors are caught with the '''catch''' and '''try'''  commands, with status 
codes given in the '''-errorcode''' options dictionary entry and the 
'''errorCode''' global variable in the form '''{CHILDKILLED pid sigName msg}''' / '''{CHILDSTATUS pid code}''' / '''{CHILDSUSP pid sigName msg}'''. 

~~ C-level access

The Tcl library provides the following procedures for managing subprocesses (excerpts from the Tcl documentation):

   * '''Tcl_DetachPids''' may be called to ask Tcl to take responsibility for one or more processes whose process ids are contained in the pidPtr array passed as argument. The caller presumably has started these processes running in background and does not want to have to deal with them again.

   * '''Tcl_ReapDetachedProcs''' invokes the '''waitpid''' kernel call on each of the background processes so that its state can be cleaned up if it has exited. If the process has not exited yet, '''Tcl_ReapDetachedProcs''' does not wait for it to exit; it will check again the next time it is invoked. Tcl automatically calls '''Tcl_ReapDetachedProcs''' each time the exec command is executed, so in most cases it is not necessary for any code outside of Tcl to invoke Tcl_ReapDetachedProcs. However, if you call '''Tcl_DetachPids''' in situations where the exec command may never get executed, you may wish to call '''Tcl_ReapDetachedProcs''' from time to time so that background processes can be cleaned up.

   * '''Tcl_WaitPid''' is a thin wrapper around the facilities provided by the operating system to wait on the end of a spawned process and to check a whether spawned process is still running. It is used by '''Tcl_ReapDetachedProcs''' and the channel system to portably access the operating system.

Moreover, '''Tcl_WaitPid''' is blocking unless called with the '''WNOHANG''' option.

~~ Limitations

The current implementation is lacking several key features:

   * There is no way to get subprocess status other than through the error handling mechanism.

   * Consequently, there is no way to collect the status code of a asychronous subprocess created with the '''exec &''' method because such commands don't raise errors once the subprocesses are launched.

   * There is no non-blocking way to query asynchronous subprocess status codes; '''catch'''/'''try''' upon '''open |''' pipe closure is blocking.

   * Moreover, '''exec''' and '''open''' call '''Tcl_ReapDetachedProcs''', thereby cleaning up all pending information on terminated subprocesses. This prevents any advanced subprocess monitoring at the script level.

   * While reasonable in the general case, a non-zero return value does not always indicates an error condition for all kinds of programs, so it is desirable to provide a subprocess-specific mechanism that does not rely on Tcl's standard error handling facility.

~ Specifications

A new '''::tcl::process''' will be created:

   '''::tcl::process''' ''subcommand ?arg ...'': Subprocess management.

The following ''subcommand'' values are supported by '''::tcl::process''': 

   * '''::tcl::process list''': Returns the list of subprocess PIDs.

   * '''::tcl::process status''' ''?switches? ?pids?'': Returns a dictionary mapping subprocess PIDs to their respective statuses. If ''pids'' is specified as a list of PIDs then the command only returns the status of the matching subprocesses if they exist, and raises an error otherwise. The status value uses the same format as the '''errorCode''' global variable for terminated processes; for active processes an empty value is returned. Under the hood this command calls '''Tcl_WaitPid''' with the '''WNOHANG''' flag set for non-blocking behavior. 

   * '''::tcl::process purge''' ''?pids?'': Cleans up all data associated with terminated subprocesses. If ''pids'' is specified as a list of PIDs then the command only cleanup data for the matching subprocesses if they exist, and raises an error otherwise. If the process is still active then it does nothing.

   * '''::tcl::process autopurge''' ''?flag?'': Automatic purge facility. If ''flag'' is specified as a boolean value then it activates or deactivate autopurge. In all cases it returns the current status as a boolean value. When autopurge is active, '''Tcl_ReapDetachedProcs''' is called each time the '''exec''' command is executed or a pipe channel created by '''open''' is closed. When autopurge is inactive, '''::tcl::process purge''' must be called explicitly. By default autopurge is active and replicates the current Tcl behavior.

Additionally, '''::tcl::process status''' accepts the following switches:

   * '''-wait''': By default the command returns immediately (the underlying '''Tcl_WaitPid''' is called with the '''WNOHANG''' flag set) unless this switch is set. If ''pids'' is specified as a list of PIDs then the command waits until the matching subprocess statuses are available. If ''pids'' is not specified then it waits for all known subprocesses.

   * '''--''': Marks the end of switches. The argument following this one will be treated as the first arg even if it starts with a -. 

~ Examples

|% ::tcl::process autopurge
|true
|% ::tcl::process autopurge false
|false
|
|% set pid1 [exec command1 a b c | command2 d e f &]
|123 456
|% set chan [open "|command1 a b c | command2 d e f"]
|file123
|% set pid2 [pid $chan]
|789 1011
|
|% ::tcl::process list
|123 456 789 1011
|
|% ::tcl::process status
|123 {CHILDSTATUS 123 0} 456 {CHILDKILLED 456 SIGPIPE "write on pipe with no readers"} 789 {CHILDSUSP 789 SIGTTIN "background tty read"} 1011 {}
|
|% ::tcl::process status 123
|123 {CHILDSTATUS 123 0}
|
|% ::tcl::process status 1011
|1011 {}
|
|% ::tcl::process status -wait
|123 {CHILDSTATUS 123 0} 456 {CHILDKILLED 456 SIGPIPE "write on pipe with no readers"} 789 {CHILDSUSP 789 SIGTTIN "background tty read"} 1011 {CHILDSTATUS 1011 -1}
|
|% ::tcl::process status 1011
|1011 {CHILDSTATUS 1011 -1}
|
|% ::tcl::process purge
|% exec command1 1 2 3 &
|1213
|% ::tcl::process list
|1213

~ Rejected Alternatives

The first version proposed to implement the feature as a new '''ps''' option to
the existing '''info''' command. However, almost all operations in '''[info]'''
are things that just examine state, not change it, and that's a 
principle-of-least-astonishment that should be upheld for the sake of less 
experienced users.

~ Reference implementation

TBD

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145

# TIP 462: Add New [info ps] Ensemble for Subprocess Management

	Author:         Frédéric Bonnet <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        23-Jan-2017
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes to improve Tcl's handling of subprocesses created by the 
**exec** and **open** commands by adding a new **::tcl::process** ensemble.

# Rationale

This TIP is inspired by a <https://github.com/flightaware/Tcl-bounties\#stop-tcl-from-eating-child-process-exit-status-gratuitously%\|%request> from FlightAware%\|% to fix the way Tcl currently
handles child process exit status. 

Subprocess creation can be either synchronous or asynchronous. In either case, 
a children with a non-zero return value indicates an error condition that is
bubbled up to the Tcl error handling mechanism. 

## Synchronous subprocesses

Synchronous subprocesses are created using the **exec** command with no **&**
terminal argument. Errors are raised synchronously as well.

## Asynchronous subprocesses

Asynchronous subprocesses can be created using two distinct methods:

   * **exec** command with a **&** terminal argument. In this case the command returns immediately with a list of the PIDs for all the subprocesses in the pipeline.

   * **open "\| command"**. In this case the command returns immediately with the channel id of the pipe \(hereafter **$ch**\). The subprocess IDs are given by the **[pid $ch]** command. The subprocess status code and error conditions are processed upon channel closure with the **[close $ch]**.

## Error handling and status code

Errors are caught with the **catch** and **try**  commands, with status 
codes given in the **-errorcode** options dictionary entry and the 
**errorCode** global variable in the form **\{CHILDKILLED pid sigName msg\}** / **\{CHILDSTATUS pid code\}** / **\{CHILDSUSP pid sigName msg\}**. 

## C-level access

The Tcl library provides the following procedures for managing subprocesses \(excerpts from the Tcl documentation\):

   * **Tcl\_DetachPids** may be called to ask Tcl to take responsibility for one or more processes whose process ids are contained in the pidPtr array passed as argument. The caller presumably has started these processes running in background and does not want to have to deal with them again.

   * **Tcl\_ReapDetachedProcs** invokes the **waitpid** kernel call on each of the background processes so that its state can be cleaned up if it has exited. If the process has not exited yet, **Tcl\_ReapDetachedProcs** does not wait for it to exit; it will check again the next time it is invoked. Tcl automatically calls **Tcl\_ReapDetachedProcs** each time the exec command is executed, so in most cases it is not necessary for any code outside of Tcl to invoke Tcl\_ReapDetachedProcs. However, if you call **Tcl\_DetachPids** in situations where the exec command may never get executed, you may wish to call **Tcl\_ReapDetachedProcs** from time to time so that background processes can be cleaned up.

   * **Tcl\_WaitPid** is a thin wrapper around the facilities provided by the operating system to wait on the end of a spawned process and to check a whether spawned process is still running. It is used by **Tcl\_ReapDetachedProcs** and the channel system to portably access the operating system.

Moreover, **Tcl\_WaitPid** is blocking unless called with the **WNOHANG** option.

## Limitations

The current implementation is lacking several key features:

   * There is no way to get subprocess status other than through the error handling mechanism.

   * Consequently, there is no way to collect the status code of a asychronous subprocess created with the **exec &** method because such commands don't raise errors once the subprocesses are launched.

   * There is no non-blocking way to query asynchronous subprocess status codes; **catch**/**try** upon **open \|** pipe closure is blocking.

   * Moreover, **exec** and **open** call **Tcl\_ReapDetachedProcs**, thereby cleaning up all pending information on terminated subprocesses. This prevents any advanced subprocess monitoring at the script level.

   * While reasonable in the general case, a non-zero return value does not always indicates an error condition for all kinds of programs, so it is desirable to provide a subprocess-specific mechanism that does not rely on Tcl's standard error handling facility.

# Specifications

A new **::tcl::process** will be created:

   **::tcl::process** _subcommand ?arg ..._: Subprocess management.

The following _subcommand_ values are supported by **::tcl::process**: 

   * **::tcl::process list**: Returns the list of subprocess PIDs.

   * **::tcl::process status** _?switches? ?pids?_: Returns a dictionary mapping subprocess PIDs to their respective statuses. If _pids_ is specified as a list of PIDs then the command only returns the status of the matching subprocesses if they exist, and raises an error otherwise. The status value uses the same format as the **errorCode** global variable for terminated processes; for active processes an empty value is returned. Under the hood this command calls **Tcl\_WaitPid** with the **WNOHANG** flag set for non-blocking behavior. 

   * **::tcl::process purge** _?pids?_: Cleans up all data associated with terminated subprocesses. If _pids_ is specified as a list of PIDs then the command only cleanup data for the matching subprocesses if they exist, and raises an error otherwise. If the process is still active then it does nothing.

   * **::tcl::process autopurge** _?flag?_: Automatic purge facility. If _flag_ is specified as a boolean value then it activates or deactivate autopurge. In all cases it returns the current status as a boolean value. When autopurge is active, **Tcl\_ReapDetachedProcs** is called each time the **exec** command is executed or a pipe channel created by **open** is closed. When autopurge is inactive, **::tcl::process purge** must be called explicitly. By default autopurge is active and replicates the current Tcl behavior.

Additionally, **::tcl::process status** accepts the following switches:

   * **-wait**: By default the command returns immediately \(the underlying **Tcl\_WaitPid** is called with the **WNOHANG** flag set\) unless this switch is set. If _pids_ is specified as a list of PIDs then the command waits until the matching subprocess statuses are available. If _pids_ is not specified then it waits for all known subprocesses.

   * **--**: Marks the end of switches. The argument following this one will be treated as the first arg even if it starts with a -. 

# Examples

	% ::tcl::process autopurge
	true
	% ::tcl::process autopurge false
	false

	% set pid1 [exec command1 a b c | command2 d e f &]
	123 456
	% set chan [open "|command1 a b c | command2 d e f"]
	file123
	% set pid2 [pid $chan]
	789 1011

	% ::tcl::process list
	123 456 789 1011

	% ::tcl::process status
	123 {CHILDSTATUS 123 0} 456 {CHILDKILLED 456 SIGPIPE "write on pipe with no readers"} 789 {CHILDSUSP 789 SIGTTIN "background tty read"} 1011 {}

	% ::tcl::process status 123
	123 {CHILDSTATUS 123 0}

	% ::tcl::process status 1011
	1011 {}

	% ::tcl::process status -wait
	123 {CHILDSTATUS 123 0} 456 {CHILDKILLED 456 SIGPIPE "write on pipe with no readers"} 789 {CHILDSUSP 789 SIGTTIN "background tty read"} 1011 {CHILDSTATUS 1011 -1}

	% ::tcl::process status 1011
	1011 {CHILDSTATUS 1011 -1}

	% ::tcl::process purge
	% exec command1 1 2 3 &
	1213
	% ::tcl::process list
	1213

# Rejected Alternatives

The first version proposed to implement the feature as a new **ps** option to
the existing **info** command. However, almost all operations in **[info]**
are things that just examine state, not change it, and that's a 
principle-of-least-astonishment that should be upheld for the sake of less 
experienced users.

# Reference implementation

TBD

# Copyright

This document has been placed in the public domain.

Name change from tip/463.tip to tip/463.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97

TIP:		463
Title:		Command-Driven Substitutions for regsub
State:		Final
Type:		Project
Tcl-Version:	8.7
Vote:		Done
Post-History:	
Version:	$Revision: 1.6 $
Author:		Donal Fellows <[email protected]>
Created:	11-Feb-2017
Keywords:	Tcl, regular expression

~ Abstract

The '''regsub''' command can only do substitutions of a limited complexity.
This TIP adds an option to generate substitution text using another Tcl
command, allowing a more complex range of substitutions to be performed easily
and safely.

~ Rationale and Outline Proposal

Many scripts wish to perform subsitutions on a string where the text to be
substituted can be described by a regular expression, but where the text to be
substituted in cannot easily be generated by the '''regsub''' command. There
are workarounds for this, as seen in this example (from the Wiki):

| set text [subst [regsub -all {[a-zA-Z]} [\
|     regsub -all "\[\[$\\\\\]" $text {\\&}] {[
|         set c [scan & %c]
|         format %c [expr {$c\&96|(($c\&31)+12)%26+1}]
|     ]}]]

But it is not at all trivial to write such things! Instead, we should be able
to do this:

| set text [regsub -all -command {[a-zA-Z]} $text {apply {c {
|     scan $c %c c
|     format %c [expr {$c&96|(($c&31)+12)%26+1}]
| }}}]

It's going to be both safer (as there's no required non-obvious metadata
defanging preprocessing step) and faster (as we can do this as a command call
rather than a '''subst''' that needs separate bytecode compilation).

The parallels with Perl's "e" flag to its regular expression substitution
operator should be obvious.

~ Proposed Change

My proposal is that we add a flag to the '''regsub''' command, '''-command''',
that changes the interpretation and processing of the substitution argument.
When the flag is passed, instead of that argument being a string that is
processed for '''&''' and backslash-number sequences, it is instead
interpreted as a command prefix; the various captured substrings (minimally
the entire string passed in, but also any captured substrings specified in the
RE) will become extra arguments added, and the result will be evaluated and
the result of that evaluation will be used as the string to substitute in. If
the '''-all''' option is not given, the substitution command will be called at
most once, whereas if '''-all''' is given, the substitution command will be
called for as many times as the regular expression matches. The indices in the
original script that matched will not be available.

Non-OK results will be passed through to the surrounding script.

Substitutions too complex to be described by a simple command can be done by
using a procedure or '''apply'''/lambda-term (as in the example above). The
arguments received by the command invoked by '''regsub -command''' will be
exactly the substrings that were matched, with no other substitutions
performed on them.

~~ Examples

The command:

| regsub -all -command {\w} "ab-cd-ef-gh" {  puts  }

will give '''---''' as its result and print the letters '''a''' to '''h''',
one per line in that order.

The command:

| regsub -command {\W(\W)} "ab cd,{ef gh,} ij" {apply {{x y} {
|     scan $y %c c
|     format %%%02x $c
| }}}

will produce this result:

| ab cd%7bef gh,} ij

~ Implementation

http://core.tcl.tk/tcl/timeline?r=tip-463

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
|
>

|

|

|

|
|

|
|
|
|
|

|
|
|
|

|
|
|

|

|

|
|

|

|
|

|
|

|

|

|

|
|
|
|

|

|

|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97

# TIP 463: Command-Driven Substitutions for regsub
	State:		Final
	Type:		Project
	Tcl-Version:	8.7
	Vote:		Done
	Post-History:	

	Author:		Donal Fellows <[email protected]>
	Created:	11-Feb-2017
	Keywords:	Tcl, regular expression
-----

# Abstract

The **regsub** command can only do substitutions of a limited complexity.
This TIP adds an option to generate substitution text using another Tcl
command, allowing a more complex range of substitutions to be performed easily
and safely.

# Rationale and Outline Proposal

Many scripts wish to perform subsitutions on a string where the text to be
substituted can be described by a regular expression, but where the text to be
substituted in cannot easily be generated by the **regsub** command. There
are workarounds for this, as seen in this example \(from the Wiki\):

	 set text [subst [regsub -all {[a-zA-Z]} [\
	     regsub -all "\[\[$\\\\\]" $text {\\&}] {[
	         set c [scan & %c]
	         format %c [expr {$c\&96|(($c\&31)+12)%26+1}]
	     ]}]]

But it is not at all trivial to write such things! Instead, we should be able
to do this:

	 set text [regsub -all -command {[a-zA-Z]} $text {apply {c {
	     scan $c %c c
	     format %c [expr {$c&96|(($c&31)+12)%26+1}]
	 }}}]

It's going to be both safer \(as there's no required non-obvious metadata
defanging preprocessing step\) and faster \(as we can do this as a command call
rather than a **subst** that needs separate bytecode compilation\).

The parallels with Perl's "e" flag to its regular expression substitution
operator should be obvious.

# Proposed Change

My proposal is that we add a flag to the **regsub** command, **-command**,
that changes the interpretation and processing of the substitution argument.
When the flag is passed, instead of that argument being a string that is
processed for **&** and backslash-number sequences, it is instead
interpreted as a command prefix; the various captured substrings \(minimally
the entire string passed in, but also any captured substrings specified in the
RE\) will become extra arguments added, and the result will be evaluated and
the result of that evaluation will be used as the string to substitute in. If
the **-all** option is not given, the substitution command will be called at
most once, whereas if **-all** is given, the substitution command will be
called for as many times as the regular expression matches. The indices in the
original script that matched will not be available.

Non-OK results will be passed through to the surrounding script.

Substitutions too complex to be described by a simple command can be done by
using a procedure or **apply**/lambda-term \(as in the example above\). The
arguments received by the command invoked by **regsub -command** will be
exactly the substrings that were matched, with no other substitutions
performed on them.

## Examples

The command:

	 regsub -all -command {\w} "ab-cd-ef-gh" {  puts  }

will give **---** as its result and print the letters **a** to **h**,
one per line in that order.

The command:

	 regsub -command {\W(\W)} "ab cd,{ef gh,} ij" {apply {{x y} {
	     scan $y %c c
	     format %%%02x $c
	 }}}

will produce this result:

	 ab cd%7bef gh,} ij

# Implementation

<http://core.tcl.tk/tcl/timeline?r=tip-463>

# Copyright

This document has been placed in the public domain.

Name change from tip/464.tip to tip/464.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

TIP:            464
Title:          Support for Multimedia Keys on Windows
Version:        $Revision: 1.6 $
Author:         Ralf Fassel <[email protected]>
Author:         Andreas Leitgeb <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        28-Jan-2017
Post-History:   
Keywords:       Tk,keyboard,keycode
Tcl-Version:    8.5

~ Abstract

This TIP proposes adding support for the multimedia keys present on many
modern keyboards.

~ Rationale

Tk is lacking support for the multimedia keys as described on
https://msdn.microsoft.com/en-us/library/windows/desktop/dd375731%28v=vs.85%29.aspx

|VK_VOLUME_DOWN        0xAE   Volume Down key
|VK_VOLUME_UP          0xAF   Volume Up key
|VK_MEDIA_NEXT_TRACK   0xB0   Next Track key
|VK_MEDIA_PREV_TRACK   0xB1   Previous Track key
|VK_MEDIA_STOP         0xB2   Stop Media key
|VK_MEDIA_PLAY_PAUSE   0xB3   Play/Pause Media key

Linux supports these as, e.g., XF86AudioPlay, XF86AudioPrev, XF86AudioNext.
Tk should support them as well so that application programmers can make use of
the keys as appropriate.

Because this is driven by changing external circumstances, it is propsed that
this TIP be backported to all future-releaseable versions of Tk (i.e., 8.5
onwards).

~ Proposal

The table of supported keys should be extended to include the following named
keys:

 * '''XF86AudioLowerVolume''' - the volume-down key

 * '''XF86AudioMute''' - the volume-mute key

 * '''XF86AudioNext''' - the next-track key

 * '''XF86AudioPlay''' - the start-playback key

 * '''XF86AudioPrev''' - the previous-track key

 * '''XF86AudioRaiseVolume''' - the volume-up key

 * '''XF86AudioStop''' - the stop-playback key

The above list does not imply any ordering in the implementation.

~ Implementation

The support can be added by extending some keymapping lookup tables in Tk.

A Ticket already exists with a proposed patch
[http://core.tcl.tk/tk/tktview/499526180d6cd5ca7c02eed96c10e9d3630a807c] and
fvogel has also created a branch
[http://core.tcl.tk/tk/timeline?r=tip-464].

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|
|
|
|
|
|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

# TIP 464: Support for Multimedia Keys on Windows

	Author:         Ralf Fassel <[email protected]>
	Author:         Andreas Leitgeb <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        28-Jan-2017
	Post-History:   
	Keywords:       Tk,keyboard,keycode
	Tcl-Version:    8.5
-----

# Abstract

This TIP proposes adding support for the multimedia keys present on many
modern keyboards.

# Rationale

Tk is lacking support for the multimedia keys as described on
<https://msdn.microsoft.com/en-us/library/windows/desktop/dd375731%28v=vs.85%29.aspx>

	VK_VOLUME_DOWN        0xAE   Volume Down key
	VK_VOLUME_UP          0xAF   Volume Up key
	VK_MEDIA_NEXT_TRACK   0xB0   Next Track key
	VK_MEDIA_PREV_TRACK   0xB1   Previous Track key
	VK_MEDIA_STOP         0xB2   Stop Media key
	VK_MEDIA_PLAY_PAUSE   0xB3   Play/Pause Media key

Linux supports these as, e.g., XF86AudioPlay, XF86AudioPrev, XF86AudioNext.
Tk should support them as well so that application programmers can make use of
the keys as appropriate.

Because this is driven by changing external circumstances, it is propsed that
this TIP be backported to all future-releaseable versions of Tk \(i.e., 8.5
onwards\).

# Proposal

The table of supported keys should be extended to include the following named
keys:

 * **XF86AudioLowerVolume** - the volume-down key

 * **XF86AudioMute** - the volume-mute key

 * **XF86AudioNext** - the next-track key

 * **XF86AudioPlay** - the start-playback key

 * **XF86AudioPrev** - the previous-track key

 * **XF86AudioRaiseVolume** - the volume-up key

 * **XF86AudioStop** - the stop-playback key

The above list does not imply any ordering in the implementation.

# Implementation

The support can be added by extending some keymapping lookup tables in Tk.

A Ticket already exists with a proposed patch
<http://core.tcl.tk/tk/tktview/499526180d6cd5ca7c02eed96c10e9d3630a807c>  and
fvogel has also created a branch
<http://core.tcl.tk/tk/timeline?r=tip-464> .

# Copyright

This document has been placed in the public domain.

Name change from tip/465.tip to tip/465.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103

TIP:            465
Title:          Change Rule 8 of the Dodekalogue to Cut Some Corner Cases
Version:        $Revision: 1.2 $
Author:         Andreas Leitgeb <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        03-Mar-2017
Post-History:   
Tcl-Version:    8.7

~ Abstract

This TIP proposes to make '''$'''-substitution more conforming to naive
expectations and just rule out certain odd-ball uses that can safely be
assumed to not appear in serious use, but only in crafted examples "serving"
for confusion, or as accidentally legal interpretation of mistyped Tcl code.

~ Rationale

Back in the days where '''$'''-substitution was added to Tcl, it was designed
to be as syntactically simple as possible. Back then, Tcl was still an
interpreted language, so optimising parse time was top priority.  For this, it
was designed that a sequence starting with '''${''' would end at the '''next
close brace''' no matter how many open braces or backslashes are passed by,
and '''$arr(''' would end at the '''next close paren''' no matter how many
open parens are found before.  This enables odd-ball corner cases that work in
an interactive shell at top level, as tried by newbies:

|   set "{{{" 42; puts ${{{{}

but fail within a braced block, unless a comment like:

|   # }}} }}}

follows within the same block. These are just strange parts of Tcl, that
nobody can seriously claim to use for good.

Another surprising part is with arrays, where parens are treated
asymmetrically, in that any number of bare open parens, but also quote chars
or braces may be part of the index in a '''$arr(...)''' substitution, but
first bare close paren terminates the token.  Quote characters or braces have
no significance, apart from that bare close braces might pre-maturely finish
the enclosing braced block, which may only be evident to seasoned Tclers - and
not even always to them.

The final motivation for writing up this TIP came while discussing one part of
[282]: assignment to array elements.

An informal poll showed a clear preference towards bare '''array(...)'''
naming on left hand side of proposed assignment, without sympathy for any need
of explicit disambiguation by quoting or tagging.

Generally disallowing bare open parens, quotes and braces within array indices
would mean that array indices on left hand side of an assignment could follow
same rules as on right hand side, and parsing an array by these new rules
would make sure, that where parsing as a function call and parsing as an array
are both successful, then both parses would end up consuming the same portion
of the expression body - a prerequisite for making a sound decision about
following assignment operator.

Without having array parses and function parses agreeing on close paren, then
it is possible that parsing as an array will see a trailing assignment
operator that would otherwise have been nested in a subexpression, or even
part of a quoted literal value.

Because of the low expected impact on real code, a target of 8.7 is considered
feasible.

~ Implementation

A full implementation of this TIP is now checked in on branch ''tip-465''.

~ Alternatives

The following points show alternatives that would make sense, but would make
the currently rather simple implementation of this TIP ways more complicated:

 * Allow bare parens in array indices if properly paired. Quotes and braces
   are still disallowed (even if paired), to avoid cases of bad nesting:
   '''{(})'''.  This might save users two backslashes in some rare cases where
   a close paren in the index is already backslash-escaped.

 * Specifically add backslash-quoting to the "body" of '''${...}'''.  After
   all, the "body" is a variable name, and not a nested structure like most
   braced words in Tcl. This would make some odd variable names once again
   possible, but now in a consistent syntax that doesn't affect enclosing
   blocks.

 * '''${...}''' syntax could also be further restricted by disallowing open
   braces and final backslashes, just to enforce "well-behaving" tokens.

 * A much stricter alternative would disallow unbalanced braces even within
   "-quoted and unquoted literals. This would disallow common but dangerous
   idioms like ''append var "{"'', which may be followed by an ''append var
   "}"'' in the same block and work, until one of these two commands gets
   moved into a nested block. The correct and safe way is, of course,
   backslash-escaping bare braces within string literals.  Good Code(tm)
   wouldn't be affected by this alternative change.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|
|
|

|

|

|

|

|

|

|

|

|
|

|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103

# TIP 465: Change Rule 8 of the Dodekalogue to Cut Some Corner Cases

	Author:         Andreas Leitgeb <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        03-Mar-2017
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes to make **$**-substitution more conforming to naive
expectations and just rule out certain odd-ball uses that can safely be
assumed to not appear in serious use, but only in crafted examples "serving"
for confusion, or as accidentally legal interpretation of mistyped Tcl code.

# Rationale

Back in the days where **$**-substitution was added to Tcl, it was designed
to be as syntactically simple as possible. Back then, Tcl was still an
interpreted language, so optimising parse time was top priority.  For this, it
was designed that a sequence starting with **$\{** would end at the **next
close brace** no matter how many open braces or backslashes are passed by,
and **$arr\(** would end at the **next close paren** no matter how many
open parens are found before.  This enables odd-ball corner cases that work in
an interactive shell at top level, as tried by newbies:

	   set "{{{" 42; puts ${{{{}

but fail within a braced block, unless a comment like:

	   # }}} }}}

follows within the same block. These are just strange parts of Tcl, that
nobody can seriously claim to use for good.

Another surprising part is with arrays, where parens are treated
asymmetrically, in that any number of bare open parens, but also quote chars
or braces may be part of the index in a **$arr\(...\)** substitution, but
first bare close paren terminates the token.  Quote characters or braces have
no significance, apart from that bare close braces might pre-maturely finish
the enclosing braced block, which may only be evident to seasoned Tclers - and
not even always to them.

The final motivation for writing up this TIP came while discussing one part of
[[282]](282.md): assignment to array elements.

An informal poll showed a clear preference towards bare **array\(...\)**
naming on left hand side of proposed assignment, without sympathy for any need
of explicit disambiguation by quoting or tagging.

Generally disallowing bare open parens, quotes and braces within array indices
would mean that array indices on left hand side of an assignment could follow
same rules as on right hand side, and parsing an array by these new rules
would make sure, that where parsing as a function call and parsing as an array
are both successful, then both parses would end up consuming the same portion
of the expression body - a prerequisite for making a sound decision about
following assignment operator.

Without having array parses and function parses agreeing on close paren, then
it is possible that parsing as an array will see a trailing assignment
operator that would otherwise have been nested in a subexpression, or even
part of a quoted literal value.

Because of the low expected impact on real code, a target of 8.7 is considered
feasible.

# Implementation

A full implementation of this TIP is now checked in on branch _tip-465_.

# Alternatives

The following points show alternatives that would make sense, but would make
the currently rather simple implementation of this TIP ways more complicated:

 * Allow bare parens in array indices if properly paired. Quotes and braces
   are still disallowed \(even if paired\), to avoid cases of bad nesting:
   **\{\(\}\)**.  This might save users two backslashes in some rare cases where
   a close paren in the index is already backslash-escaped.

 * Specifically add backslash-quoting to the "body" of **$\{...\}**.  After
   all, the "body" is a variable name, and not a nested structure like most
   braced words in Tcl. This would make some odd variable names once again
   possible, but now in a consistent syntax that doesn't affect enclosing
   blocks.

 * **$\{...\}** syntax could also be further restricted by disallowing open
   braces and final backslashes, just to enforce "well-behaving" tokens.

 * A much stricter alternative would disallow unbalanced braces even within
   "-quoted and unquoted literals. This would disallow common but dangerous
   idioms like _append var "\{"_, which may be followed by an _append var
   "\}"_ in the same block and work, until one of these two commands gets
   moved into a nested block. The correct and safe way is, of course,
   backslash-escaping bare braces within string literals.  Good Code\(tm\)
   wouldn't be affected by this alternative change.

# Copyright

This document has been placed in the public domain.

Name change from tip/466.tip to tip/466.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
TIP:            466
Title:          Revised Implementation of the Text Widget
Version:        $Revision: 1.11 $
Author:         Fran�ois Vogel <[email protected]>
Author:         Gregor Cramer <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        10-Mar-2017
Post-History:   
Keywords:       Tk,text widget
Tcl-Version:    8.7

~ Abstract

This TIP proposes the replacement of the current implementation of the text
widget (the "legacy" text widget) by a revised implementation offering a large
number of advantages.

~ Rationale

The Tk text widget has become increasingly complex as long as incremental
improvements and features have been added from time to time. In that process,
some known long-standing issues have become very difficult to tackle, for
instance the long line problem regarding lack of performance.

Gregor Cramer, in the process of using the text widget in one of his
<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26

# TIP 466: Revised Implementation of the Text Widget

	Author:         François Vogel <[email protected]>
	Author:         Gregor Cramer <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        10-Mar-2017
	Post-History:   
	Keywords:       Tk,text widget
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes the replacement of the current implementation of the text
widget \(the "legacy" text widget\) by a revised implementation offering a large
number of advantages.

# Rationale

The Tk text widget has become increasingly complex as long as incremental
improvements and features have been added from time to time. In that process,
some known long-standing issues have become very difficult to tackle, for
instance the long line problem regarding lack of performance.

Gregor Cramer, in the process of using the text widget in one of his

︙ ︙ 
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478

 * A large number of new features

 * Numerous bug fixes

 * Very few incompatibilities with the legacy text widget

~ Proposal

The proposal is to replace the legacy code with the new implementation.

The author of the revised implementation has written a well documented website
[http://scidb.sourceforge.net/tk/revised-text-widget.html]
describing in details the issues with the legacy code, how he fixed these
issues, and what features he has changed or improved.

It was not deemed feasible nor necessary to copy/paste/reformat all the
information of the above website into the present TIP. Only the new features
and incompatibilities are highlighted here, as opposed to detailed rationales
about each change.

A version of the '''text''' man page, consistent with the changes and improvements proposed by the present TIP, can be seen at 
http://scidb.sourceforge.net/tk/text.html
This version of the man page is colorized, with blue meaning "changed", and green meaning "new", so that it is easier to spot what's different from the legacy text widget.

~~ Performance Improvements

Detailed performance comparison between legacy code and revised code can be
found at [http://scidb.sourceforge.net/tk/comparison.html] but these are the
key points:

 * Long line problem, especially with many tags, is eliminated: in general
   only O(N log N) in revised version, was a higher order polynomial time in
   legacy code.

 * Display is faster: smoother scrolling, faster response time.
   [http://scidb.sourceforge.net/tk/display.html]

 * Undo/redo is much faster: a completely new implementation has been worked
   out, directly working on the text segments.
   [http://scidb.sourceforge.net/tk/undo.html]

~~ New Features

Detailed explanations and rationales for each of the items below can be found
at [http://scidb.sourceforge.net/tk/revised-text-widget.html]

 * Undo/redo is handling tags (this was requesteed in Issue #1561991 and
   Issue #1027741, embedded images, embedded windows, and also marks if option
   ''-steadymarks'' is enabled

 * Additional widget state '''readonly'''

 * Hyphenation and full-justification support

 > * Support of hyphenation (Issue #1096580 is fixed), with new helper
      functions '''tk_textinsert''' and '''tk_textReplace''', and with new
      switches to ''pathName'' '''count''', ''pathName'' '''get''', and
      ''pathName'' '''search'''

 > * Additional justification mode '''full'''

 > * Additional wrap mode '''codepoint''', and new widget option
   '''-useunibreak'''. New subcommand ''pathName'' '''brks'''

 > * Additional option '''-lang''' used to guide hyphenation engines.

 * Additional subcommands:

 > * ''pathName'' '''checksum'''

 > * ''pathName'' '''clear'''

 > * ''pathName'' '''edit altered'''

 > * ''pathName'' '''edit info'''

 > * ''pathName'' '''edit inspect'''

 > * ''pathName'' '''edit irreversible'''

 > * ''pathName'' '''edit recover'''

 > * ''pathName'' '''inspect'''

 > * ''pathName'' '''isclean'''

 > * ''pathName'' '''isdead'''

 > * ''pathName'' '''isempty'''

 > * ''pathName'' '''lineno'''

 > * ''pathName'' '''load'''

 > * ''pathName'' '''mark compare'''

 > * ''pathName'' '''mark exists'''

 > * ''pathName'' '''mark generate'''

 > * ''pathName'' '''tag clear'''

 > * ''pathName'' '''tag findnext'''

 > * ''pathName'' '''tag findprev'''

 > * ''pathName'' '''tag getrange'''

 > * ''pathName'' '''tag priority'''

 > * ''pathName'' '''watch'''

 * Additional tag attributes:

 > * '''-eolcolor'''

 > * '''-hyphencolor'''

 > * '''-hyphenrules'''

 > * '''-inactivebackground'''

 > * '''-inactiveforeground'''

 > * '''-inactiveselectbackground'''

 > * '''-inactiveselectforeground'''

 > * '''-indentbackground'''

 > * '''-undo'''

 * Additional widget options:

 > * '''-endindex'''

 > * '''-eolchar'''

 > * '''-eolcolor'''

 > * '''-eotchar'''

 > * '''-eotcolor'''

 > * '''-hyphencolor'''

 > * '''-hyphenrules'''

 > * '''-hyphens'''

 > * '''-inactiveselectforeground'''

 > * '''-insertforeground'''

 > * '''-maxredo'''

 > * '''-maxundosize'''

 > * '''-responsiveness'''

 > * '''-showendofline'''

 > * '''-showendoftext'''

 > * '''-showinsertforeground'''

 > * '''-spacemode'''

 > * '''-startindex'''

 > * '''-steadymarks'''

 > * '''-synctime'''

 > * '''-tagging'''

 * Extensions to the syntax for indices:

 > * new specifier '''begin'''

 > * new syntax ''tag''.'''current.first''', ''tag''.'''current.last'''

 > * new syntax '''@first,last'''

 * Additional features of existing subcommands:

 > * Additional option '''-marks''' for ''pathName'' '''delete''' command

 > * Additional optional parameter ''direction'' for ''pathName'' '''mark set''' sub-command

 > * New virtual event '''<<Altered>>''' to support new sub-command
     ''pathName'' '''edit altered'''

 > * Extensions to commands ''pathName'' '''edit reset''' and ''pathName''
     '''edit separator'''

 * Extended command ''pathName'' '''tag names'''

 * Additional switch for ''pathName'' '''dump'''

 * Additional option '''-extents''' for ''pathName'' '''bbox'''
     and ''pathName'' '''dlineinfo'''

 * Additional option '''-discardspecial''' for ''pathName'' '''mark names''', ''pathName'' '''mark next''', and ''pathName'' '''mark previous'''.

 * Additional optional parameter ''pattern'' for ''pathName'' '''mark names''', ''pathName'' '''mark next''', and ''pathName'' '''mark previous'''.

 * New helper commands:

 > * '''tk_mergeRange'''

 > * '''tk_textInsert'''

 > * '''tk_textReplace'''

 > * '''tk_textRebindMouseWheel'''

 * Additional option '''-owner''' for embedded window

 * Additional option '''-tags''' for embedded images and embedded windows

~~ Bug Fixes

 * Bug fixed in TkTextGetIndex

 * Bug fixed in TkTextGetIndexFromObj

 * Bug fixed in DeleteIndexRange (note that this bugfix implies that deletion at the end of the text handles the last newline now differently - slight incompatibility with the legacy text widget)

 * Bug fixed in TkTextDeleteTag/TagBindEvent

 * Problems fixed with '''-startline'''/'''-endline'''

 * Problems fixed with tag event handling

 * Several bug fixes with '''undo''

 * '''Edit modified''' confusing results fixed with new command
   '''edit altered'''

 * Severe problems with command '''sync''' fixed

 * Invalid changes in disabled widget are marked as deprecated

 * Inaccurate wrapping algorithm fixed

 * Bugs in display logic fixed

 * Insert cursor is now fully visible in all conditions

 * Trimming spaces: Issue #1082213 is invalid, the fix put in trunk (8.7) has
   been reverted (but there is now the new option '''-spacemode''' that can be
   set to '''trim''')

 * Issues with display of selections fixed

 * '''Update''' is no longer wasting the processor time since superfluous
   update computations are not done anymore

 * Bugs in context drawing support (OS X) fixed

 * Bugs fixed in tkUnixRFont.c

 * Several bug fixes related to handling/positioning of the insertion cursor

Details on each of these bugs can be found in the "Bugs/Issues in Original
Implementation" section at
http://scidb.sourceforge.net/tk/revised-text-widget.html

~~ Incompatibilities with Legacy Version

Based on the author's website, the following incompatibilities are currently
known:

 * [449] (undo/redo to Return Range of Characters) was not adapted into the
   revised implementation, because Issue #1217222 - the basis for [449] -
   is now featured by:

 > 1. The new undo implementation, because also the tag associations will be
      restored, and

 > 2. The powerful '''watch''' command, which also provides the affected
      ranges (with constant runtime behavior).

 > Moreover, the '''tk_mergeRange''' function convenience function has been
   implemented in the revised version.

 * The special selection tag '''sel''' can no longer be elided (would be
   useless anyway).

 * Tag options (introduced in 8.6.6) -overstrikefg and -underlinefg were
   renamed to '''-overstrikecolor''' and '''-underlinecolor'''

 * The new index syntax '''@first,last''' is incompatible with the legacy
   version but it is not expected that any existing application will break,
   certainly nobody is using such a form for the name of a mark or image

 * The default value of 50 ms for the new '''-responsiveness''' option is
   incompatible to prior releases, but it shouldn't matter here, because
   nobody wants flickering, and nobody is using special tricks with a short
   mouse hovering while the widget is scrolling. Setting the responsiveness to zero restores the old
   behavior of the text widget.

 * <<UndoStack>> is generated with any change on the undo stack, not only when
   the undo stack or the redo stack becomes empty or non-empty

 * '''-startline'''/'''-endline''' behavior was subtly changed in some corner cases

 * In revised implementation "+N chars" and "-N chars" refer to characters,
   and no longer to indices (which was the case in legacy code for backwards
   compatibility reasons).

~~ Deprecated Commands and Options

 * Tag options (introduced in 8.6.6) '''-overstrikefg''' and
   '''-underlinefg''' were renamed to -overstrikecolor and -underlinecolor

 * edit '''undodepth'''|'''redodepth'''|'''canundo'''|'''canredo''' are
   replaced by more general '''edit info'''

 * Widget options '''-startline'''/'''-endline'''' are replaced by
   -startindex/-endindex

~~ Drawbacks

 * The increase in memory usage is not very high (but a bit high), and despite
   this, in many cases, especially if many tags are used, and/or undo is
   enabled, the revised version is even decreasing the memory usage.

Detailed memory comparison between legacy code and revised code can be found
at http://scidb.sourceforge.net/tk/comparison.html

~~ Known Issues in the Revised Implementation

Based on the author's website, currently only these issues are known: 

 * The code for the implementation has increased by more than 100%, and about
   70% of the old code has been changed. The revised implementation needs more
   testing, the text widget is very complex, and bugs are expected. And a few
   additions are not yet well tested.

 * Function '''tk_textCopy''' is copying hidden (elided) text. This seems to
   be unexpected, but it's the behavior of the original implementation.
   Probably this is a bug and should be corrected.

 * Adding/deleting tags covering a large range of text is still quite time
   consuming.

 * The display line with the insert cursor is redrawn each time the cursor
   blinks, which causes a steady stream of graphics traffic. It would be
   desirable if the cursor update will be performed with a specialized and
   efficient redraw function.

 * If option '''-spacemode''' is set to trim, then '''get -displaychars'''
   should probably return trimmed spaces. Currently this command is not
   trimming spaces, so the result may not coincide with the visible text.

 * The '''search -regexp''' sub-command is still not yet fully implemented,
   see Tk documentation.

 * The revised widget still ignores modifying commands if state is
   not normal; this behavior is unreasonable, but conforms to the original
   version.

 * Currently the special index specifier '''begin''' has the lowest
   precedence, although it should have the same precedence as the special
   index special '''end''' (see section INDICES). In a future release this
   should be corrected.  The current behavior is a workaround, avoiding that
   existing applications will break with the introduction of '''begin'''.

 * The implementation still contains some TODO's of minor issues. 

Also, the following should be noted:

 * With the revised version there are failing tests on all platforms, they
   need to be fixed (by fixing the expected result in the test, or by fixing
   the text widget code).

 * More tests should be written to exercise the new or changed features.

 * The OS X case should be more tested on a real Mac, because it's the only
   platform using context drawing.

~~ Miscellaneous

 * No function signature pertaining to a public interface was changed. Also
   public data structures haven't been touched.

 * All recent new features brought in trunk in the legacy version have their
   counterpart in the revised version, have been improved in performance and
   have no known drawbacks. Minor incompatibilities are however identified
   here and there.

~ Target Release

Given the amount of changes, also because of our usual precautions regarding
backwards compatibility, and despite the very high quality of the code and the
fact it passes (almost all) the previously existing test suite, it is deemed
reasonable to target Tcl/Tk 8.7 (or 9.0), but neither the 8.6 nor the 8.5
streams of releases, which will continue to implement the legacy text widget
code.

Support of versions back to 8.5 is currently included in the revised code, but
will be removed (because it's useless for use in trunk only) at the time the
new code will get merged into trunk.

~ Implementation

Implementation of the revised text widget code has been placed in branch
[http://core.tcl.tk/tk/timeline?r=revised_text] of the fossil repository.

This implementation compiles on Linux, Windows, and OS X. It respects the
standards of Tk (C99 standard, and also the Tcl source code formatting
described in [247]).

The man page for the text widget has been contributed by jima and is included
in the revised_text branch.

The expected results of many tests were adjusted to take into account that the
revised implementation is better optimizing, so some trace results of display
line computation are different. Other adjustments were required because of bug
fixes.

~ Open Questions

 * tkTextUndo.c implements a specialized undo/redo, not using the legacy
   tkUndo.c. Reasons for this are stated at the top of tkTextUndo.c. It is
   interesting to note that, in the revised_text branch, tkUndo.c is not even
   compiled anymore, except on Linux (for no apparent reason). This is dead
   code waiting for use case by a widget. At least, compilation on Linux
   should be removed, but couldn't we even rename tkTextUndo.c to tkUndo.c and
   forget about the old implementation? tkTextUndo.c is also a shareable
   implementation (in the spirit of [104]).

 * Actual removal of deprecated features or keep them (some are marked as
   deprecated, but actually still supported)?

~ Copyright

This document has been placed in the public domain.

The author of the revised text widget code has explicitly placed his code of
the text widget under the same license as Tcl.

|

|

|
|

|

|

|

|

|

|

|

|
|
|

|

|
|
|
|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|
|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|
|
|

|

|

|

|

|
|

|

|
|

|

|
|

|
|

|

|

|

|
|
|

|

|
|

|
|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|
|

|

|

|

|
|

|

|

|
|

|

|
|

|

>
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478

 * A large number of new features

 * Numerous bug fixes

 * Very few incompatibilities with the legacy text widget

# Proposal

The proposal is to replace the legacy code with the new implementation.

The author of the revised implementation has written a well documented website
<http://scidb.sourceforge.net/tk/revised-text-widget.html> 
describing in details the issues with the legacy code, how he fixed these
issues, and what features he has changed or improved.

It was not deemed feasible nor necessary to copy/paste/reformat all the
information of the above website into the present TIP. Only the new features
and incompatibilities are highlighted here, as opposed to detailed rationales
about each change.

A version of the **text** man page, consistent with the changes and improvements proposed by the present TIP, can be seen at 
<http://scidb.sourceforge.net/tk/text.html>
This version of the man page is colorized, with blue meaning "changed", and green meaning "new", so that it is easier to spot what's different from the legacy text widget.

## Performance Improvements

Detailed performance comparison between legacy code and revised code can be
found at <http://scidb.sourceforge.net/tk/comparison.html>  but these are the
key points:

 * Long line problem, especially with many tags, is eliminated: in general
   only O\(N log N\) in revised version, was a higher order polynomial time in
   legacy code.

 * Display is faster: smoother scrolling, faster response time.
   <http://scidb.sourceforge.net/tk/display.html> 

 * Undo/redo is much faster: a completely new implementation has been worked
   out, directly working on the text segments.
   <http://scidb.sourceforge.net/tk/undo.html> 

## New Features

Detailed explanations and rationales for each of the items below can be found
at <http://scidb.sourceforge.net/tk/revised-text-widget.html> 

 * Undo/redo is handling tags \(this was requesteed in Issue \#1561991 and
   Issue \#1027741, embedded images, embedded windows, and also marks if option
   _-steadymarks_ is enabled

 * Additional widget state **readonly**

 * Hyphenation and full-justification support

	 > \* Support of hyphenation \(Issue \#1096580 is fixed\), with new helper
      functions **tk\_textinsert** and **tk\_textReplace**, and with new
      switches to _pathName_ **count**, _pathName_ **get**, and
      _pathName_ **search**

	 > \* Additional justification mode **full**

	 > \* Additional wrap mode **codepoint**, and new widget option
   **-useunibreak**. New subcommand _pathName_ **brks**

	 > \* Additional option **-lang** used to guide hyphenation engines.

 * Additional subcommands:

	 > \* _pathName_ **checksum**

	 > \* _pathName_ **clear**

	 > \* _pathName_ **edit altered**

	 > \* _pathName_ **edit info**

	 > \* _pathName_ **edit inspect**

	 > \* _pathName_ **edit irreversible**

	 > \* _pathName_ **edit recover**

	 > \* _pathName_ **inspect**

	 > \* _pathName_ **isclean**

	 > \* _pathName_ **isdead**

	 > \* _pathName_ **isempty**

	 > \* _pathName_ **lineno**

	 > \* _pathName_ **load**

	 > \* _pathName_ **mark compare**

	 > \* _pathName_ **mark exists**

	 > \* _pathName_ **mark generate**

	 > \* _pathName_ **tag clear**

	 > \* _pathName_ **tag findnext**

	 > \* _pathName_ **tag findprev**

	 > \* _pathName_ **tag getrange**

	 > \* _pathName_ **tag priority**

	 > \* _pathName_ **watch**

 * Additional tag attributes:

	 > \* **-eolcolor**

	 > \* **-hyphencolor**

	 > \* **-hyphenrules**

	 > \* **-inactivebackground**

	 > \* **-inactiveforeground**

	 > \* **-inactiveselectbackground**

	 > \* **-inactiveselectforeground**

	 > \* **-indentbackground**

	 > \* **-undo**

 * Additional widget options:

	 > \* **-endindex**

	 > \* **-eolchar**

	 > \* **-eolcolor**

	 > \* **-eotchar**

	 > \* **-eotcolor**

	 > \* **-hyphencolor**

	 > \* **-hyphenrules**

	 > \* **-hyphens**

	 > \* **-inactiveselectforeground**

	 > \* **-insertforeground**

	 > \* **-maxredo**

	 > \* **-maxundosize**

	 > \* **-responsiveness**

	 > \* **-showendofline**

	 > \* **-showendoftext**

	 > \* **-showinsertforeground**

	 > \* **-spacemode**

	 > \* **-startindex**

	 > \* **-steadymarks**

	 > \* **-synctime**

	 > \* **-tagging**

 * Extensions to the syntax for indices:

	 > \* new specifier **begin**

	 > \* new syntax _tag_.**current.first**, _tag_.**current.last**

	 > \* new syntax **@first,last**

 * Additional features of existing subcommands:

	 > \* Additional option **-marks** for _pathName_ **delete** command

	 > \* Additional optional parameter _direction_ for _pathName_ **mark set** sub-command

	 > \* New virtual event **<<Altered>>** to support new sub-command
     _pathName_ **edit altered**

	 > \* Extensions to commands _pathName_ **edit reset** and _pathName_
     **edit separator**

 * Extended command _pathName_ **tag names**

 * Additional switch for _pathName_ **dump**

 * Additional option **-extents** for _pathName_ **bbox**
     and _pathName_ **dlineinfo**

 * Additional option **-discardspecial** for _pathName_ **mark names**, _pathName_ **mark next**, and _pathName_ **mark previous**.

 * Additional optional parameter _pattern_ for _pathName_ **mark names**, _pathName_ **mark next**, and _pathName_ **mark previous**.

 * New helper commands:

	 > \* **tk\_mergeRange**

	 > \* **tk\_textInsert**

	 > \* **tk\_textReplace**

	 > \* **tk\_textRebindMouseWheel**

 * Additional option **-owner** for embedded window

 * Additional option **-tags** for embedded images and embedded windows

## Bug Fixes

 * Bug fixed in TkTextGetIndex

 * Bug fixed in TkTextGetIndexFromObj

 * Bug fixed in DeleteIndexRange \(note that this bugfix implies that deletion at the end of the text handles the last newline now differently - slight incompatibility with the legacy text widget\)

 * Bug fixed in TkTextDeleteTag/TagBindEvent

 * Problems fixed with **-startline**/**-endline**

 * Problems fixed with tag event handling

 * Several bug fixes with **undo_

 * **Edit modified** confusing results fixed with new command
   **edit altered**

 * Severe problems with command **sync** fixed

 * Invalid changes in disabled widget are marked as deprecated

 * Inaccurate wrapping algorithm fixed

 * Bugs in display logic fixed

 * Insert cursor is now fully visible in all conditions

 * Trimming spaces: Issue \#1082213 is invalid, the fix put in trunk \(8.7\) has
   been reverted \(but there is now the new option **-spacemode** that can be
   set to **trim**\)

 * Issues with display of selections fixed

 * **Update** is no longer wasting the processor time since superfluous
   update computations are not done anymore

 * Bugs in context drawing support \(OS X\) fixed

 * Bugs fixed in tkUnixRFont.c

 * Several bug fixes related to handling/positioning of the insertion cursor

Details on each of these bugs can be found in the "Bugs/Issues in Original
Implementation" section at
<http://scidb.sourceforge.net/tk/revised-text-widget.html>

## Incompatibilities with Legacy Version

Based on the author's website, the following incompatibilities are currently
known:

 * [[449]](449.md) \(undo/redo to Return Range of Characters\) was not adapted into the
   revised implementation, because Issue \#1217222 - the basis for [[449]](449.md) -
   is now featured by:

	 > 1. The new undo implementation, because also the tag associations will be
      restored, and

	 > 2. The powerful **watch** command, which also provides the affected
      ranges \(with constant runtime behavior\).

	 > Moreover, the **tk\_mergeRange** function convenience function has been
   implemented in the revised version.

 * The special selection tag **sel** can no longer be elided \(would be
   useless anyway\).

 * Tag options \(introduced in 8.6.6\) -overstrikefg and -underlinefg were
   renamed to **-overstrikecolor** and **-underlinecolor**

 * The new index syntax **@first,last** is incompatible with the legacy
   version but it is not expected that any existing application will break,
   certainly nobody is using such a form for the name of a mark or image

 * The default value of 50 ms for the new **-responsiveness** option is
   incompatible to prior releases, but it shouldn't matter here, because
   nobody wants flickering, and nobody is using special tricks with a short
   mouse hovering while the widget is scrolling. Setting the responsiveness to zero restores the old
   behavior of the text widget.

 * <<UndoStack>> is generated with any change on the undo stack, not only when
   the undo stack or the redo stack becomes empty or non-empty

 * **-startline**/**-endline** behavior was subtly changed in some corner cases

 * In revised implementation "\+N chars" and "-N chars" refer to characters,
   and no longer to indices \(which was the case in legacy code for backwards
   compatibility reasons\).

## Deprecated Commands and Options

 * Tag options \(introduced in 8.6.6\) **-overstrikefg** and
   **-underlinefg** were renamed to -overstrikecolor and -underlinecolor

 * edit **undodepth**\|**redodepth**\|**canundo**\|**canredo** are
   replaced by more general **edit info**

 * Widget options **-startline**/**-endline**' are replaced by
   -startindex/-endindex

## Drawbacks

 * The increase in memory usage is not very high \(but a bit high\), and despite
   this, in many cases, especially if many tags are used, and/or undo is
   enabled, the revised version is even decreasing the memory usage.

Detailed memory comparison between legacy code and revised code can be found
at <http://scidb.sourceforge.net/tk/comparison.html>

## Known Issues in the Revised Implementation

Based on the author's website, currently only these issues are known: 

 * The code for the implementation has increased by more than 100%, and about
   70% of the old code has been changed. The revised implementation needs more
   testing, the text widget is very complex, and bugs are expected. And a few
   additions are not yet well tested.

 * Function **tk\_textCopy** is copying hidden \(elided\) text. This seems to
   be unexpected, but it's the behavior of the original implementation.
   Probably this is a bug and should be corrected.

 * Adding/deleting tags covering a large range of text is still quite time
   consuming.

 * The display line with the insert cursor is redrawn each time the cursor
   blinks, which causes a steady stream of graphics traffic. It would be
   desirable if the cursor update will be performed with a specialized and
   efficient redraw function.

 * If option **-spacemode** is set to trim, then **get -displaychars**
   should probably return trimmed spaces. Currently this command is not
   trimming spaces, so the result may not coincide with the visible text.

 * The **search -regexp** sub-command is still not yet fully implemented,
   see Tk documentation.

 * The revised widget still ignores modifying commands if state is
   not normal; this behavior is unreasonable, but conforms to the original
   version.

 * Currently the special index specifier **begin** has the lowest
   precedence, although it should have the same precedence as the special
   index special **end** \(see section INDICES\). In a future release this
   should be corrected.  The current behavior is a workaround, avoiding that
   existing applications will break with the introduction of **begin**.

 * The implementation still contains some TODO's of minor issues. 

Also, the following should be noted:

 * With the revised version there are failing tests on all platforms, they
   need to be fixed \(by fixing the expected result in the test, or by fixing
   the text widget code\).

 * More tests should be written to exercise the new or changed features.

 * The OS X case should be more tested on a real Mac, because it's the only
   platform using context drawing.

## Miscellaneous

 * No function signature pertaining to a public interface was changed. Also
   public data structures haven't been touched.

 * All recent new features brought in trunk in the legacy version have their
   counterpart in the revised version, have been improved in performance and
   have no known drawbacks. Minor incompatibilities are however identified
   here and there.

# Target Release

Given the amount of changes, also because of our usual precautions regarding
backwards compatibility, and despite the very high quality of the code and the
fact it passes \(almost all\) the previously existing test suite, it is deemed
reasonable to target Tcl/Tk 8.7 \(or 9.0\), but neither the 8.6 nor the 8.5
streams of releases, which will continue to implement the legacy text widget
code.

Support of versions back to 8.5 is currently included in the revised code, but
will be removed \(because it's useless for use in trunk only\) at the time the
new code will get merged into trunk.

# Implementation

Implementation of the revised text widget code has been placed in branch
<http://core.tcl.tk/tk/timeline?r=revised_text>  of the fossil repository.

This implementation compiles on Linux, Windows, and OS X. It respects the
standards of Tk \(C99 standard, and also the Tcl source code formatting
described in [[247]](247.md)\).

The man page for the text widget has been contributed by jima and is included
in the revised\_text branch.

The expected results of many tests were adjusted to take into account that the
revised implementation is better optimizing, so some trace results of display
line computation are different. Other adjustments were required because of bug
fixes.

# Open Questions

 * tkTextUndo.c implements a specialized undo/redo, not using the legacy
   tkUndo.c. Reasons for this are stated at the top of tkTextUndo.c. It is
   interesting to note that, in the revised\_text branch, tkUndo.c is not even
   compiled anymore, except on Linux \(for no apparent reason\). This is dead
   code waiting for use case by a widget. At least, compilation on Linux
   should be removed, but couldn't we even rename tkTextUndo.c to tkUndo.c and
   forget about the old implementation? tkTextUndo.c is also a shareable
   implementation \(in the spirit of [[104]](104.md)\).

 * Actual removal of deprecated features or keep them \(some are marked as
   deprecated, but actually still supported\)?

# Copyright

This document has been placed in the public domain.

The author of the revised text widget code has explicitly placed his code of
the text widget under the same license as Tcl.

Name change from tip/467.tip to tip/467.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

TIP:            467
Title:          Move TIP Collection to Fossil
Version:        $Revision: 1.5 $
Author:         Mark Janssen <[email protected]>
State:          Draft
Type:           Process
Vote:           Pending
Created:        14-Mar-2017
Post-History:   
Keywords:       migration

~ Abstract

The Tcl TIP collection shall be moved to Fossil and the process of managing
TIPs shall use Fossil as much as possible. The TIP format will be changed from
a TIP-specific form to Markdown.

~ Rationale

Triggered by some people having issues with changing content on current TIP
website and discussion on the #tcl chat, I have experimented with fossil as a
medium to host the Tcl TIP collection.

The current TIP storage and handling requires a lot of scripts that need to be
maintained by the TCT and it is less open than it could be.
There are also advantages to switching to Fossil in place of CVS.

 * Fossil has embedded Markdown rendering.

 * Fossil is already used to manage the Tcl and Tk sources.

 * TIP discussion and CFVs can be done and tracked using fossil tickets.

 * Fossil events could also track CFV's and Vote results

 * CVS is extremely vulnerable to problems with system administration on a
   single host. With a fossil-based system, it is much simpler to have
   multiple repositories.

Besides Fossil supporting Markdown out of the box, markdown is also better
option for the future than the current format. The value of making up
your own plain text format in this age is debatable (especially for the TIP
requirements). Markdown has widely available options to convert to other
formats without any need for the community to maintain the converters, and
supports key extra features such as embedded images (which are important for
some Tk TIPs, and never worked particularly well with the old TIP format).

~ Specification

Proposed URL for the new repository will be http://core.tcl.tk/tips

~~ Backwards compatibility

 * ''tip.tcl.tk/<NUM>.html'' should still show a rendered result. This could be redirected to ''core.tcl.tk/tips/doc/trunk/tip/<NUM>.md''

 * ''tip.tcl.tk'' offers several converted formats (XML, *roff, ...). The fossil option will be to use the ''core.tcl.tk/tips/file/tip/<NUM>.md?download'' URL to get the raw Markdown downloads. For getting the other options one could convert the markdown source file using something like pandoc.

 * E-mail address are not hidden in the source and in the rendered result.  If e-mail addresses need to be hidden there are two options

    1. Remove mails from source.

    2. Hide e-mails in fossil.

    3. Hide e-mails in the webhost.

  Option 2. doesn't help much as the mails are still online in the raw markdown files. Option 1. loses information. Suggested is to leave the addresses untouched. 

~~ Process

TBD

~~ TIP Format

The Markdown tip format has one extension to standard Markdown:

Any TIP.md file will have a mandatory preamble starting with the title (for
fossil rendering) and ending with a `------` on a single line. Between these
parts there is tab indented meta information about the tip. (Tab indented so
it renders nicer in fossil, 4 spaces would also work)

Example from [0]:

|# TIP 0: Tcl Core Team Basic Rules
|    State:          Final
|    Type:           Process
|    Vote:           Done
|    Post-History:
|------

~ Implementation

 * There is a proof of concept conversion (with CVS history) at
   https://fossil.mpcjanssen.nl/tips

 * The scripts for the automatic conversion are at
   https://fossil.mpcjanssen.nl/tip-migration

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|
|

|
|

|

|

|

|

|

|

|

|
|
|
|

|

|
|
|
|
|
|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

# TIP 467: Move TIP Collection to Fossil

	Author:         Mark Janssen <[email protected]>
	State:          Draft
	Type:           Process
	Vote:           Pending
	Created:        14-Mar-2017
	Post-History:   
	Keywords:       migration
-----

# Abstract

The Tcl TIP collection shall be moved to Fossil and the process of managing
TIPs shall use Fossil as much as possible. The TIP format will be changed from
a TIP-specific form to Markdown.

# Rationale

Triggered by some people having issues with changing content on current TIP
website and discussion on the \#tcl chat, I have experimented with fossil as a
medium to host the Tcl TIP collection.

The current TIP storage and handling requires a lot of scripts that need to be
maintained by the TCT and it is less open than it could be.
There are also advantages to switching to Fossil in place of CVS.

 * Fossil has embedded Markdown rendering.

 * Fossil is already used to manage the Tcl and Tk sources.

 * TIP discussion and CFVs can be done and tracked using fossil tickets.

 * Fossil events could also track CFV's and Vote results

 * CVS is extremely vulnerable to problems with system administration on a
   single host. With a fossil-based system, it is much simpler to have
   multiple repositories.

Besides Fossil supporting Markdown out of the box, markdown is also better
option for the future than the current format. The value of making up
your own plain text format in this age is debatable \(especially for the TIP
requirements\). Markdown has widely available options to convert to other
formats without any need for the community to maintain the converters, and
supports key extra features such as embedded images \(which are important for
some Tk TIPs, and never worked particularly well with the old TIP format\).

# Specification

Proposed URL for the new repository will be <http://core.tcl.tk/tips>

## Backwards compatibility

 * _tip.tcl.tk/<NUM>.html_ should still show a rendered result. This could be redirected to _core.tcl.tk/tips/doc/trunk/tip/<NUM>.md_

 * _tip.tcl.tk_ offers several converted formats \(XML, \*roff, ...\). The fossil option will be to use the _core.tcl.tk/tips/file/tip/<NUM>.md?download_ URL to get the raw Markdown downloads. For getting the other options one could convert the markdown source file using something like pandoc.

 * E-mail address are not hidden in the source and in the rendered result.  If e-mail addresses need to be hidden there are two options

    1. Remove mails from source.

    2. Hide e-mails in fossil.

    3. Hide e-mails in the webhost.

  Option 2. doesn't help much as the mails are still online in the raw markdown files. Option 1. loses information. Suggested is to leave the addresses untouched. 

## Process

TBD

## TIP Format

The Markdown tip format has one extension to standard Markdown:

Any TIP.md file will have a mandatory preamble starting with the title \(for
fossil rendering\) and ending with a \`------\` on a single line. Between these
parts there is tab indented meta information about the tip. \(Tab indented so
it renders nicer in fossil, 4 spaces would also work\)

Example from [[0]](0.md):

	# TIP 0: Tcl Core Team Basic Rules
	    State:          Final
	    Type:           Process
	    Vote:           Done
	    Post-History:
	------

# Implementation

 * There is a proof of concept conversion \(with CVS history\) at
   <https://fossil.mpcjanssen.nl/tips>

 * The scripts for the automatic conversion are at
   <https://fossil.mpcjanssen.nl/tip-migration>

# Copyright

This document has been placed in the public domain.

Name change from tip/468.tip to tip/468.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

TIP:		468
Title:		Support Passing TCP listen Backlog Size Option to TCP Socket Creation
Version:	$Revision: 1.1 $
Author:		Shannon Noe <[email protected]>
State:		Draft
Type:		Project
Vote:		Pending
Created:	03-Apr-2017
Post-History:  
Keywords:	Tcl, socket, SOMAXCONN
Tcl-Version:	8.7

~ Abstract

This TIP adds the ability to control the TCP backlog depth used by the
''listen'' system call within the '''socket''' Command. The API function,
'''Tcl_OpenTcpServerEx''', will be extended to allow the passing of the
backlog value. Currently, the SOMAXCONN macro is used as the default. Backlog
values are hard coded to a minimum of 100. The backlog values of 1 and 0 are
useful on the Linux platform.

~ Rationale

Modern Linux TCP supports the kernel managing the listen queue for TCP
sockets. Multiple processes open the same socket address and ports with
SOREUSEADDR and SOREUSEPORT. Each process then uses a backlog value of 1 to
process a single connection at a time. This is explained in detail on this
website
http://veithen.github.io/2014/01/01/how-tcp-backlog-works-in-linux.html

Tighter control over this would allow Tcl scripts to have tighter control over
whether to support a large backlog of sockets waiting to be opened. (Exceeding
the limit would cause the OS to automatically reject the socket connection,
which might be preferable in some high-availability situations to being
blocked for an unknown amount of time.)

~ Specification

A '''Tcl_OpenTcpServerEx''' function will be changed to add a ''backlog''
parameter with this signature:

 > Tcl_Channel '''Tcl_OpenTcpServerEx'''(Tcl_Interp *''interp'', const char *
    ''service'', const char *''myHost'', unsigned int ''flags'',  int ''backlog'',
    Tcl_TcpAcceptProc *''acceptProc'', ClientData ''acceptProcData'')

As for the Tcl side, the '''socket''' command gains a new optional switch that
are only valid for server sockets: ?'''-backlog''' ''int''?. Omitting the
parameter will cause the default value to be used.

Tcl code includes local macro’s for SOMAXCONN which override all platforms
values for SOMAXCONN. This makes backwards compatibility easier. We only need
to preserve the macro value in the default code path.

~ Reference Implementation

Please refer to the ''tip-???'' branch of the core Tcl repository.

~ Backwards Compatibility

The '''Tcl_OpenTcpServerEx''' will retain the old behavior by default as
SOMAXCONN. The SOMAXCONN is defined by macros in the Tcl source. All Tcl code
paths with a listen() system call pass a backlog value. No new code paths are
introduced, only new values for the listen backlog parameter.

The '''socket''' command will be backwards compatible. The default
'''-backlog''' parameter is set to ''SOMAXCONN''. Omission of the new
parameter provides the current behavior.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|

|

|

|

|

|
|
|

|
|

|

|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

# TIP 468: Support Passing TCP listen Backlog Size Option to TCP Socket Creation

	Author:		Shannon Noe <[email protected]>
	State:		Draft
	Type:		Project
	Vote:		Pending
	Created:	03-Apr-2017
	Post-History:  
	Keywords:	Tcl, socket, SOMAXCONN
	Tcl-Version:	8.7
-----

# Abstract

This TIP adds the ability to control the TCP backlog depth used by the
_listen_ system call within the **socket** Command. The API function,
**Tcl\_OpenTcpServerEx**, will be extended to allow the passing of the
backlog value. Currently, the SOMAXCONN macro is used as the default. Backlog
values are hard coded to a minimum of 100. The backlog values of 1 and 0 are
useful on the Linux platform.

# Rationale

Modern Linux TCP supports the kernel managing the listen queue for TCP
sockets. Multiple processes open the same socket address and ports with
SOREUSEADDR and SOREUSEPORT. Each process then uses a backlog value of 1 to
process a single connection at a time. This is explained in detail on this
website
<http://veithen.github.io/2014/01/01/how-tcp-backlog-works-in-linux.html>

Tighter control over this would allow Tcl scripts to have tighter control over
whether to support a large backlog of sockets waiting to be opened. \(Exceeding
the limit would cause the OS to automatically reject the socket connection,
which might be preferable in some high-availability situations to being
blocked for an unknown amount of time.\)

# Specification

A **Tcl\_OpenTcpServerEx** function will be changed to add a _backlog_
parameter with this signature:

 > Tcl\_Channel **Tcl\_OpenTcpServerEx**\(Tcl\_Interp \*_interp_, const char \*
    _service_, const char \*_myHost_, unsigned int _flags_,  int _backlog_,
    Tcl\_TcpAcceptProc \*_acceptProc_, ClientData _acceptProcData_\)

As for the Tcl side, the **socket** command gains a new optional switch that
are only valid for server sockets: ?**-backlog** _int_?. Omitting the
parameter will cause the default value to be used.

Tcl code includes local macro’s for SOMAXCONN which override all platforms
values for SOMAXCONN. This makes backwards compatibility easier. We only need
to preserve the macro value in the default code path.

# Reference Implementation

Please refer to the _tip-???_ branch of the core Tcl repository.

# Backwards Compatibility

The **Tcl\_OpenTcpServerEx** will retain the old behavior by default as
SOMAXCONN. The SOMAXCONN is defined by macros in the Tcl source. All Tcl code
paths with a listen\(\) system call pass a backlog value. No new code paths are
introduced, only new values for the listen backlog parameter.

The **socket** command will be backwards compatible. The default
**-backlog** parameter is set to _SOMAXCONN_. Omission of the new
parameter provides the current behavior.

# Copyright

This document has been placed in the public domain.

Name change from tip/469.tip to tip/469.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86

TIP:            469
Title:          A Callback for Channel-Exception Conditions
Version:        $Revision: 1.3 $
Author:         Andreas Leitgeb <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        16-Apr-2017
Post-History:   
Keywords:       Tcl,event handling
Tcl-Version:    8.7

~ Abstract

This TIP proposes to extend the '''fileevent''' Tcl command to also accept the
keyword '''exception''' for its second argument. This will allow to register a
callback for the specific event that the OS reports an exception on the
channel, while ignoring read- or writability.

~ Rationale

Tcl already allows registering for exceptions in its C-API function
Tcl_CreateChannelHandler(). This TIP merely enables the command
'''fileevent''' to pass TCL_EXCEPTION for the mask in the call to
Tcl_CreateChannelHandler().

On Linux, there exist special "files" that are always readable or writable
without blocking, but certain (hardware-related) events are reported as
exceptions on the channel.  The example at hand is the "sysfs"-API for GPIO
(general purpose input output) where level-changes on GPIO pins are reported
as exceptions on the channel. For details see
[https://www.kernel.org/doc/Documentation/gpio/sysfs.txt] and the paragraphs
about "value".

Listening for readable plus exceptions (as Tcl automatically does when asking
for readable event) doesn't help here, because then the event would
continuously fire, as reading the current level on a pin never blocks.

The only way to react to level-changes (short of busy-looping) is to have the
internal select/poll call specify exclusively the exception notification for
that channel.

~ Specification

This document proposes to add the keyword '''exception''' to the
'''fileevent''' command, where so far only '''readable''' and '''writable'''
are allowed.

If '''exception''' is given as event specifier, then a handler script is
registered, cleared or queried just like with '''readable''' or
'''writable'''.

Since '''readable''' or '''writable''' already check for exception as well,
registering an exception event for a channel that already has readable and/or
writable handlers registered makes little sense, but allowing it does not
raise any issues that having both readable and writable handlers wouldn't
already have, so being fussy about it would confuse more than it could help to
avoid confusion.

~ Alternatives

The ''piio'' extension provides event registration on its own, but its support
for certain IO-chipsets lags behind the sysfs-API.

With '''exception''' becoming its own event type, then '''readable''' and
'''writable''' would no longer need to also fire on exceptions, but
compatibility forbids this particular follow-up change.

~ Compatibility

No incompatibilities are introduced.

~ Reference Implementation

A really bare-bones reference implementation is available as a patch
[http://paste.tclers.tk/4231]. Also, a branch named tip-469 in fossil has been created: [https://core.tcl.tk/tcl/timeline?r=tip-469].

A thus-patched tclsh can successfully wait for input-level changes
on TIP-author's "nano-pi" raspberryPI-like platform with a chipset
not yet supported by piio.

Documentation and test updates yet to be done.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|
|
|

|

|

|

|
|

|

|

|
|

|
|
|

|

|

|

|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86

# TIP 469: A Callback for Channel-Exception Conditions

	Author:         Andreas Leitgeb <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        16-Apr-2017
	Post-History:   
	Keywords:       Tcl,event handling
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes to extend the **fileevent** Tcl command to also accept the
keyword **exception** for its second argument. This will allow to register a
callback for the specific event that the OS reports an exception on the
channel, while ignoring read- or writability.

# Rationale

Tcl already allows registering for exceptions in its C-API function
Tcl\_CreateChannelHandler\(\). This TIP merely enables the command
**fileevent** to pass TCL\_EXCEPTION for the mask in the call to
Tcl\_CreateChannelHandler\(\).

On Linux, there exist special "files" that are always readable or writable
without blocking, but certain \(hardware-related\) events are reported as
exceptions on the channel.  The example at hand is the "sysfs"-API for GPIO
\(general purpose input output\) where level-changes on GPIO pins are reported
as exceptions on the channel. For details see
<https://www.kernel.org/doc/Documentation/gpio/sysfs.txt>  and the paragraphs
about "value".

Listening for readable plus exceptions \(as Tcl automatically does when asking
for readable event\) doesn't help here, because then the event would
continuously fire, as reading the current level on a pin never blocks.

The only way to react to level-changes \(short of busy-looping\) is to have the
internal select/poll call specify exclusively the exception notification for
that channel.

# Specification

This document proposes to add the keyword **exception** to the
**fileevent** command, where so far only **readable** and **writable**
are allowed.

If **exception** is given as event specifier, then a handler script is
registered, cleared or queried just like with **readable** or
**writable**.

Since **readable** or **writable** already check for exception as well,
registering an exception event for a channel that already has readable and/or
writable handlers registered makes little sense, but allowing it does not
raise any issues that having both readable and writable handlers wouldn't
already have, so being fussy about it would confuse more than it could help to
avoid confusion.

# Alternatives

The _piio_ extension provides event registration on its own, but its support
for certain IO-chipsets lags behind the sysfs-API.

With **exception** becoming its own event type, then **readable** and
**writable** would no longer need to also fire on exceptions, but
compatibility forbids this particular follow-up change.

# Compatibility

No incompatibilities are introduced.

# Reference Implementation

A really bare-bones reference implementation is available as a patch
<http://paste.tclers.tk/4231> . Also, a branch named tip-469 in fossil has been created: <https://core.tcl.tk/tcl/timeline?r=tip-469> .

A thus-patched tclsh can successfully wait for input-level changes
on TIP-author's "nano-pi" raspberryPI-like platform with a chipset
not yet supported by piio.

Documentation and test updates yet to be done.

# Copyright

This document has been placed in the public domain.

Name change from tip/47.tip to tip/47.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148

149
150

151
152
153
154
155
156
157
158
159
160
161
162

163
164
165
166
167
168

169
170
171

172
173
174

175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191

TIP:            47
Title:          Modifying Tk to Allow Writing X Window managers
Version:        $Revision: 1.9 $
Author:         Neil McKay <[email protected]>
Author:         Andreas Kupries <[email protected]>
Author:         Donal K. Fellows <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        19-Jul-2001
Post-History:   
Tcl-Version:    8.4

~ Abstract

With a few modifications to the Tk core, extensions could be
written that would allow X window managers to be implemented
as Tcl/Tk scripts.

~ Requirements

Writing X window managers in Tk requires some facilities that
the current Tk core doesn't provide. A window manager
must be able to:

 * draw to, and handle events on, the display's
	root window (including ''<Create>'',
	''<MapRequest>'', ''<ResizeRequest>'', ''<CirculateRequest>'',
	and ''<ConfigureRequest>'' events, which are currently
	ignored)

 * embed arbitrary windows inside Tk windows

 * receive ''<PropertyNotify>'' events from embedded windows

 * perform a variety of other X-specific operations

Window embedding can be handled by an extension, if it is not
incorporated into the Tk frame widget at some later time.
Likewise, the X-specific operations can be handled by
an extension. However, Tk as it currently stands cannot
access the display's root window, nor can ''<PropertyNotify>''
events be received from embedded windows; doing these
things requires core modifications.

~ Root Window Access

The root window is special in many ways:

 * It does not need to be created

 * It cannot be destroyed, moved, or resized

 * Only one process can receive ''<ButtonPress>'' and ''<ButtonRelease>''
	events from it, and only one process can have the
	''SubstructureRedirect'' and ''ResizeRedirect'' masks set

 * It has no physical parent window

Because of these properties, access to the root window via a Tk
widget presents some difficulties. First, the widget's window cannot
be created in the standard way; however, this problem may be solved by
providing a non-standard creation routine via the
''Tk_SetClassProcs'' procedure described in [5].
Likewise, the event handling required by the root window
can be enabled in an extension, although some care is required
when enabling ''<ButtonPress>'' and certain other events.
What really causes problems is the lack of a physical parent.
There are many places in Tk where it is assumed that only
toplevel widgets have no physical parent within the application;
this is reflected in the Tk source by the use of the ''TK_TOP_LEVEL'' flag.
This flag is used to mean different things in different places.
In particular, the ''TK_TOP_LEVEL'' flag may mean:

 * This window is a toplevel widget

 * This widget has a wrapper window

 * This widget's window is controlled by the window manager

 * This window is at the top of a physical window hierarchy
	within the current application

In the current version of Tk, toplevel widgets have all of these
properties, and no other widgets have any of these properties;
hence a single flag suffices.
If we create a widget whose window is the display's root, then this
is no longer the case; a root window has the last property, but not
the first three. For this reason, it is necessary to replace
the ''TK_TOP_LEVEL'' flag with at least two distinct flags. A better
idea is to replace the ''TK_TOP_LEVEL'' flag with four flags, one for each
of the properties listed above. (Even in a standard Tk distribution,
this replacement is desirable for documentation reasons, since it will
indicate what property of a toplevel widget is important in the current
circumstances.) We must also replace the ''Tk_IsTopLevel'' macro with
several macros, or just eliminate it entirely.

One possible set of flag names is:

 TK_TOP_LEVEL: this is a toplevel widget

 TK_HAS_WRAPPER: this window has a wrapper window

 TK_WIN_MANAGED: this window is controlled by the window manager

 TK_TOP_HIERARCHY: this window is at the top of a physical window hierarchy

~ New Event Bindings and Substitutions

A window manager must be able to intercept certain events on the root
window that the standard Tk distribution doesn't recognize, and
it must be able to obtain information about those events. In particular,
it needs to respond to ''<CirculateRequest>'', ''<ConfigureRequest>'',
''<CreateNotify>'', ''<MapRequest>'', and ''<ResizeRequest>'' events.
These events are ignored by standard Tk, and need not be enabled by
default; however, they need to be included in the list of events recognized
by the Tk ''[bind]'' command. Adding this facility is very simple.

Obtaining information about these events is also necessary.
This is usually done via %-substitutions in the ''[bind]'' command;
however, there are two pieces of information that are necessary
for implementing a window manager that cannot be obtained via
the current %-substitution mechanism: the numerical X window ID,
required to handle ''<CreateNotify>'' events, and the property name,
for handling ''<PropertyNotify>'' events. This information could be
obtained by adding two new %-substitutions:

 %i: substitute the numerical window ID for the event

 %P: substitute the atom name for the property being changed

~ Propagating <PropertyNotify> Events

In order to receive ''<PropertyNotify>'' events from embedded windows,
the Tk event loop must handle events not just for windows that
are represented by ''Tk_Window'' structures, but also for their children.
One way to accomplish this is to add another flag for the
''Tk_Window'' struct, and alter the event loop so that
it will also look at a window's parent, if the event is a
''<PropertyNotify>'' event. The relevant part of the Tk event loop
currently looks like this:

|
|winPtr = (TkWindow *) Tk_IdToWindow(eventPtr->xany.display, handlerWindow);
|if (winPtr == NULL) {
|    if (eventPtr->type == PropertyNotify) {
|	TkSelPropProc(eventPtr);
|    }

|    return;
|}

|

If the flag for propagating ''<PropertyNotify>'' events is
''TK_PROP_PROPCHANGE'', then the code above must be modified to look
approximately like this:

|
|winPtr = (TkWindow *) Tk_IdToWindow(eventPtr->xany.display, handlerWindow);
|if (winPtr == NULL) {
|    if (eventPtr->type != PropertyNotify) {
|	return;
|    }

|    TkSelPropProc(eventPtr);
|    parentXId = (parent of handlerWindow);
|    winPtr = (TkWindow *) Tk_IdToWindow(eventPtr->xany.display, parentXId);
|    if (winPtr == NULL) {
|	return;
|    }

|    if (!(winPtr->flags & TK_PROP_PROPCHANGE)) {
|	return;
|    }

|    handlerWindow = parentXId;
|    return;
|}

|

~ Patches

A patch (against tk8.4a2) that implements the changes described above
is available
[http://www.eecs.umich.edu/~mckay/computer/wmenablers.84a3.patch.gz].

~ Notes

''Andreas Kupries.'' There was a ''tkwm' patch once.
[http://www.neosoft.com/tcl/ftparchive/sorted/x11/tkwm/]
[http://www.ensta.fr/internet/unix/window_managers/tkwm.html]

~ Copyright

This document is in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|
|
|
|

|

|

|

|

|

|

|

|

|

|
|
|

|

|

|

|

|

|

|
|

|

|

|
|

|

|

|

|

|

|
|
|
|
|
<
>
|
<
>
|

|
|

|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
<
>
|
|
<
>
|

|

|

|

|

|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146

147
148

149
150
151
152
153
154
155
156
157
158
159
160

161
162
163
164
165
166

167
168
169

170
171
172

173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191

# TIP 47: Modifying Tk to Allow Writing X Window managers

	Author:         Neil McKay <[email protected]>
	Author:         Andreas Kupries <[email protected]>
	Author:         Donal K. Fellows <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        19-Jul-2001
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

With a few modifications to the Tk core, extensions could be
written that would allow X window managers to be implemented
as Tcl/Tk scripts.

# Requirements

Writing X window managers in Tk requires some facilities that
the current Tk core doesn't provide. A window manager
must be able to:

 * draw to, and handle events on, the display's
	root window \(including _<Create>_,
	_<MapRequest>_, _<ResizeRequest>_, _<CirculateRequest>_,
	and _<ConfigureRequest>_ events, which are currently
	ignored\)

 * embed arbitrary windows inside Tk windows

 * receive _<PropertyNotify>_ events from embedded windows

 * perform a variety of other X-specific operations

Window embedding can be handled by an extension, if it is not
incorporated into the Tk frame widget at some later time.
Likewise, the X-specific operations can be handled by
an extension. However, Tk as it currently stands cannot
access the display's root window, nor can _<PropertyNotify>_
events be received from embedded windows; doing these
things requires core modifications.

# Root Window Access

The root window is special in many ways:

 * It does not need to be created

 * It cannot be destroyed, moved, or resized

 * Only one process can receive _<ButtonPress>_ and _<ButtonRelease>_
	events from it, and only one process can have the
	_SubstructureRedirect_ and _ResizeRedirect_ masks set

 * It has no physical parent window

Because of these properties, access to the root window via a Tk
widget presents some difficulties. First, the widget's window cannot
be created in the standard way; however, this problem may be solved by
providing a non-standard creation routine via the
_Tk\_SetClassProcs_ procedure described in [[5]](5.md).
Likewise, the event handling required by the root window
can be enabled in an extension, although some care is required
when enabling _<ButtonPress>_ and certain other events.
What really causes problems is the lack of a physical parent.
There are many places in Tk where it is assumed that only
toplevel widgets have no physical parent within the application;
this is reflected in the Tk source by the use of the _TK\_TOP\_LEVEL_ flag.
This flag is used to mean different things in different places.
In particular, the _TK\_TOP\_LEVEL_ flag may mean:

 * This window is a toplevel widget

 * This widget has a wrapper window

 * This widget's window is controlled by the window manager

 * This window is at the top of a physical window hierarchy
	within the current application

In the current version of Tk, toplevel widgets have all of these
properties, and no other widgets have any of these properties;
hence a single flag suffices.
If we create a widget whose window is the display's root, then this
is no longer the case; a root window has the last property, but not
the first three. For this reason, it is necessary to replace
the _TK\_TOP\_LEVEL_ flag with at least two distinct flags. A better
idea is to replace the _TK\_TOP\_LEVEL_ flag with four flags, one for each
of the properties listed above. \(Even in a standard Tk distribution,
this replacement is desirable for documentation reasons, since it will
indicate what property of a toplevel widget is important in the current
circumstances.\) We must also replace the _Tk\_IsTopLevel_ macro with
several macros, or just eliminate it entirely.

One possible set of flag names is:

 TK\_TOP\_LEVEL: this is a toplevel widget

 TK\_HAS\_WRAPPER: this window has a wrapper window

 TK\_WIN\_MANAGED: this window is controlled by the window manager

 TK\_TOP\_HIERARCHY: this window is at the top of a physical window hierarchy

# New Event Bindings and Substitutions

A window manager must be able to intercept certain events on the root
window that the standard Tk distribution doesn't recognize, and
it must be able to obtain information about those events. In particular,
it needs to respond to _<CirculateRequest>_, _<ConfigureRequest>_,
_<CreateNotify>_, _<MapRequest>_, and _<ResizeRequest>_ events.
These events are ignored by standard Tk, and need not be enabled by
default; however, they need to be included in the list of events recognized
by the Tk _[bind]_ command. Adding this facility is very simple.

Obtaining information about these events is also necessary.
This is usually done via %-substitutions in the _[bind]_ command;
however, there are two pieces of information that are necessary
for implementing a window manager that cannot be obtained via
the current %-substitution mechanism: the numerical X window ID,
required to handle _<CreateNotify>_ events, and the property name,
for handling _<PropertyNotify>_ events. This information could be
obtained by adding two new %-substitutions:

 %i: substitute the numerical window ID for the event

 %P: substitute the atom name for the property being changed

# Propagating <PropertyNotify> Events

In order to receive _<PropertyNotify>_ events from embedded windows,
the Tk event loop must handle events not just for windows that
are represented by _Tk\_Window_ structures, but also for their children.
One way to accomplish this is to add another flag for the
_Tk\_Window_ struct, and alter the event loop so that
it will also look at a window's parent, if the event is a
_<PropertyNotify>_ event. The relevant part of the Tk event loop
currently looks like this:

	winPtr = (TkWindow *) Tk_IdToWindow(eventPtr->xany.display, handlerWindow);
	if (winPtr == NULL) {
	    if (eventPtr->type == PropertyNotify) {
		TkSelPropProc(eventPtr);

	    }
	    return;

	}

If the flag for propagating _<PropertyNotify>_ events is
_TK\_PROP\_PROPCHANGE_, then the code above must be modified to look
approximately like this:

	winPtr = (TkWindow *) Tk_IdToWindow(eventPtr->xany.display, handlerWindow);
	if (winPtr == NULL) {
	    if (eventPtr->type != PropertyNotify) {
		return;

	    }
	    TkSelPropProc(eventPtr);
	    parentXId = (parent of handlerWindow);
	    winPtr = (TkWindow *) Tk_IdToWindow(eventPtr->xany.display, parentXId);
	    if (winPtr == NULL) {
		return;

	    }
	    if (!(winPtr->flags & TK_PROP_PROPCHANGE)) {
		return;

	    }
	    handlerWindow = parentXId;
	    return;

	}

# Patches

A patch \(against tk8.4a2\) that implements the changes described above
is available
<http://www.eecs.umich.edu/~mckay/computer/wmenablers.84a3.patch.gz> .

# Notes

_Andreas Kupries._ There was a _tkwm' patch once.
<http://www.neosoft.com/tcl/ftparchive/sorted/x11/tkwm/> 
<http://www.ensta.fr/internet/unix/window_managers/tkwm.html> 

# Copyright

This document is in the public domain.

Name change from tip/470.tip to tip/470.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63

TIP:		470
Title:		Reliable Access to OO Definition Context Object
State:		Final
Type:		Project
Tcl-Version:	8.7
Vote:		Done
Post-History:	
Version:	$Revision: 1.4 $
Author:		Donal Fellows <[email protected]>
Created:	23-Apr-2017
Keywords:	TclOO, metaprogramming

~ Abstract

This TIP makes it easier for people to write procedures to extend TclOO's
definition sublanguage.

~ Rationale

One of the fundamental features of Tcl is that you can extend it with more
capabilities by writing your own procedures (and other commands, if you prefer
the C API). However, it is somewhat awkward to do so when using TclOO, as the
'''oo::define''' and '''oo::objdefine''' commands don't make it easy to find
out what the context class or object is.

For example, in the ''oo::util'' package of Tcllib, the code for discovering
what the context class is includes this
[http://core.tcl-lang.org/tcllib/artifact/51d71f560ceb7d63?ln=77]:

|    # Get the name of the current class or class delegate 
|    set cls [namespace which [lindex [info level -1] 1]]

That is ugly, and won't even work reliably for getting the context object in
'''oo::objdefine''' as that can be entered into by multiple paths (i.e.,
there's a shortcut from '''oo::define''').

~ Proposed Change

I propose to make the existing '''self''' command in '''oo::define''', when
invoked without arguments, return the context class (provided it is evaluated
in the correct stack frame, as usual with definition commands).  Similarly, I
also propose to add a '''self''' command to the '''oo::objdefine''' system
that takes no arguments and returns the context object.

This will enable to code listed above in the ''Rationale'' to become:

|    # Get the name of the current class or class delegate 
|    set cls [uplevel 1 self]

In the C API, I propose adding a function:

 > Tcl_Object '''Tcl_GetDefineContextObject'''(Tcl_Interp *''interp'')

which will get the context object, or return NULL and put an error in the
interpreter if there is no context object in the frame or the context object
has been deleted. The functionality is that of '''TclOOGetDefineCmdContext'''
in ''tclOODefineCmds.c''
[http://core.tcl-lang.org/tcl/artifact/7d58f1a701168168?ln=682], but the text
of the error messages might be changed.

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
|
>

|

|

|
|
|

|

|

|
|

|
|

|

|
|
|
|

|

|
|

|

|
|
|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63

# TIP 470: Reliable Access to OO Definition Context Object
	State:		Final
	Type:		Project
	Tcl-Version:	8.7
	Vote:		Done
	Post-History:	

	Author:		Donal Fellows <[email protected]>
	Created:	23-Apr-2017
	Keywords:	TclOO, metaprogramming
-----

# Abstract

This TIP makes it easier for people to write procedures to extend TclOO's
definition sublanguage.

# Rationale

One of the fundamental features of Tcl is that you can extend it with more
capabilities by writing your own procedures \(and other commands, if you prefer
the C API\). However, it is somewhat awkward to do so when using TclOO, as the
**oo::define** and **oo::objdefine** commands don't make it easy to find
out what the context class or object is.

For example, in the _oo::util_ package of Tcllib, the code for discovering
what the context class is includes this
<http://core.tcl-lang.org/tcllib/artifact/51d71f560ceb7d63?ln=77> :

	    # Get the name of the current class or class delegate 
	    set cls [namespace which [lindex [info level -1] 1]]

That is ugly, and won't even work reliably for getting the context object in
**oo::objdefine** as that can be entered into by multiple paths \(i.e.,
there's a shortcut from **oo::define**\).

# Proposed Change

I propose to make the existing **self** command in **oo::define**, when
invoked without arguments, return the context class \(provided it is evaluated
in the correct stack frame, as usual with definition commands\).  Similarly, I
also propose to add a **self** command to the **oo::objdefine** system
that takes no arguments and returns the context object.

This will enable to code listed above in the _Rationale_ to become:

	    # Get the name of the current class or class delegate 
	    set cls [uplevel 1 self]

In the C API, I propose adding a function:

 > Tcl\_Object **Tcl\_GetDefineContextObject**\(Tcl\_Interp \*_interp_\)

which will get the context object, or return NULL and put an error in the
interpreter if there is no context object in the frame or the context object
has been deleted. The functionality is that of **TclOOGetDefineCmdContext**
in _tclOODefineCmds.c_
<http://core.tcl-lang.org/tcl/artifact/7d58f1a701168168?ln=682> , but the text
of the error messages might be changed.

# Copyright

This document has been placed in the public domain.

Name change from tip/471.tip to tip/471.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61

TIP:            471
Title:          Add [info linkedname] Introspection Command
Version:        $Revision: 1.1 $
Author:         Mathieu Lafon <[email protected]>
State:          Draft
Type:           Project
Created:        05-May-2017
Tcl-Version:    8.7
Vote:		Pending
Post-History:

~ Abstract

This TIP proposes to improve link variable introspection by providing a new
'''info linkedname''' command.

~ Rationale

This TIP is related to discussions about [457] and the '''-upvar''' extended
argument specifier. Adding an intropsection command to get the name of the
variable linked to is more Tcl-ish than automatically adding a local variable
with the linked name.  The proposed command is not restricted to [457] usage
as this can also be used for a link variable created by other means, using the
'''upvar''' command for example.

~ Specification of the Proposed Change

There should be a new subcommand of '''info''' created with the following syntax:

 > '''info linkedname''' ''varname''

The ''varname'' should be the name of a variable that has been linked to
another variable (e.g., with '''upvar''', '''global''', '''variable''' or
'''namespace upvar'''), and the result of the command will be the name of the
variable linked to.

~ Reference Implementation

The reference implementation is available in the info-linkedname
[http://core.tcl.tk/tcl/timeline?r=info-linkedname] branch.

The code is licensed under the BSD license.

~~ Implementation Notes

Depending on the linked variable, the name is found using different methods:

 * The name of a variable present in a hash table (globals, local variables
   created at runtime, ...) is retrieved using the hash key;

 * The name of an array element is built using the name of the array and the
   index name, retrieved using the hash key. A new field is added to the
   TclVarHashTable sructure to access the related array variable from the
   array element;

 * The name of a compiled local variable is searched in current or upper call
   frames.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|
|
|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61

# TIP 471: Add [info linkedname] Introspection Command

	Author:         Mathieu Lafon <[email protected]>
	State:          Draft
	Type:           Project
	Created:        05-May-2017
	Tcl-Version:    8.7
	Vote:		Pending
	Post-History:
-----

# Abstract

This TIP proposes to improve link variable introspection by providing a new
**info linkedname** command.

# Rationale

This TIP is related to discussions about [[457]](457.md) and the **-upvar** extended
argument specifier. Adding an intropsection command to get the name of the
variable linked to is more Tcl-ish than automatically adding a local variable
with the linked name.  The proposed command is not restricted to [[457]](457.md) usage
as this can also be used for a link variable created by other means, using the
**upvar** command for example.

# Specification of the Proposed Change

There should be a new subcommand of **info** created with the following syntax:

 > **info linkedname** _varname_

The _varname_ should be the name of a variable that has been linked to
another variable \(e.g., with **upvar**, **global**, **variable** or
**namespace upvar**\), and the result of the command will be the name of the
variable linked to.

# Reference Implementation

The reference implementation is available in the info-linkedname
<http://core.tcl.tk/tcl/timeline?r=info-linkedname>  branch.

The code is licensed under the BSD license.

## Implementation Notes

Depending on the linked variable, the name is found using different methods:

 * The name of a variable present in a hash table \(globals, local variables
   created at runtime, ...\) is retrieved using the hash key;

 * The name of an array element is built using the name of the array and the
   index name, retrieved using the hash key. A new field is added to the
   TclVarHashTable sructure to access the related array variable from the
   array element;

 * The name of a compiled local variable is searched in current or upper call
   frames.

# Copyright

This document has been placed in the public domain.

Name change from tip/472.tip to tip/472.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86

TIP:            472
Title:          Add Support for 0d Radix Prefix to Integer Literals
Version:        $Revision: 1.8 $
Author:         Venkat Iyer <[email protected]>
Author:         Brian Griffin <[email protected]>
State:          Accepted
Type:           Project
Vote:           Done
Created:        25-May-2017
Post-History:   
Tcl-Version:    8.7

~ Abstract

This TIP proposes adding support for a '''0d''' decimal radix prefix to
complement the existing '''0x''' hexidecimal, '''0o''' octal and '''0b'''
binary radix prefixes.

~ Rationale

Verilog (and other Hardware Description Languages) always (or at least since
the 1995 LRM) had a way to specify a decimal number explicitly.  Verilog uses
''''d343534''' to mean decimal, VHDL actually allows any radix from 2 to 16
using syntax, so you could explicitly force a decimal interpretation using
'''10#343534#'''.

Tcl now allows '''0b''' for binary in '''expr''' and '''format''', which is
similar to ''''b''' in Verilog.  And of course the '''0x''' prefix has always
been around.  Another use case would be to prevent false parsing of leading
zeroes in '''clock format'''s as octal, without having to go through a
'''scan'''.

But a more elegant reason is that it makes the radix definition consistent, so

 1. all valid input radixes have a consistent unambiguous input literal
    format, and

 2. the '''d''' in '''format %d''' finally finds its complement in '''scan'''.

~ Specification

Extend the '''TclParseNumber''' function to recognize the prefixes '''0d'''
and '''0D''' as decimal integers.  It will have the same semantics as
'''0x''', but base 10 instead of base 16.  

Also extend format command '#' flag to produce the appropriate "0d" for 
the "%#d" conversion.  

~ Examples

It's an integer:

|   % expr {0d12 + 0d15}
|   27
|   % format "%#x" 0d1024
|   0x400
|   % format "%#d" 128
|   0d128

Errors same as other radix prefixes:

|   % expr { 0d317g }
|   invalid bareword "0d317g"
|   in expression " 0d317g ";
|   should be "$0d317g" or "{0d317g}" or "0d317g(...)" or ...
|   % expr { 0x1.53 }
|   missing operator at _@_
|   in expression " 0x1_@_.53 "
|   % expr {0d7.23}
|   missing operator at _@_
|   in expression "0d7_@_.23"

~ Compatibility

Currently, literals beginning with '''0d''' and parsed as a number will
produce an error.  Any code expecting such an error would fail to produce an
error an thus have a change in behavior.  I would expect this situation to be
uncommon.

~ Implementation

An implementation can be found the fossil on the "bsg-0d-radix-prefix" branch, including %#d conversion support.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|
|
|

|

|
|

|
|

|

|

|
|
|

|
|

|

|
|
|
|
|
|

|
|
|
|
|
|
|
|
|
|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86

# TIP 472: Add Support for 0d Radix Prefix to Integer Literals

	Author:         Venkat Iyer <[email protected]>
	Author:         Brian Griffin <[email protected]>
	State:          Accepted
	Type:           Project
	Vote:           Done
	Created:        25-May-2017
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes adding support for a **0d** decimal radix prefix to
complement the existing **0x** hexidecimal, **0o** octal and **0b**
binary radix prefixes.

# Rationale

Verilog \(and other Hardware Description Languages\) always \(or at least since
the 1995 LRM\) had a way to specify a decimal number explicitly.  Verilog uses
**'d343534** to mean decimal, VHDL actually allows any radix from 2 to 16
using syntax, so you could explicitly force a decimal interpretation using
**10\#343534\#**.

Tcl now allows **0b** for binary in **expr** and **format**, which is
similar to **'b** in Verilog.  And of course the **0x** prefix has always
been around.  Another use case would be to prevent false parsing of leading
zeroes in **clock format**s as octal, without having to go through a
**scan**.

But a more elegant reason is that it makes the radix definition consistent, so

 1. all valid input radixes have a consistent unambiguous input literal
    format, and

 2. the **d** in **format %d** finally finds its complement in **scan**.

# Specification

Extend the **TclParseNumber** function to recognize the prefixes **0d**
and **0D** as decimal integers.  It will have the same semantics as
**0x**, but base 10 instead of base 16.  

Also extend format command '\#' flag to produce the appropriate "0d" for 
the "%\#d" conversion.  

# Examples

It's an integer:

	   % expr {0d12 + 0d15}
	   27
	   % format "%#x" 0d1024
	   0x400
	   % format "%#d" 128
	   0d128

Errors same as other radix prefixes:

	   % expr { 0d317g }
	   invalid bareword "0d317g"
	   in expression " 0d317g ";
	   should be "$0d317g" or "{0d317g}" or "0d317g(...)" or ...
	   % expr { 0x1.53 }
	   missing operator at _@_
	   in expression " 0x1_@_.53 "
	   % expr {0d7.23}
	   missing operator at _@_
	   in expression "0d7_@_.23"

# Compatibility

Currently, literals beginning with **0d** and parsed as a number will
produce an error.  Any code expecting such an error would fail to produce an
error an thus have a change in behavior.  I would expect this situation to be
uncommon.

# Implementation

An implementation can be found the fossil on the "bsg-0d-radix-prefix" branch, including %\#d conversion support.

# Copyright

This document has been placed in the public domain.

Name change from tip/473.tip to tip/473.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53

TIP:		473
Title:		Allow a Defined Target Namespace in oo::copy
State:		Final
Type:		Project
Tcl-Version:	8.6.7
Vote:		Done
Post-History:	
Version:	$Revision: 1.4 $
Author:		Donal Fellows <[email protected]>
Created:	06-Jun-2017
Keywords:	Tcl, missing functionality, bugfix

~ Abstract

This TIP adds functionality to '''oo::copy''' to allow the created copy to
have a defined namespace, much as '''oo::class''''s '''createWithNamespace'''
method allows such a namespace to be given on normal object creation.

~ Rationale

Due to an oversight, the '''oo::copy''' command is missing the ability to have
an explicit namespace name specified to use as the instance namespace of the
target object. It was always intended to have this (and the functionality is
there in the C API), but it was omitted from the Tcl-level interface.

Having this capability allows objects to be used as factories for namespaces,
which is in many ways an inversion of the way that TclOO was designed (with
namespaces as the basis for objects). It was requested by Nathan Coulter as a
way to enable more complex behaviour in Rivet and NaviServer. See Tcl Issue
dd3b844fda [http://core.tcl.tk/tcl/tktview/dd3b844fdabdeae5fcb0] for more
information.

~ Proposed Change

I propose to add one more optional argument to '''oo::copy''',
''targetNamespace'', that if provided and non-empty will be the name of a
namespace (resolved relative to the current namespace if not an absolute name)
that will be the name of the newly created target object's instance namespace.
The named namespace must not already exist. Note that specifying the
''targetObject'' as the empty string will cause the object's command to be
automatically chosen.

 > '''oo::copy''' ''sourceObject'' ?''targetObject''? ?''targetNamespace''?

The meaning of the result of the command is unchanged.

~ Implementation

See the oo-copy-ns branch. [http://core.tcl.tk/tcl/timeline?r=oo-copy-ns]

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
|
>

|

|
|

|

|

|
|

|
|

|

|

|
|
|

|

|

|

|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53

# TIP 473: Allow a Defined Target Namespace in oo::copy
	State:		Final
	Type:		Project
	Tcl-Version:	8.6.7
	Vote:		Done
	Post-History:	

	Author:		Donal Fellows <[email protected]>
	Created:	06-Jun-2017
	Keywords:	Tcl, missing functionality, bugfix
-----

# Abstract

This TIP adds functionality to **oo::copy** to allow the created copy to
have a defined namespace, much as **oo::class**'s **createWithNamespace**
method allows such a namespace to be given on normal object creation.

# Rationale

Due to an oversight, the **oo::copy** command is missing the ability to have
an explicit namespace name specified to use as the instance namespace of the
target object. It was always intended to have this \(and the functionality is
there in the C API\), but it was omitted from the Tcl-level interface.

Having this capability allows objects to be used as factories for namespaces,
which is in many ways an inversion of the way that TclOO was designed \(with
namespaces as the basis for objects\). It was requested by Nathan Coulter as a
way to enable more complex behaviour in Rivet and NaviServer. See Tcl Issue
dd3b844fda <http://core.tcl.tk/tcl/tktview/dd3b844fdabdeae5fcb0>  for more
information.

# Proposed Change

I propose to add one more optional argument to **oo::copy**,
_targetNamespace_, that if provided and non-empty will be the name of a
namespace \(resolved relative to the current namespace if not an absolute name\)
that will be the name of the newly created target object's instance namespace.
The named namespace must not already exist. Note that specifying the
_targetObject_ as the empty string will cause the object's command to be
automatically chosen.

 > **oo::copy** _sourceObject_ ?_targetObject_? ?_targetNamespace_?

The meaning of the result of the command is unchanged.

# Implementation

See the oo-copy-ns branch. <http://core.tcl.tk/tcl/timeline?r=oo-copy-ns> 

# Copyright

This document has been placed in the public domain.

Name change from tip/48.tip to tip/48.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
TIP:            48
Title:          Tk Widget Styling Support
Version:        $Revision: 1.20 $
Author:         Fr�d�ric Bonnet <[email protected]>
Author:         Fr�d�ric Bonnet <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        23-Jul-2001
Post-History:   
Discussions-To: news:comp.lang.tcl
Tcl-Version:    8.4

~ Abstract

The Tk Toolkit is one of the last major GUI toolkits lacking themes
support.  This TIP proposes several changes to widget design that
allows custom code to be provided for widget element handling in a
transparent and extensible fashion.  User-provided code may then be
used to alter the widgets' look without the need to alter the Tk core.
The proposed changes induce no loss of compatibility, and only slight
core changes are needed with no side effect on existing functionality.

~ Background

The Tk Toolkit appeared on X-Window systems at a time where Motif was
the ''de facto'' standard for GUI development.  It thus naturally
adopted Motif's look&feel and its famous 3D border style.  First ports
to non-X platforms such as Windows and MacOS kept the Motif style,
which disappointed many users who felt Tk applications look "foreign".
Version 8.0 released around 1996 added native look&feel on these
platforms.

Recently, other Open Source toolkits such as Qt (used by the KDE
project) and GTK (used by the GIMP graphics editing software and the
Gnome project) emerged as powerful and free alternatives to Motif for
X-Window GUI development.  The rapidly growing success of Open Source
systems such as GNU/Linux helped both toolkits attract a vast
community of developers, and the firm (and sometimes friendly)
competition between both communities led to an explosion of new
features.  Thirst for freedom and customizability created the need for
themeability.

The current implementation of Tk only provides native look&feel on
supported platforms (Windows, X-Window, MacOS).  This lack partly
explains Tk's loss of mind-share, especially amongst Linux developers,
where theme support is considered a "cool" or must-have feature.

While yesterday's goal of many GUIs was cross-platform visual
uniformity (Qt and GTK borrowed much of their visual appearance from
Windows, which borrowed earlier from NeXTStep), it is now quite common
to find huge visual differences on today's desktops, even on similar
systems.  Screenshot contests are quite common nowadays.

~ Rationale

Tk first kept away from the toolkit war.  Tk's and its competitors'
philosophies are radically opposite.  Tk favors high level
abstractions and scripting languages such as Tcl, whereas Qt and GTK
developments are primarily done using C or C++ (which Tcl/Tk advocates
believe to be The Wrong Way).  But despite Tk's power, flexibility and
ease of use, it has lost serious mind-share, especially amongst
newcomers and Linux users who don't care about its cross-platform
capabilities.

Many Tk users may see themes support as cosmetic or of lower
importance than much needed features such as megawidgets or
objectification.  Nevertheless, this is a critical feature to be
implemented for the long-term viability of Tk.  Many courses are now
promoting Qt, GTK or (aarggg!) Swing in place of Motif, leaving no
room for Tk.  Whatever its qualities (cross-platform, performance,
ease of use, internationalization and Unicode support), the lack of
themeability will always be seen as one of the main reasons for not
using Tk.  Applications using Tk instead of GTK will look as "foreign"
on pixmap-themed Linux desktop, or even on newer MacOS and Windows
versions, as pre-8.0 applications were on non-X desktops.

The lack of themeability is neither a fatality nor difficult to solve.
Tk already allows colors, fonts and border width and relief to be
specified for all widgets.  What is currently missing is pixmap
themeing and border styles.  The current proposal describes the
required building blocks for theme support that are both easy to
implement and backward compatible.

A straightforward solution would be the one introduced by the
Dash-patch in the form of new widget options such as ''-tile''.  This
approach suffers from several major drawbacks:

  * A lot of new options are needed to handle the many ways of drawing
    pixmap tiles, such as anchoring, repeating, or scaling.

  * With the introduction of new options such as
    ''-activebackground'', tile-related options must be duplicated for
    each widget state (normal, active, disabled...), thus cluttering
    the options namespace more and thus raising the learning curve.

  * Applying a theme to a whole widget hierarchy implies traversing
    the whole tree and applying a lot of options to each widget.

  * Memory consumption is increased for all widgets, even in the case
    when these options are not used.

Moreover, one of the main goals of a theme being to enforce overall
visual consistency, multiplying new options should be avoided.  A
theme is designed to gather these options into one place so that they
can be shared by numerous widgets while avoiding performance or memory
hit.  A carefully designed theme engine should then only add one new
option per widget to set its ''style'' (an essential part of a theme).

How far should themeabitily go? A previous version of this document
proposed to extend the current 3D border mechanism to allow custom
drawing code.  Although this proposal was simple, backward compatible
and covered most of the needs for themeability (border style often
represents the largest part of the visual appearance), it failed to
address other significant parts of the user interface.  These include
radio and check marks, scrollbar arrows, sliders, and other widget
''elements''.  From this point of view, the border is only an
''element'' of a widget.  A complete theme engine should then allow
each UI element to be customized, while maximizing code reuse and
preserving compatibility.  To suit this model, widgets should then be
thought of as assembly of elements, and no more as monolithic
constructs.  This implies a paradigm shift in the way widgets are
''designed'' (but not necessarily in the way they are ''used'').
Actually, the notion of ''element'' is not foreign to Tk, since some
widgets (scrollbars) use the same term to identify their subparts.

~ A quick look at existing implementations

The two major toolkits supporting widget styles are Qt and GTK+.  Both
seem to follow the same path, but in slightly distinct manners: they
define a fixed set of common elements (arrows, checkmarks...) and
associate each with one or several API calls.  While Qt follows the
OO-path, GTK+ uses a more traditional procedural API model.

Qt defines a generic ''QStyle'' class which is the base class for all
styles (Windows, Motif...).  QStyle-derived classes implement a number
of virtual member methods, each being used to draw or compute the
geometry of the many elements.  Thanks to polymorphism, widgets can
then use any style derived from this base class.

Contrary to the C++ -based Qt that defines a class gathering all
style-related methods, GTK+ is C-based and defines individual
procedures (e.g. ''gtk_draw_slider'').

But overall, both use the same model: a predefined (albeit potentially
extensible) set of elements, and associated overloadable
methods/procs.  Adding new elements implies recompilation and/or code
changes.  While it is hardly seen as a problem with Qt and GTK+, since
both target C/C++ programming, it doesn't fit the Tcl/Tk model at all.

~ Proposal (or There Must Be A Better Way)

This document describes a generic and extensible element handling
mechanism.  This mechanism allows elements to be created and/or
overloaded at run-time in a modular fashion.

Widgets are composed of elements.  For instance, a scrollbar is made
out of arrows, a trough, and a slider.  Each element must be declared
<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|
|
|

|

|

|
|

|

|
|

|
|
|

|

|
|

|

|
|

|
|

|
|
|

|

|

|

|

|
|

|
|
|

|
|

|
|

|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157

# TIP 48: Tk Widget Styling Support

	Author:         Frédéric Bonnet <[email protected]>
	Author:         Frédéric Bonnet <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        23-Jul-2001
	Post-History:   
	Discussions-To: news:comp.lang.tcl
	Tcl-Version:    8.4
-----

# Abstract

The Tk Toolkit is one of the last major GUI toolkits lacking themes
support.  This TIP proposes several changes to widget design that
allows custom code to be provided for widget element handling in a
transparent and extensible fashion.  User-provided code may then be
used to alter the widgets' look without the need to alter the Tk core.
The proposed changes induce no loss of compatibility, and only slight
core changes are needed with no side effect on existing functionality.

# Background

The Tk Toolkit appeared on X-Window systems at a time where Motif was
the _de facto_ standard for GUI development.  It thus naturally
adopted Motif's look&feel and its famous 3D border style.  First ports
to non-X platforms such as Windows and MacOS kept the Motif style,
which disappointed many users who felt Tk applications look "foreign".
Version 8.0 released around 1996 added native look&feel on these
platforms.

Recently, other Open Source toolkits such as Qt \(used by the KDE
project\) and GTK \(used by the GIMP graphics editing software and the
Gnome project\) emerged as powerful and free alternatives to Motif for
X-Window GUI development.  The rapidly growing success of Open Source
systems such as GNU/Linux helped both toolkits attract a vast
community of developers, and the firm \(and sometimes friendly\)
competition between both communities led to an explosion of new
features.  Thirst for freedom and customizability created the need for
themeability.

The current implementation of Tk only provides native look&feel on
supported platforms \(Windows, X-Window, MacOS\).  This lack partly
explains Tk's loss of mind-share, especially amongst Linux developers,
where theme support is considered a "cool" or must-have feature.

While yesterday's goal of many GUIs was cross-platform visual
uniformity \(Qt and GTK borrowed much of their visual appearance from
Windows, which borrowed earlier from NeXTStep\), it is now quite common
to find huge visual differences on today's desktops, even on similar
systems.  Screenshot contests are quite common nowadays.

# Rationale

Tk first kept away from the toolkit war.  Tk's and its competitors'
philosophies are radically opposite.  Tk favors high level
abstractions and scripting languages such as Tcl, whereas Qt and GTK
developments are primarily done using C or C\+\+ \(which Tcl/Tk advocates
believe to be The Wrong Way\).  But despite Tk's power, flexibility and
ease of use, it has lost serious mind-share, especially amongst
newcomers and Linux users who don't care about its cross-platform
capabilities.

Many Tk users may see themes support as cosmetic or of lower
importance than much needed features such as megawidgets or
objectification.  Nevertheless, this is a critical feature to be
implemented for the long-term viability of Tk.  Many courses are now
promoting Qt, GTK or \(aarggg!\) Swing in place of Motif, leaving no
room for Tk.  Whatever its qualities \(cross-platform, performance,
ease of use, internationalization and Unicode support\), the lack of
themeability will always be seen as one of the main reasons for not
using Tk.  Applications using Tk instead of GTK will look as "foreign"
on pixmap-themed Linux desktop, or even on newer MacOS and Windows
versions, as pre-8.0 applications were on non-X desktops.

The lack of themeability is neither a fatality nor difficult to solve.
Tk already allows colors, fonts and border width and relief to be
specified for all widgets.  What is currently missing is pixmap
themeing and border styles.  The current proposal describes the
required building blocks for theme support that are both easy to
implement and backward compatible.

A straightforward solution would be the one introduced by the
Dash-patch in the form of new widget options such as _-tile_.  This
approach suffers from several major drawbacks:

  * A lot of new options are needed to handle the many ways of drawing
    pixmap tiles, such as anchoring, repeating, or scaling.

  * With the introduction of new options such as
    _-activebackground_, tile-related options must be duplicated for
    each widget state \(normal, active, disabled...\), thus cluttering
    the options namespace more and thus raising the learning curve.

  * Applying a theme to a whole widget hierarchy implies traversing
    the whole tree and applying a lot of options to each widget.

  * Memory consumption is increased for all widgets, even in the case
    when these options are not used.

Moreover, one of the main goals of a theme being to enforce overall
visual consistency, multiplying new options should be avoided.  A
theme is designed to gather these options into one place so that they
can be shared by numerous widgets while avoiding performance or memory
hit.  A carefully designed theme engine should then only add one new
option per widget to set its _style_ \(an essential part of a theme\).

How far should themeabitily go? A previous version of this document
proposed to extend the current 3D border mechanism to allow custom
drawing code.  Although this proposal was simple, backward compatible
and covered most of the needs for themeability \(border style often
represents the largest part of the visual appearance\), it failed to
address other significant parts of the user interface.  These include
radio and check marks, scrollbar arrows, sliders, and other widget
_elements_.  From this point of view, the border is only an
_element_ of a widget.  A complete theme engine should then allow
each UI element to be customized, while maximizing code reuse and
preserving compatibility.  To suit this model, widgets should then be
thought of as assembly of elements, and no more as monolithic
constructs.  This implies a paradigm shift in the way widgets are
_designed_ \(but not necessarily in the way they are _used_\).
Actually, the notion of _element_ is not foreign to Tk, since some
widgets \(scrollbars\) use the same term to identify their subparts.

# A quick look at existing implementations

The two major toolkits supporting widget styles are Qt and GTK\+.  Both
seem to follow the same path, but in slightly distinct manners: they
define a fixed set of common elements \(arrows, checkmarks...\) and
associate each with one or several API calls.  While Qt follows the
OO-path, GTK\+ uses a more traditional procedural API model.

Qt defines a generic _QStyle_ class which is the base class for all
styles \(Windows, Motif...\).  QStyle-derived classes implement a number
of virtual member methods, each being used to draw or compute the
geometry of the many elements.  Thanks to polymorphism, widgets can
then use any style derived from this base class.

Contrary to the C\+\+ -based Qt that defines a class gathering all
style-related methods, GTK\+ is C-based and defines individual
procedures \(e.g. _gtk\_draw\_slider_\).

But overall, both use the same model: a predefined \(albeit potentially
extensible\) set of elements, and associated overloadable
methods/procs.  Adding new elements implies recompilation and/or code
changes.  While it is hardly seen as a problem with Qt and GTK\+, since
both target C/C\+\+ programming, it doesn't fit the Tcl/Tk model at all.

# Proposal \(or There Must Be A Better Way\)

This document describes a generic and extensible element handling
mechanism.  This mechanism allows elements to be created and/or
overloaded at run-time in a modular fashion.

Widgets are composed of elements.  For instance, a scrollbar is made
out of arrows, a trough, and a slider.  Each element must be declared

︙ ︙ 
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
Elements are declared along with an implementation.  This declaration
can be made by the system or by widgets themselves, and at run-time,
thus allowing extensions to create new and use or derive existing
elements.

Implementations are registered in a given style engine.  A style
engine is thus a collection of element implementations.  Style engines
can be declared at run-time as well, but are static (since they
provide compiled code).  Style engines can be layered in order to
reuse and redefine existing elements implementations, falling back to
the default, core-defined engine.

A style is an instance of a style engine.  Styles can be given client
data information that would carry style engine-specific data.  For
example, a style engine implementing pixmapped elements could be given
the pixmaps to use.  Styles can be created and deleted at run-time.

Using this scheme, a widget can register elements and their default
implementation, but actually use a custom implementation code in a
transparent manner depending on its currently applied style.
Moreover, elements can be shared across widgets, new elements can be
registered dynamically and used transparently.  New widgets could also
be built in a modular fashion and easily reuse other widget's
elements.  The proposed mechanism could then be used in a
megawidget-like fashion (we could speak about megaelement widgets).
Last, it provides a dynamic hook mechanism for overriding the core
widget code from loadable extensions, avoiding the need for
maintaining core patches.

~ Functional Specification

 Style engines: Style engines gather code for handling a set of
	elements.  For this reason, they are inherently static, alike
	''Tcl_ObjType''s.  They can be registered at run-time,
	queried, but never unregistered, since external style engines
	will usually be provided by loadable packages, and that Tcl
	does not support library unloading.

 Styles: Styles are instances of style engines.  While engines are
	static, styles can be dynamic.  All styles of the same engine
	use the same code for handling elements, but using different
	data provided at creation-time.  For example, a generic pixmap
	engine may be instantiated by several styles providing a
	different set of pixmaps.  Styles can be created at run-time,
	queried, and freed.  Since they are user-visible entities, a
	''Tcl_Obj''-based interface is also provided.

 Elements: Elements are virtual entities.  An element only exists if
	an implementation has been provided.  Thus, elements are
	created implicitly.  They can be queried, but not destroyed.
	Upon creation, elements are given a unique ID that remains
	valid for the entire application life time and is used
	subsequently for all related calls.  It serves as a numerical

|
|

|

|

|

|

165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
Elements are declared along with an implementation.  This declaration
can be made by the system or by widgets themselves, and at run-time,
thus allowing extensions to create new and use or derive existing
elements.

Implementations are registered in a given style engine.  A style
engine is thus a collection of element implementations.  Style engines
can be declared at run-time as well, but are static \(since they
provide compiled code\).  Style engines can be layered in order to
reuse and redefine existing elements implementations, falling back to
the default, core-defined engine.

A style is an instance of a style engine.  Styles can be given client
data information that would carry style engine-specific data.  For
example, a style engine implementing pixmapped elements could be given
the pixmaps to use.  Styles can be created and deleted at run-time.

Using this scheme, a widget can register elements and their default
implementation, but actually use a custom implementation code in a
transparent manner depending on its currently applied style.
Moreover, elements can be shared across widgets, new elements can be
registered dynamically and used transparently.  New widgets could also
be built in a modular fashion and easily reuse other widget's
elements.  The proposed mechanism could then be used in a
megawidget-like fashion \(we could speak about megaelement widgets\).
Last, it provides a dynamic hook mechanism for overriding the core
widget code from loadable extensions, avoiding the need for
maintaining core patches.

# Functional Specification

 Style engines: Style engines gather code for handling a set of
	elements.  For this reason, they are inherently static, alike
	_Tcl\_ObjType_s.  They can be registered at run-time,
	queried, but never unregistered, since external style engines
	will usually be provided by loadable packages, and that Tcl
	does not support library unloading.

 Styles: Styles are instances of style engines.  While engines are
	static, styles can be dynamic.  All styles of the same engine
	use the same code for handling elements, but using different
	data provided at creation-time.  For example, a generic pixmap
	engine may be instantiated by several styles providing a
	different set of pixmaps.  Styles can be created at run-time,
	queried, and freed.  Since they are user-visible entities, a
	_Tcl\_Obj_-based interface is also provided.

 Elements: Elements are virtual entities.  An element only exists if
	an implementation has been provided.  Thus, elements are
	created implicitly.  They can be queried, but not destroyed.
	Upon creation, elements are given a unique ID that remains
	valid for the entire application life time and is used
	subsequently for all related calls.  It serves as a numerical

︙ ︙ 
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596

	also provide a list of required widget options.  Elements
	would then pick the option values from the widget record
	according to the widget's option table.  In the case when the
	desired option is missing from the option table, the element
	would have to either try other options of fail gracefully and
	use sensible default values.

~ Detailed Specification

The proposal introduces a set of new public types and APIs, exported
from ''generic/tk.h'' and the stubs table.  The implementation induces
very slight and limited changes to the existing code, with only one
new private API added (''TkGetOptionSpec'' in ''generic/tkConfig.c'').
Most of the new code is concentrated into one file.  There is no side
effect on existing functionality.

''Types and constants''.

 TK_OPTION_STYLE:
	New ''Tk_OptionType'' usually associated with the ''-style''
	widget option.

 TK_STYLE_VERSION_1, TK_STYLE_VERSION:
	Version numbers of Tk style support. The former matches the 
	implementation described in this proposal. The latter is a shortcut to
	the current version. Future extensions may introduce new version numbers.

 Tk_StyleEngine:
	Opaque token for handling style engines.  May be NULL, meaning
	the default system engine.

 Tk_StyledElement:
	Opaque token holding a style-specific implementation of a
	given element.  Subsequently used for performing element ops.

 Tk_Style:
	Opaque token for handling styles.  May be NULL, meaning the
	default system style.

 Tk_GetElementSizeProc, Tk_GetElementBoxProc, Tk_GetElementBorderWidthProc, Tk_DrawElementProc:
	Implementations of various element operations.

| typedef void (Tk_GetElementSizeProc) _ANSI_ARGS_((ClientData clientData, 
| 	char *recordPtr, CONST Tk_OptionSpec **optionsPtr, Tk_Window tkwin,
| 	int width, int height, int inner, int *widthPtr, int *heightPtr));
|
| typedef void (Tk_GetElementBoxProc) _ANSI_ARGS_((ClientData clientData, 
| 	char *recordPtr, CONST Tk_OptionSpec **optionsPtr, Tk_Window tkwin,
| 	int x, int y, int width, int height, int inner, int *xPtr, int *yPtr, 
| 	int *widthPtr, int *heightPtr));
|
| typedef int (Tk_GetElementBorderWidthProc) _ANSI_ARGS_((ClientData clientData, 
| 	char *recordPtr, CONST Tk_OptionSpec **optionsPtr, Tk_Window tkwin));
|
| typedef void (Tk_DrawElementProc) _ANSI_ARGS_((ClientData clientData, 
| 	char *recordPtr, CONST Tk_OptionSpec **optionsPtr, Tk_Window tkwin,
| 	Drawable d, int x, int y, int width, int height, int state));

 Tk_ElementOptionSpec:
	Used to specify a list of required widget options, along with
	their type.  This info will be subsequently used to get option
	values from the widget record using its option table.

| typedef struct Tk_ElementOptionSpec {
|     char *name;
|     Tk_OptionType type;
| } Tk_ElementOptionSpec;

 Tk_ElementSpec:
	Static styled element definition. The version field must be set to 
	TK_STYLE_VERSION_1 in order to match the following structure.

| typedef struct Tk_ElementSpec {
|     int version;
|     char *name;
|     Tk_ElementOptionSpec *options;
|     Tk_GetElementSizeProc *getSize;
|     Tk_GetElementBoxProc *getBox;
|     Tk_GetElementBorderWidthProc *getBorderWidth;
|     Tk_DrawElementProc *draw;
| } Tk_ElementSpec;

 TK_ELEMENT_STATE_*:
	Flags used when drawing elements.  Elements may have a
	different visual appearance depending on their state.
	However, it should be noted that the element size is not
	affected by state changes.

| #define TK_ELEMENT_STATE_ACTIVE       (1<<0)
| #define TK_ELEMENT_STATE_DISABLED     (1<<1)
| #define TK_ELEMENT_STATE_FOCUS        (1<<2)
| #define TK_ELEMENT_STATE_PRESSED      (1<<3)

''Functions''.

 TkStylePkgInit, TkStylePkgFree:
	Internal procedures used to initialize the style subpackage on
	a per-application basis.

| void TkStylePkgInit (TkMainInfo *mainPtr)
| void TkStylePkgFree (TkMainInfo *mainPtr)

 TkGetOptionSpec:
	Internal function used to retrieve an option specifier from a
	compiled option table.

| CONST Tk_OptionSpec * TkGetOptionSpec (CONST char *name, 
| 	Tk_OptionTable optionTable);

 Tk_RegisterStyleEngine:
	Registers a new style engine.

| Tk_StyleEngine Tk_RegisterStyleEngine (char *name, Tk_StyleEngine parent)

 >	Name may be NULL, in which case it registers the default
	engine.  Returns a NULL token if an error occurred (e.g.
	registering an existing engine).

 Tk_GetStyleEngine:
	Returns a token to an existing style engine, or NULL.

| Tk_StyleEngine Tk_GetStyleEngine (char *name)

 Tk_RegisterStyledElement:
	Registers the implementation of an element for a given style
	engine.

| int Tk_RegisterStyledElement (Tk_StyleEngine engine, 
| 	Tk_ElementSpec *templatePtr)

 >	Element names use a dotted notation that gives a hierarchical
	search order.  For example, a widget requiring an element
	named "Scrollbar.vslider" can actually use the "vslider"
	generic element.  Apart from this dotted notation, element
	names are free-form.  However, conventions should be defined,
	such as capitalized widget classes, and lower case elements.
	Since whole widgets can act as elements, one can therefore
	register an element named "Scrollbar".

 >	The given pointer is not stored into internal structures, but
	is instead used to fill them.  Styled element specs can thus
	be allocated on the stack or dynamically, but in most cases
	they will be statically defined.

 Tk_GetElementId:
	Returns the unique numerical ID for an already registered
	element.

| int Tk_GetElementId (char *name)

 Tk_CreateStyle:
	Creates a new style as an instance of an existing style
	engine.

| Tk_Style Tk_CreateStyle (CONST char *name, Tk_StyleEngine engine, 
| 	ClientData clientData)

 >	Client data may be provided, that will be passed as is to
	element operations.

 Tk_GetStyle:
	Retrieves an existing style by its name.

| Tk_Style Tk_GetStyle (Tcl_Interp *interp, CONST char *name)

 >	Retrieves either an existing style by its name, or NULL if
	none was found.  In the latter case, leaves an error message
	in ''interp'' if it is not NULL.

 Tk_FreeStyle:
	Frees a style returned by ''Tk_CreateStyle'' or
	''Tk_GetStyle''.

| void Tk_FreeStyle (Tk_Style style)

 >	It actually decrements an internal reference count so that
	styles can be shared and deleted safely.

 Tk_NameOfStyle:
	Gets a style's name.

| CONST char * Tk_NameOfStyle (Tk_Style style)

 Tk_AllocStyleFromObj, Tk_GetStyleFromObj, Tk_FreeStyleFromObj:
	''Tcl_Obj'' based interface to styles.

| Tk_Style  Tk_AllocStyleFromObj (Tcl_Interp *interp, Tcl_Obj *objPtr)
| Tk_Style Tk_GetStyleFromObj (Tcl_Obj *objPtr)
| void  Tk_FreeStyleFromObj (Tcl_Obj *objPtr)

 >	''Tk_AllocStyleFromObj'' gets (doesn't create) an existing
	style from an object.  ''Tk_GetStyleFromObj'' returns the
	style already stored in the object's internal representation.
	The object must have been returned by
	''Tk_AllocStyleFromObj''.  ''Tk_FreeStyleFromObj'' frees the
	style held by the object. 

 Tk_GetStyledElement:
	Returns a token for the styled element for use with widgets
	having the given ''optionTable''.

| Tk_StyledElement Tk_GetStyledElement (Tk_Style style, int elementId, 
| 	Tk_OptionTable optionTable)

 >	Returns a token for the styled element (or NULL if not found),
	for use with widgets having the given optionTable.  The token
	is persistent and doesn't need to be freed, so it can be
	safely stored if needed (although using element IDs is the
	preferred method).  It is used in subsequent element
	operations and avoids repeated lookups.  The lookup algorithm
	works as follows:

 > 1.	Look for an implementation of the given element in the current
	style engine.

 > 2.	If none was found, traverse the chain of engines (each but the
	default engine has a parent) until the default engine is
	reached.

 > 3.	Restart at step 1 with the base element name instead.  For
	example, if we are looking for "foo.bar.baz", then look for
	"bar.baz" then "baz", until we find an implementation.  

 >	If no implementation was found, then a panic is generated,
	meaning that some dependency has not been resolved.  In the
	general case, this won't happen for core widgets (because they
	only use core elements), and new widgets either have to rely
	on core or package-provided elements, or define their own.

 Tk_GetElementSize, Tk_GetElementBox, Tk_GetElementBorderWidth, Tk_DrawElement:
	Various element operations.

| void Tk_GetElementSize (Tk_Style style, Tk_StyledElement element, 
| 	char *recordPtr, Tk_Window tkwin, int width, int height, 
| 	int inner, int *widthPtr, int *heightPtr)
| void Tk_GetElementBox (Tk_Style style, Tk_StyledElement element, 
| 	char *recordPtr, Tk_Window tkwin, int x, int y, int width, 
| 	int height, int inner, int *xPtr, int *yPtr, int *widthPtr, 
| 	int *heightPtr)
| int Tk_GetElementBorderWidth (Tk_Style style, Tk_StyledElement element, 
| 	char *recordPtr, Tk_Window tkwin)
| void Tk_DrawElement (Tk_Style style, Tk_StyledElement element, 
| 	char *recordPtr, Tk_Window tkwin, Drawable d, int x, int y, 
| 	int width, int height, int state)

 >	The first two are used for geometry management.  First one
	only computes the size, while second one computes the box
	coordinates.  The ''inner'' parameter is a boolean that
	controls whether the inner (FALSE) or outer (TRUE) geometry is
	requested from the maximum outer/minimum inner geometry.
	Third one returns the uniform internal border width of the
	element and is mostly intended for whole widgets.  Last one
	draws the element using the given geometry and state.

~ Implementation

An implementation has been written and completed with respect to the
present specification.  A patch for Tk 8.4a3 is available at:

 > http://www.purl.org/NET/bonnet/pub/style.patch

The ''square'' widget implemented in the test file
''generic/tkSquare.c'' has also been rewritten to use the new API for
its square element.  It demonstrates basic features.  Patch file:

 > http://www.purl.org/NET/bonnet/pub/squarestyle.patch

The sample code registers an element "Square.square" in the default
style engine.  This element is used by the square widget in its
drawing code.  A new style engine "fixedborder" is registered, and
code is provided for the "Square.square" element.  This style engine
draws the element's border using a fixed border width given as client
data by instantiated styles.  Four styles are created as instances of
the "fixedborder" element: "flat", "border2", "border4" and "border8"
(0, 2, 4 and 8 pixel-wide borders).

Sample test session:

| pack [square .s]
| .s config -style
| .s config -style flat
| .s config -style border2
| .s config -style border4
| .s config -style border8
| .s config -style ""
| pack [square .s2]
| .s2 config -style border2
| .s2 config -style border8

~ Performance and memory usage

The provided design and implementation is geared towards the best
compromise between performance and memory consumption.

Critical performance bottleneck is element querying.  In order to
minimize element access times, elements are identified by unique IDs
that act as indexes within internal tables, allowing direct
addressing.  Hash tables are used internally by all name pools
(engines, styles, elements).  Static structures are used whenever
possible (for styled element registration, indirectly through
widgets' option tables...).  Widget processing times are increased by
the extra procedure calls and indirections, but that is the price to
pay for better modularity anyway.  Additional calls are kept minimal.

Per-widget memory consumption is minimal.  A widget usually only needs
to store its current style.  Element IDs can (should?) be shared
globally across widgets of the same class and don't need to be stored
in the widget record.  Moreover, most information is shared internally
across widgets of the same class (identified by their option table).
Many caching & fast lookup techniques are used throughout the code.

~ Compatibility

Existing widgets will need to be rewritten in order to become
style-aware.  The required code changes may be significant (implying
code modularization).  However, no incompatibility is introduced.
Thus, migrating widgets from the old to the new model can follow a
smooth path, similar to that needed for the transition to ''Tcl_Obj''
interfaces.  Besides, widgets as a whole can act as elements, which
may shift the amount of work from the core to the style engines at the
expense of a lesser modularity and code reuse.

~ Future improvements or changes

 * Additional APIs for querying the list of engines, styles,
   elements...

 * Additional operations for elements, e.g. hit tests.

 * Script-level interfaces.

 * Optional translation tables between real widget options and needed
   element options, e.g. ''-elementborderwidth'' => ''-borderwidth''.

 * How to handle native widgets? They will certainly have to be
   provided as whole elements.

 * Current implementation uses thread-local storage for holding
   dynamic data.  Since most data is not thread-specific, this could
   be changed to a more memory-efficient scheme.

 * Provide man pages and tests.

 * Additional hidden/private option flag for accessing some widgets'
   non-configurable data (e.g. scrollbar position) through option
   tables.

~ Glossary

 Element: Part of a widget (e.g. a checkbox mark or a scrollbar
	arrow), usually active.

 Style: The visual appearance of a widget.  May include colors, skins,
        tiles, border drawing style (Windows, Motif...), element
        pictures.

 Styled element: A style-specific implementation of a widget element.

 Style engine: A visually consistent collection of styled elements.

 Theme: A collection of graphical elements giving a consistent
        appearance to a whole widget hierarchy, application, or
        desktop.  A theme is usually made up out of icons, colors,
        fonts, widget styles, or even desktop background and sounds.

~ Copyright

This document has been placed in the public domain.

|

|

|

|

|
|

|

|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|

|
|
|
|

|

|

|
|
|
|
|
|
|
|
|

|

|
|
|
|

|

|
|

|
|

|

|

|
|

|

|

|

|
|

|

|

|

|
|

|

|

|

|
|
|

|

|

|

|
|

|
|
|

|
|

|

|

|

|
|

|

|
|

|
|

|
|

|

|
|
|
|
|
|
|
|
|
|
|
|

|
|

|

|

|
|

|

|

|
|
|
|
|
|
|
|
|
|

|

|
|
|

|

|

|

|
|

|

|

|

|

|

|
|

|

|

>
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
	also provide a list of required widget options.  Elements
	would then pick the option values from the widget record
	according to the widget's option table.  In the case when the
	desired option is missing from the option table, the element
	would have to either try other options of fail gracefully and
	use sensible default values.

# Detailed Specification

The proposal introduces a set of new public types and APIs, exported
from _generic/tk.h_ and the stubs table.  The implementation induces
very slight and limited changes to the existing code, with only one
new private API added \(_TkGetOptionSpec_ in _generic/tkConfig.c_\).
Most of the new code is concentrated into one file.  There is no side
effect on existing functionality.

_Types and constants_.

 TK\_OPTION\_STYLE:
	New _Tk\_OptionType_ usually associated with the _-style_
	widget option.

 TK\_STYLE\_VERSION\_1, TK\_STYLE\_VERSION:
	Version numbers of Tk style support. The former matches the 
	implementation described in this proposal. The latter is a shortcut to
	the current version. Future extensions may introduce new version numbers.

 Tk\_StyleEngine:
	Opaque token for handling style engines.  May be NULL, meaning
	the default system engine.

 Tk\_StyledElement:
	Opaque token holding a style-specific implementation of a
	given element.  Subsequently used for performing element ops.

 Tk\_Style:
	Opaque token for handling styles.  May be NULL, meaning the
	default system style.

 Tk\_GetElementSizeProc, Tk\_GetElementBoxProc, Tk\_GetElementBorderWidthProc, Tk\_DrawElementProc:
	Implementations of various element operations.

	 typedef void (Tk_GetElementSizeProc) _ANSI_ARGS_((ClientData clientData, 
	 	char *recordPtr, CONST Tk_OptionSpec **optionsPtr, Tk_Window tkwin,
	 	int width, int height, int inner, int *widthPtr, int *heightPtr));

	 typedef void (Tk_GetElementBoxProc) _ANSI_ARGS_((ClientData clientData, 
	 	char *recordPtr, CONST Tk_OptionSpec **optionsPtr, Tk_Window tkwin,
	 	int x, int y, int width, int height, int inner, int *xPtr, int *yPtr, 
	 	int *widthPtr, int *heightPtr));

	 typedef int (Tk_GetElementBorderWidthProc) _ANSI_ARGS_((ClientData clientData, 
	 	char *recordPtr, CONST Tk_OptionSpec **optionsPtr, Tk_Window tkwin));

	 typedef void (Tk_DrawElementProc) _ANSI_ARGS_((ClientData clientData, 
	 	char *recordPtr, CONST Tk_OptionSpec **optionsPtr, Tk_Window tkwin,
	 	Drawable d, int x, int y, int width, int height, int state));

 Tk\_ElementOptionSpec:
	Used to specify a list of required widget options, along with
	their type.  This info will be subsequently used to get option
	values from the widget record using its option table.

	 typedef struct Tk_ElementOptionSpec {
	     char *name;
	     Tk_OptionType type;
	 } Tk_ElementOptionSpec;

 Tk\_ElementSpec:
	Static styled element definition. The version field must be set to 
	TK\_STYLE\_VERSION\_1 in order to match the following structure.

	 typedef struct Tk_ElementSpec {
	     int version;
	     char *name;
	     Tk_ElementOptionSpec *options;
	     Tk_GetElementSizeProc *getSize;
	     Tk_GetElementBoxProc *getBox;
	     Tk_GetElementBorderWidthProc *getBorderWidth;
	     Tk_DrawElementProc *draw;
	 } Tk_ElementSpec;

 TK\_ELEMENT\_STATE\_\*:
	Flags used when drawing elements.  Elements may have a
	different visual appearance depending on their state.
	However, it should be noted that the element size is not
	affected by state changes.

	 #define TK_ELEMENT_STATE_ACTIVE       (1<<0)
	 #define TK_ELEMENT_STATE_DISABLED     (1<<1)
	 #define TK_ELEMENT_STATE_FOCUS        (1<<2)
	 #define TK_ELEMENT_STATE_PRESSED      (1<<3)

_Functions_.

 TkStylePkgInit, TkStylePkgFree:
	Internal procedures used to initialize the style subpackage on
	a per-application basis.

	 void TkStylePkgInit (TkMainInfo *mainPtr)
	 void TkStylePkgFree (TkMainInfo *mainPtr)

 TkGetOptionSpec:
	Internal function used to retrieve an option specifier from a
	compiled option table.

	 CONST Tk_OptionSpec * TkGetOptionSpec (CONST char *name, 
	 	Tk_OptionTable optionTable);

 Tk\_RegisterStyleEngine:
	Registers a new style engine.

	 Tk_StyleEngine Tk_RegisterStyleEngine (char *name, Tk_StyleEngine parent)

 >	Name may be NULL, in which case it registers the default
	engine.  Returns a NULL token if an error occurred \(e.g.
	registering an existing engine\).

 Tk\_GetStyleEngine:
	Returns a token to an existing style engine, or NULL.

	 Tk_StyleEngine Tk_GetStyleEngine (char *name)

 Tk\_RegisterStyledElement:
	Registers the implementation of an element for a given style
	engine.

	 int Tk_RegisterStyledElement (Tk_StyleEngine engine, 
	 	Tk_ElementSpec *templatePtr)

 >	Element names use a dotted notation that gives a hierarchical
	search order.  For example, a widget requiring an element
	named "Scrollbar.vslider" can actually use the "vslider"
	generic element.  Apart from this dotted notation, element
	names are free-form.  However, conventions should be defined,
	such as capitalized widget classes, and lower case elements.
	Since whole widgets can act as elements, one can therefore
	register an element named "Scrollbar".

 >	The given pointer is not stored into internal structures, but
	is instead used to fill them.  Styled element specs can thus
	be allocated on the stack or dynamically, but in most cases
	they will be statically defined.

 Tk\_GetElementId:
	Returns the unique numerical ID for an already registered
	element.

	 int Tk_GetElementId (char *name)

 Tk\_CreateStyle:
	Creates a new style as an instance of an existing style
	engine.

	 Tk_Style Tk_CreateStyle (CONST char *name, Tk_StyleEngine engine, 
	 	ClientData clientData)

 >	Client data may be provided, that will be passed as is to
	element operations.

 Tk\_GetStyle:
	Retrieves an existing style by its name.

	 Tk_Style Tk_GetStyle (Tcl_Interp *interp, CONST char *name)

 >	Retrieves either an existing style by its name, or NULL if
	none was found.  In the latter case, leaves an error message
	in _interp_ if it is not NULL.

 Tk\_FreeStyle:
	Frees a style returned by _Tk\_CreateStyle_ or
	_Tk\_GetStyle_.

	 void Tk_FreeStyle (Tk_Style style)

 >	It actually decrements an internal reference count so that
	styles can be shared and deleted safely.

 Tk\_NameOfStyle:
	Gets a style's name.

	 CONST char * Tk_NameOfStyle (Tk_Style style)

 Tk\_AllocStyleFromObj, Tk\_GetStyleFromObj, Tk\_FreeStyleFromObj:
	_Tcl\_Obj_ based interface to styles.

	 Tk_Style  Tk_AllocStyleFromObj (Tcl_Interp *interp, Tcl_Obj *objPtr)
	 Tk_Style Tk_GetStyleFromObj (Tcl_Obj *objPtr)
	 void  Tk_FreeStyleFromObj (Tcl_Obj *objPtr)

 >	_Tk\_AllocStyleFromObj_ gets \(doesn't create\) an existing
	style from an object.  _Tk\_GetStyleFromObj_ returns the
	style already stored in the object's internal representation.
	The object must have been returned by
	_Tk\_AllocStyleFromObj_.  _Tk\_FreeStyleFromObj_ frees the
	style held by the object. 

 Tk\_GetStyledElement:
	Returns a token for the styled element for use with widgets
	having the given _optionTable_.

	 Tk_StyledElement Tk_GetStyledElement (Tk_Style style, int elementId, 
	 	Tk_OptionTable optionTable)

 >	Returns a token for the styled element \(or NULL if not found\),
	for use with widgets having the given optionTable.  The token
	is persistent and doesn't need to be freed, so it can be
	safely stored if needed \(although using element IDs is the
	preferred method\).  It is used in subsequent element
	operations and avoids repeated lookups.  The lookup algorithm
	works as follows:

 > 1.	Look for an implementation of the given element in the current
	style engine.

 > 2.	If none was found, traverse the chain of engines \(each but the
	default engine has a parent\) until the default engine is
	reached.

 > 3.	Restart at step 1 with the base element name instead.  For
	example, if we are looking for "foo.bar.baz", then look for
	"bar.baz" then "baz", until we find an implementation.  

 >	If no implementation was found, then a panic is generated,
	meaning that some dependency has not been resolved.  In the
	general case, this won't happen for core widgets \(because they
	only use core elements\), and new widgets either have to rely
	on core or package-provided elements, or define their own.

 Tk\_GetElementSize, Tk\_GetElementBox, Tk\_GetElementBorderWidth, Tk\_DrawElement:
	Various element operations.

	 void Tk_GetElementSize (Tk_Style style, Tk_StyledElement element, 
	 	char *recordPtr, Tk_Window tkwin, int width, int height, 
	 	int inner, int *widthPtr, int *heightPtr)
	 void Tk_GetElementBox (Tk_Style style, Tk_StyledElement element, 
	 	char *recordPtr, Tk_Window tkwin, int x, int y, int width, 
	 	int height, int inner, int *xPtr, int *yPtr, int *widthPtr, 
	 	int *heightPtr)
	 int Tk_GetElementBorderWidth (Tk_Style style, Tk_StyledElement element, 
	 	char *recordPtr, Tk_Window tkwin)
	 void Tk_DrawElement (Tk_Style style, Tk_StyledElement element, 
	 	char *recordPtr, Tk_Window tkwin, Drawable d, int x, int y, 
	 	int width, int height, int state)

 >	The first two are used for geometry management.  First one
	only computes the size, while second one computes the box
	coordinates.  The _inner_ parameter is a boolean that
	controls whether the inner \(FALSE\) or outer \(TRUE\) geometry is
	requested from the maximum outer/minimum inner geometry.
	Third one returns the uniform internal border width of the
	element and is mostly intended for whole widgets.  Last one
	draws the element using the given geometry and state.

# Implementation

An implementation has been written and completed with respect to the
present specification.  A patch for Tk 8.4a3 is available at:

 > <http://www.purl.org/NET/bonnet/pub/style.patch>

The _square_ widget implemented in the test file
_generic/tkSquare.c_ has also been rewritten to use the new API for
its square element.  It demonstrates basic features.  Patch file:

 > <http://www.purl.org/NET/bonnet/pub/squarestyle.patch>

The sample code registers an element "Square.square" in the default
style engine.  This element is used by the square widget in its
drawing code.  A new style engine "fixedborder" is registered, and
code is provided for the "Square.square" element.  This style engine
draws the element's border using a fixed border width given as client
data by instantiated styles.  Four styles are created as instances of
the "fixedborder" element: "flat", "border2", "border4" and "border8"
\(0, 2, 4 and 8 pixel-wide borders\).

Sample test session:

	 pack [square .s]
	 .s config -style
	 .s config -style flat
	 .s config -style border2
	 .s config -style border4
	 .s config -style border8
	 .s config -style ""
	 pack [square .s2]
	 .s2 config -style border2
	 .s2 config -style border8

# Performance and memory usage

The provided design and implementation is geared towards the best
compromise between performance and memory consumption.

Critical performance bottleneck is element querying.  In order to
minimize element access times, elements are identified by unique IDs
that act as indexes within internal tables, allowing direct
addressing.  Hash tables are used internally by all name pools
\(engines, styles, elements\).  Static structures are used whenever
possible \(for styled element registration, indirectly through
widgets' option tables...\).  Widget processing times are increased by
the extra procedure calls and indirections, but that is the price to
pay for better modularity anyway.  Additional calls are kept minimal.

Per-widget memory consumption is minimal.  A widget usually only needs
to store its current style.  Element IDs can \(should?\) be shared
globally across widgets of the same class and don't need to be stored
in the widget record.  Moreover, most information is shared internally
across widgets of the same class \(identified by their option table\).
Many caching & fast lookup techniques are used throughout the code.

# Compatibility

Existing widgets will need to be rewritten in order to become
style-aware.  The required code changes may be significant \(implying
code modularization\).  However, no incompatibility is introduced.
Thus, migrating widgets from the old to the new model can follow a
smooth path, similar to that needed for the transition to _Tcl\_Obj_
interfaces.  Besides, widgets as a whole can act as elements, which
may shift the amount of work from the core to the style engines at the
expense of a lesser modularity and code reuse.

# Future improvements or changes

 * Additional APIs for querying the list of engines, styles,
   elements...

 * Additional operations for elements, e.g. hit tests.

 * Script-level interfaces.

 * Optional translation tables between real widget options and needed
   element options, e.g. _-elementborderwidth_ => _-borderwidth_.

 * How to handle native widgets? They will certainly have to be
   provided as whole elements.

 * Current implementation uses thread-local storage for holding
   dynamic data.  Since most data is not thread-specific, this could
   be changed to a more memory-efficient scheme.

 * Provide man pages and tests.

 * Additional hidden/private option flag for accessing some widgets'
   non-configurable data \(e.g. scrollbar position\) through option
   tables.

# Glossary

 Element: Part of a widget \(e.g. a checkbox mark or a scrollbar
	arrow\), usually active.

 Style: The visual appearance of a widget.  May include colors, skins,
        tiles, border drawing style \(Windows, Motif...\), element
        pictures.

 Styled element: A style-specific implementation of a widget element.

 Style engine: A visually consistent collection of styled elements.

 Theme: A collection of graphical elements giving a consistent
        appearance to a whole widget hierarchy, application, or
        desktop.  A theme is usually made up out of icons, colors,
        fonts, widget styles, or even desktop background and sounds.

# Copyright

This document has been placed in the public domain.

Name change from tip/49.tip to tip/49.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57

58
59

60
61
62

63
64
65
66

67
68
69

70
71
72
73
74
75
76

77
78
79
80
81
82
83
84
85
86

87
88
89
90
91
92

93
94

95
96
97
98
99

100
TIP:          49
Title:        I/O Subsystem: Add API Tcl_OutputBuffered(chan)
Version:      $Revision: 1.4 $
Author:       Rolf Schroedter <[email protected]>
State:        Final
Type:         Project
Vote:         Done
Created:      25-Jul-2001
Post-History:
Tcl-Version:  8.4

~ Abstract

This document proposes the new public function ''Tcl_OutputBuffered()'',
analogous to the existing public function ''Tcl_InputBuffered()''.

~ Rationale

Tcl has a ''Tcl_InputBuffered()'' function but no analog
function for the output buffer. 
A ''Tcl_OutputBuffered()'' function would be useful
for non-blocking channel drivers which need to know the 
number of bytes pending in Tcl's output queue.

The implementation of [35] allows one to query the number of bytes 
in the channels input and output queues with a ''[fconfigure -queue]''
option. This is a useful feature especially for serial ports
because the input/output may be really slow or even stall.

On the driver level only the number of bytes in the system queue
can be queried. For a non-blocking channel there may also be
some pending output in Tcl buffers. 
Obviously there is not much sense to know only the byte counter
at driver level without knowing ''Tcl_OutputBuffered()''.

~ Related Ideas

It could also be useful to add general ''[fconfigure -inputbuffer
-outputbuffer]'' options for all channels returning the values from
''Tcl_InputBuffered(chan)'' and ''Tcl_OutputBuffered(chan)'' respectively.

At this opportunity the code of ''Tcl_Seek()'' and ''Tcl_Tell()''
may be shortened, because it repeats the code of 
''Tcl_InputBuffered()'' and ''Tcl_OutputBuffered()''.

~ Implementation

This function would be added to ''generic/tclIO.c'' and be 
stubs enabled. This new API should not have any impact
on existing applications.

The implementation is analog to what is done in ''Tcl_Tell()'':

|
|/*
| *----------------------------------------------------------------------
| *

| * Tcl_OutputBuffered --
| *

| *	Returns the number of bytes of output currently buffered in the
| *	common internal buffer of a channel.
| *

| * Results:
| *	The number of output bytes buffered, or zero if the channel is not
| *	open for writing.
| *

| * Side effects:
| *	None.
| *

| *----------------------------------------------------------------------
| */
|
|int
|Tcl_OutputBuffered(chan)
|    Tcl_Channel chan;			/* The channel to query. */
|{

|    ChannelState *statePtr = ((Channel *) chan)->state;
|					/* State of real channel structure. */
|    ChannelBuffer *bufPtr;
|    int bytesBuffered;
|
|    for (bytesBuffered = 0, bufPtr = statePtr->outQueueHead;
|	bufPtr != (ChannelBuffer *) NULL;
|	bufPtr = bufPtr->nextPtr) {
|	bytesBuffered += (bufPtr->nextAdded - bufPtr->nextRemoved);
|    }

|    if ((statePtr->curOutPtr != (ChannelBuffer *) NULL) &&
|	    (statePtr->curOutPtr->nextAdded > statePtr->curOutPtr->nextRemoved)) {
|        statePtr->flags |= BUFFER_READY;
|        bytesBuffered +=
|            (statePtr->curOutPtr->nextAdded - statePtr->curOutPtr->nextRemoved);
|    }

|    return bytesBuffered;
|}

|

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|
|

|

|

|

|
|

|

|

|
|
|

|

|

|

|

|

|
|
|
<
>
|
<
>
|
|
<
>
|
|
|
<
>
|
|
<
>
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
<
>
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55

56
57

58
59
60

61
62
63
64

65
66
67

68
69
70
71
72
73
74

75
76
77
78
79
80
81
82
83
84

85
86
87
88
89
90

91
92

93
94
95
96
97
98
99
100

# TIP 49: I/O Subsystem: Add API Tcl_OutputBuffered(chan)

	Author:       Rolf Schroedter <[email protected]>
	State:        Final
	Type:         Project
	Vote:         Done
	Created:      25-Jul-2001
	Post-History:
	Tcl-Version:  8.4
-----

# Abstract

This document proposes the new public function _Tcl\_OutputBuffered\(\)_,
analogous to the existing public function _Tcl\_InputBuffered\(\)_.

# Rationale

Tcl has a _Tcl\_InputBuffered\(\)_ function but no analog
function for the output buffer. 
A _Tcl\_OutputBuffered\(\)_ function would be useful
for non-blocking channel drivers which need to know the 
number of bytes pending in Tcl's output queue.

The implementation of [[35]](35.md) allows one to query the number of bytes 
in the channels input and output queues with a _[fconfigure -queue]_
option. This is a useful feature especially for serial ports
because the input/output may be really slow or even stall.

On the driver level only the number of bytes in the system queue
can be queried. For a non-blocking channel there may also be
some pending output in Tcl buffers. 
Obviously there is not much sense to know only the byte counter
at driver level without knowing _Tcl\_OutputBuffered\(\)_.

# Related Ideas

It could also be useful to add general _[fconfigure -inputbuffer
-outputbuffer]_ options for all channels returning the values from
_Tcl\_InputBuffered\(chan\)_ and _Tcl\_OutputBuffered\(chan\)_ respectively.

At this opportunity the code of _Tcl\_Seek\(\)_ and _Tcl\_Tell\(\)_
may be shortened, because it repeats the code of 
_Tcl\_InputBuffered\(\)_ and _Tcl\_OutputBuffered\(\)_.

# Implementation

This function would be added to _generic/tclIO.c_ and be 
stubs enabled. This new API should not have any impact
on existing applications.

The implementation is analog to what is done in _Tcl\_Tell\(\)_:

	/*
	 *----------------------------------------------------------------------

	 *
	 * Tcl_OutputBuffered --

	 *
	 *	Returns the number of bytes of output currently buffered in the
	 *	common internal buffer of a channel.

	 *
	 * Results:
	 *	The number of output bytes buffered, or zero if the channel is not
	 *	open for writing.

	 *
	 * Side effects:
	 *	None.

	 *
	 *----------------------------------------------------------------------
	 */

	int
	Tcl_OutputBuffered(chan)
	    Tcl_Channel chan;			/* The channel to query. */

	{
	    ChannelState *statePtr = ((Channel *) chan)->state;
						/* State of real channel structure. */
	    ChannelBuffer *bufPtr;
	    int bytesBuffered;

	    for (bytesBuffered = 0, bufPtr = statePtr->outQueueHead;
		bufPtr != (ChannelBuffer *) NULL;
		bufPtr = bufPtr->nextPtr) {
		bytesBuffered += (bufPtr->nextAdded - bufPtr->nextRemoved);

	    }
	    if ((statePtr->curOutPtr != (ChannelBuffer *) NULL) &&
		    (statePtr->curOutPtr->nextAdded > statePtr->curOutPtr->nextRemoved)) {
	        statePtr->flags |= BUFFER_READY;
	        bytesBuffered +=
	            (statePtr->curOutPtr->nextAdded - statePtr->curOutPtr->nextRemoved);

	    }
	    return bytesBuffered;

	}

# Copyright

This document has been placed in the public domain.

Name change from tip/5.tip to tip/5.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118

119
120
121
122
123

124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262

263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279

280
281
282
283
284
285
286
287
288

289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370

TIP:            5
Title:          Make TkClassProcs and TkSetClassProcs Public and Extensible
Version:        $Revision: 1.3 $
Author:         Eric Melski <[email protected]>
State:          Final
Type:           Project
Tcl-Version:    8.4
Vote:           Done
Created:        17-Oct-2000
Post-History:

~ Abstract

At certain critical moments in the lifetime of a Tk widget, Tk will
invoke various callbacks on that widget.  These callbacks enable the
widget to do lots of interesting things, such as react to
configuration changes for named fonts, or create and manage truly
native widgets (such as the scrollbar widget on Windows platforms).
The API for setting up these callbacks for a particular window are, as
of Tk 8.3.2, private.  This prohibits extension widget authors from fully
utilizing this powerful system; those developers can either copy the
private declarations into their own source code (leading to future
maintenance hassles), or forego the system entirely, hampering their
ability to make truly native and well-integrated widgets.  This
proposal offers an extensible means for making that API public.

~ Rationale:  Why make TkClassProcs and TkSetClassProcs public?

(The following text is adapted from George Howlett
[http://dev.scriptics.com/lists/tclcore/2000/10/msg00143.html])

The Tk toolkit was originally written strictly for Xlib.  It created
wrappers for many of the Xlib calls.  A good example is creating a
window.  Tk's ''Tk_CreateWindow'' call in turn calls Xlib's
''XCreateWindow''.  This is so that the toolkit can perform
bookkeeping on the window and manage it in various ways.  The down
side was that if you needed to pass specific information/flags to the
''XCreateWindow'' call you couldn't.  But this only affected
extensions.

Now when Tk 8.0 added native widgets, Tk also had the same problem.
For example to create a Win32 button control, you have to pass
information through the X emulation layer to the eventual Win32
CreateWindow or CreateWindowEx call.

So the Sun Tk developers created this notion of class procedures.  A
widget of particular type may need to make different calls at the time
the window is created.  They added to the TkWindow structure pointers
to both the widget instance (i.e. the data the represents the specific
widget) and a structure of function pointers (such as one to call when
the window is to be created).

|TkClassProcs tkpButtonProcs =
|
|    CreateProc,             /* createProc. */
|    TkButtonWorldChanged,   /* geometryProc. */
|    NULL                    /* modalProc. */
|};

Inside of Tk, such as in ''Tk_MakeWindowExist'', code was added to
check if the ''createProc'' of the structure isn't NULL and call that
routine to create the native window.

This mechanism was also used to handle font aliasing.  I can create a
font "fred" that is really a { courier bold } font and use it with any
Tk widget.

|font create fred -family courier -weight bold
|button .b -font fred

The widget will get the real font and use it in its graphics context.
Think of GCs like a pen drawing a particular color.  A GC draws with a
particular font.

Now if I change the font, the widget's GC must be updated too.

|font create fred -family helvetica -weight medium

You can see where a ''geometryProc'' is needed to indicate when font
aliases change.  It gets called for all the widgets using the font.

Another callback is used to handle modal events.  This is currently
needed only for the Win32 native scrollbar.

So here's the private structure and ''TkSetClassProcs'' call.

|typedef Window (TkClassCreateProc) _ANSI_ARGS_((Tk_Window tkwin,
|	 Window parent, ClientData instanceData));
|typedef void (TkClassGeometryProc) _ANSI_ARGS_((ClientData instanceData));
|typedef void (TkClassModalProc) _ANSI_ARGS_((Tk_Window tkwin,
|	 XEvent *eventPtr));
|
|/*
| * Widget class procedures used to implement platform specific widget
| * behavior.
| */
|
|typedef struct TkClassProcs {
|    TkClassCreateProc *createProc;
|			 /* Procedure to invoke when the
|			    platform-dependent window needs to be
|			    created. */
|    TkClassGeometryProc *geometryProc;
|			 /* Procedure to invoke when the geometry of a
|			    window needs to be recalculated as a result
|			    of some change in the system. */
|    TkClassModalProc *modalProc;
|			 /* Procedure to invoke after all bindings on a
|			    widget have been triggered in order to
|			    handle a modal loop. */
|} TkClassProcs;
|
|void
|TkSetClassProcs(tkwin, procs, instanceData)
|    Tk_Window tkwin;        /* Token for window to modify. */
|    TkClassProcs *procs;    /* Class procs structure. */
|    ClientData instanceData;/* Data to be passed to class procedures. */
|{

|    register TkWindow *winPtr = (TkWindow *) tkwin;
|
|    winPtr->classProcsPtr = procs;
|    winPtr->instanceData = instanceData;
|}

Extension developers could not use this interface, however, because it
was private to Tk.  The original authors of the interface didn't think
that anything outside of the Tk widgets would need it.  Of course,
hindsight is 20-20, and we have since found that this is not true.
Extension developers do need to use this system:  widget writers that
use fonts obviously need to know when a font alias changes, and new
Win32 native widgets also need access to this mechanism.

Most extensions authors had/have already found a workaround: copy in
the ''TkClassProcs'' structure and ''TkSetClassProcs'' routine into
your code.  However, this workaround leads to future code maintenance
problems.  Because the structure is private, its members and usage are
not guaranteed to remain constant between versions of Tk.  If the
structure changes, the extension authors have to update all of their
code accordingly.

Making the system public locks in the format and usage of the system,
so that extension authors can rely on it existing from one version to
the next, and they will no longer have to maintain parallel redundant
copies of the structure and function definition.

~ Rationale: Why make TkClassProcs and TkSetClassProcs extensible?

Every time we've made a public structure, we've regretted it later
when we needed to extend it to handle some new feature that we didn't
originally anticipate.  In general we should avoid designing new API's
that preclude making future changes without introducing
incompatibilities.
[http://dev.scriptics.com/lists/tclcore/2000/10/msg00083.html]

This system is one that seems likely to require extension in the
future.  There are currently three callbacks: create window, geometry
change, and modal event.  Already one request to extend the mechanism
has been made, to support the notion of a "client area" related to
geometry management and labelled frame widgets
([http://dev.scriptics.com/lists/tclcore/2000/10/msg00121.html],
[http://dev.scriptics.com/lists/tclcore/2000/10/msg00170.html]).
Another possible extension is a focus management callback, to allow
for smoother focus transitions between native widgets and Tk widgets;
note that this focus management callback is a purely hypothetical
extension at this time.

If the system is one that we are likely to want to extend with
additional callbacks in the future, it behooves us to make it public
in a manner that allows us to extend it while causing the minimum
amount of disruption for extension authors.  There are two concerns
here.  First is binary compatibility: will an extension compiled
against a version of Tk which features the base (three callback)
''TkClassProcs'' system work with a version of Tk that features an
extended ''TkClassProcs'' system?  Second is source compatibility:
will an extension author have to update their sources when they want
to recompile their extension against a version of Tk that features an
extended ''TkClassProcs'' system?  Ideally, the system that we make
public will allow extension while retaining binary and source
compatibility between versions of Tk.

~ Specification

I propose that the following steps be taken to make ''TkClassProcs''
and ''TkSetClassProcs'' public:

   1.  Rename ''TkClassProcs'' to ''Tk_ClassProcs''; rename
       ''TkSetClassProcs'' to ''Tk_SetClassProcs''; rename
       ''TkClassCreateProc'', etc., to ''Tk_ClassCreateProc'', etc.
       Move the structure definition, function prototype, and callback
       typedefs from tkInt.h to tk.h.  This is in keeping with Tk
       public interface naming conventions.

   2.  Add a single size field to the ''Tk_ClassProcs'' structure.
       This field is initialized at the time that the structure is
       allocated, and always contains the size of the structure.  This
       field will be used to provide a simple versioning scheme for
       the structure.  Portions of Tk that use the class proc
       callbacks will inspect this size field to ascertain whether or
       not a particular instance of the ''Tk_ClassProcs'' structure is
       of a version that contains a given callback.  See the example
       below.

   3.  Rename the ''geometryProc'' callback to ''worldChangedProc''.
       The name ''geometryProc'' is somewhat misleading.  Currently,
       the callback is used only to support font aliasing, as
       described above.  This is sort of geometry related, but it
       doesn't necessarily mean that geometry of the widget must
       change, it just means that the widget will have to update its
       world view to reflect the current state of the world.  In
       addition, the callback will likely be used to support color
       aliasing when that is added to Tk (imagine defining a color
       "myColor" to mean "#c4d3a2" and then configuring widgets to use
       "myColor" instead of the literal value; this provides all the
       benefits for colors that font aliasing does for fonts).  When
       that is done, ''geometryProc'' will be truly misleading, since
       a color change probably does not mean a geometry change for the
       widget.

   4.  Change the order of the callback fields in the
       ''Tk_ClassProcs'' structure, making ''worldChangedProc'' the
       first of the callbacks listed in the structure.  In the
       existing private ''TkClassProcs'' structure, the first callback
       is the ''createProc''.  It is not strictly necessary to make
       ''worldChangedProc'' the first callback.  However, most widgets
       in Tk (canvas, entry, scale, text, message, listbox, menu,
       menubuttons, scrollbars on Unix and Mac, and buttons on Unix
       and Mac) use only this callback.  Making it first in the
       structure (after the size field, which must be the very first
       entry) means a little bit less work for widget authors in the
       common case, because they need not include the NULL declaration
       for the ''createProc'' slot in the structure.  Compare:

|static Tk_ClassProcs myClassProcs = {
|    sizeof(Tk_ClassProcs), NULL, myWorldChangedProc
|};

 >     with:

|static Tk_ClassProcs myClassProcs = {
|    sizeof(Tk_ClassProcs), myWorldChangedProc
|};

 >     Since the ''createProc'' is used so infrequently, why require
       that all widget authors explicitly declare it to be NULL?  This
       change just simplifies everybody's life that much more.

Usage of the public API will be very similar to usage of the existing
private API:

|static Tk_ClassProcs myClassProcs = {
|    sizeof(Tk_ClassProcs),
|    myWorldChangedProc
|};
|
|static int Tk_MyWidgetObjCmd(...) {
|    ...
|    Tk_SetClassProcs(widgetPtr->tkwin,myClassProcs,(ClientData)widgetPtr);
|    ...
|    return TCL_OK;
|}

Portions of Tk that need to use a particular callback, such as
''Tk_MakeWindowExist'', use code like the following:

|Tk_ClassProcs *thisClassProcs = tkwin->classProcs;
|createProc *procPtr;
|
|/* Make sure the structure we were given has the createProc field
| * in it by checking that the size of the structure is at least
| * big enough to have that slot.
| */
|
|if (thisClassProcs->size <= Tk_Offset(Tk_ClassProcs, createProc)) {
|    procPtr = NULL;
|} else {
|    procPtr = thisClassProcs->createProc;
|}

|
|if (procPtr != NULL) {
|    /* Invoke the createProc for this window. */
|    ...
|} else {
|    /* Use the default Tk window creation mechanism. */
|    ...
|}

~ Benefits of this implementation

Benefits of this implementation are as follows:

   1.  Usage of ''Tk_ClassProcs'' and ''Tk_SetClassProcs'' very, very
       closely parallels the usage of the existing private API.  In
       fact, the only difference is a small change in the particular
       fields of the ''Tk_ClassProcs'' structure (especially, the new
       size field, for version information, and the reordering of the
       callback fields).

   2.  All instances of "mywidget" reference the same
       ''Tk_ClassProcs'' structure.  This is memory efficient.

   3.  We do not need to explicitly initialize to NULL those fields of
       myClassProcs that we don't use.  The ANSI C specification
       states that static variables (and members of statically
       declared structures) that are not explicitly initialized are
       initialized to zero.

   4.  This retains binary compatibility.  The size field of the
       ''Tk_ClassProcs'' structure is set at compile time, so when a
       later version of Tk checks the size field to see if a new
       callback can be used, it will fail.  That is, if extension
       author A compiles the extension against version X of Tk, which
       has three fields in ''Tk_ClassProcs'', the size field of
       myClassProcs will be set to 12 (assuming 4-byte pointers).
       When using that extension with Tk version Y, which may have
       four fields in ''Tk_ClassProcs'', the size check for that
       fourth field will fail, since the size field, set to 12, will
       be less than or equal to the offset of the fourth field in the
       structure.

   5.  This retains source compatibility.  Because of #3 above, unless
       the extension author wants to use the new callbacks, they need
       not worry about their addition, because the new fields will be
       automatically set to zero.

   6.  There is minimal API bloat.  Only one public API is added,
       ''Tk_SetClassProcs''.

   7.  The system is "type safe" with respect to the function
       signatures of the callback functions.  Any type mismatches will
       be caught at compile time.

   8.  If desired, widget authors can directly reference elements of
       the ''Tk_ClassProcs'' structure:

|myClassProcs.createProc = myCreateProc;

~ Drawbacks of this implementation

The drawbacks of this implementation are as follows:

   1.  The required value of the size field will seem like a bit of
       black magic to developers new to the system.  The question
       ''"Why does this field have to be set to this value?  If it's
       always the same thing, why is it stored at all?"''  Of course,
       experienced programmers will recognize why it has to be set,
       and that in fact, it is not always the same value.  This issue
       can best be addressed by appropriate documentation.

   2.  Extensions that use the existing private ''TkClassProcs'' and
       ''TkSetClassProcs'' mechanism and which were compiled against
       versions of Tk <= 8.3 will not work with new versions of Tk,
       since the format of the ''Tk_ClassProcs'' structure will
       change.  However, this is the consequence of using private
       structures and API's in your extensions: when those private
       structures and API's change, you have to update your extension
       accordingly.  We cannot allow ourselves to be overly
       constrained by this issue.  The existing mechanism is private,
       period.  Authors that use it do so knowingly and willfully.

~ Reference Implementation

http://sourceforge.net/patch/?func=detailpatch&patch_id=102213&group_id=10894

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|
|

|

|
|

|
|

|

|
|
|

|
|
|
|
|
|

|
|

|

|
|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
>

|

|

|

|
|

|
|
|

|

|

|
|

|
|
|

|

|

|
|

|
|

|
|

|

|
|
|
|

|
|
|

|

|
|
|

|

|
|
|

|

|
|
|
|
|
|
|
|
|
|
<
>

|

|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
<
|
>
|

|

|

|

|

|
|

|

|
|

|

|

|

|

|

|

|
|

|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116

117
118
119
120
121

122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260

261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277

278
279
280
281
282
283
284
285

286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370

# TIP 5: Make TkClassProcs and TkSetClassProcs Public and Extensible

	Author:         Eric Melski <[email protected]>
	State:          Final
	Type:           Project
	Tcl-Version:    8.4
	Vote:           Done
	Created:        17-Oct-2000
	Post-History:
-----

# Abstract

At certain critical moments in the lifetime of a Tk widget, Tk will
invoke various callbacks on that widget.  These callbacks enable the
widget to do lots of interesting things, such as react to
configuration changes for named fonts, or create and manage truly
native widgets \(such as the scrollbar widget on Windows platforms\).
The API for setting up these callbacks for a particular window are, as
of Tk 8.3.2, private.  This prohibits extension widget authors from fully
utilizing this powerful system; those developers can either copy the
private declarations into their own source code \(leading to future
maintenance hassles\), or forego the system entirely, hampering their
ability to make truly native and well-integrated widgets.  This
proposal offers an extensible means for making that API public.

# Rationale:  Why make TkClassProcs and TkSetClassProcs public?

\(The following text is adapted from George Howlett
<http://dev.scriptics.com/lists/tclcore/2000/10/msg00143.html> \)

The Tk toolkit was originally written strictly for Xlib.  It created
wrappers for many of the Xlib calls.  A good example is creating a
window.  Tk's _Tk\_CreateWindow_ call in turn calls Xlib's
_XCreateWindow_.  This is so that the toolkit can perform
bookkeeping on the window and manage it in various ways.  The down
side was that if you needed to pass specific information/flags to the
_XCreateWindow_ call you couldn't.  But this only affected
extensions.

Now when Tk 8.0 added native widgets, Tk also had the same problem.
For example to create a Win32 button control, you have to pass
information through the X emulation layer to the eventual Win32
CreateWindow or CreateWindowEx call.

So the Sun Tk developers created this notion of class procedures.  A
widget of particular type may need to make different calls at the time
the window is created.  They added to the TkWindow structure pointers
to both the widget instance \(i.e. the data the represents the specific
widget\) and a structure of function pointers \(such as one to call when
the window is to be created\).

	TkClassProcs tkpButtonProcs =

	    CreateProc,             /* createProc. */
	    TkButtonWorldChanged,   /* geometryProc. */
	    NULL                    /* modalProc. */
	};

Inside of Tk, such as in _Tk\_MakeWindowExist_, code was added to
check if the _createProc_ of the structure isn't NULL and call that
routine to create the native window.

This mechanism was also used to handle font aliasing.  I can create a
font "fred" that is really a \{ courier bold \} font and use it with any
Tk widget.

	font create fred -family courier -weight bold
	button .b -font fred

The widget will get the real font and use it in its graphics context.
Think of GCs like a pen drawing a particular color.  A GC draws with a
particular font.

Now if I change the font, the widget's GC must be updated too.

	font create fred -family helvetica -weight medium

You can see where a _geometryProc_ is needed to indicate when font
aliases change.  It gets called for all the widgets using the font.

Another callback is used to handle modal events.  This is currently
needed only for the Win32 native scrollbar.

So here's the private structure and _TkSetClassProcs_ call.

	typedef Window (TkClassCreateProc) _ANSI_ARGS_((Tk_Window tkwin,
		 Window parent, ClientData instanceData));
	typedef void (TkClassGeometryProc) _ANSI_ARGS_((ClientData instanceData));
	typedef void (TkClassModalProc) _ANSI_ARGS_((Tk_Window tkwin,
		 XEvent *eventPtr));

	/*
	 * Widget class procedures used to implement platform specific widget
	 * behavior.
	 */

	typedef struct TkClassProcs {
	    TkClassCreateProc *createProc;
				 /* Procedure to invoke when the
				    platform-dependent window needs to be
				    created. */
	    TkClassGeometryProc *geometryProc;
				 /* Procedure to invoke when the geometry of a
				    window needs to be recalculated as a result
				    of some change in the system. */
	    TkClassModalProc *modalProc;
				 /* Procedure to invoke after all bindings on a
				    widget have been triggered in order to
				    handle a modal loop. */
	} TkClassProcs;

	void
	TkSetClassProcs(tkwin, procs, instanceData)
	    Tk_Window tkwin;        /* Token for window to modify. */
	    TkClassProcs *procs;    /* Class procs structure. */
	    ClientData instanceData;/* Data to be passed to class procedures. */

	{
	    register TkWindow *winPtr = (TkWindow *) tkwin;

	    winPtr->classProcsPtr = procs;
	    winPtr->instanceData = instanceData;

	}

Extension developers could not use this interface, however, because it
was private to Tk.  The original authors of the interface didn't think
that anything outside of the Tk widgets would need it.  Of course,
hindsight is 20-20, and we have since found that this is not true.
Extension developers do need to use this system:  widget writers that
use fonts obviously need to know when a font alias changes, and new
Win32 native widgets also need access to this mechanism.

Most extensions authors had/have already found a workaround: copy in
the _TkClassProcs_ structure and _TkSetClassProcs_ routine into
your code.  However, this workaround leads to future code maintenance
problems.  Because the structure is private, its members and usage are
not guaranteed to remain constant between versions of Tk.  If the
structure changes, the extension authors have to update all of their
code accordingly.

Making the system public locks in the format and usage of the system,
so that extension authors can rely on it existing from one version to
the next, and they will no longer have to maintain parallel redundant
copies of the structure and function definition.

# Rationale: Why make TkClassProcs and TkSetClassProcs extensible?

Every time we've made a public structure, we've regretted it later
when we needed to extend it to handle some new feature that we didn't
originally anticipate.  In general we should avoid designing new API's
that preclude making future changes without introducing
incompatibilities.
<http://dev.scriptics.com/lists/tclcore/2000/10/msg00083.html> 

This system is one that seems likely to require extension in the
future.  There are currently three callbacks: create window, geometry
change, and modal event.  Already one request to extend the mechanism
has been made, to support the notion of a "client area" related to
geometry management and labelled frame widgets
\(<http://dev.scriptics.com/lists/tclcore/2000/10/msg00121.html> ,
<http://dev.scriptics.com/lists/tclcore/2000/10/msg00170.html> \).
Another possible extension is a focus management callback, to allow
for smoother focus transitions between native widgets and Tk widgets;
note that this focus management callback is a purely hypothetical
extension at this time.

If the system is one that we are likely to want to extend with
additional callbacks in the future, it behooves us to make it public
in a manner that allows us to extend it while causing the minimum
amount of disruption for extension authors.  There are two concerns
here.  First is binary compatibility: will an extension compiled
against a version of Tk which features the base \(three callback\)
_TkClassProcs_ system work with a version of Tk that features an
extended _TkClassProcs_ system?  Second is source compatibility:
will an extension author have to update their sources when they want
to recompile their extension against a version of Tk that features an
extended _TkClassProcs_ system?  Ideally, the system that we make
public will allow extension while retaining binary and source
compatibility between versions of Tk.

# Specification

I propose that the following steps be taken to make _TkClassProcs_
and _TkSetClassProcs_ public:

   1.  Rename _TkClassProcs_ to _Tk\_ClassProcs_; rename
       _TkSetClassProcs_ to _Tk\_SetClassProcs_; rename
       _TkClassCreateProc_, etc., to _Tk\_ClassCreateProc_, etc.
       Move the structure definition, function prototype, and callback
       typedefs from tkInt.h to tk.h.  This is in keeping with Tk
       public interface naming conventions.

   2.  Add a single size field to the _Tk\_ClassProcs_ structure.
       This field is initialized at the time that the structure is
       allocated, and always contains the size of the structure.  This
       field will be used to provide a simple versioning scheme for
       the structure.  Portions of Tk that use the class proc
       callbacks will inspect this size field to ascertain whether or
       not a particular instance of the _Tk\_ClassProcs_ structure is
       of a version that contains a given callback.  See the example
       below.

   3.  Rename the _geometryProc_ callback to _worldChangedProc_.
       The name _geometryProc_ is somewhat misleading.  Currently,
       the callback is used only to support font aliasing, as
       described above.  This is sort of geometry related, but it
       doesn't necessarily mean that geometry of the widget must
       change, it just means that the widget will have to update its
       world view to reflect the current state of the world.  In
       addition, the callback will likely be used to support color
       aliasing when that is added to Tk \(imagine defining a color
       "myColor" to mean "\#c4d3a2" and then configuring widgets to use
       "myColor" instead of the literal value; this provides all the
       benefits for colors that font aliasing does for fonts\).  When
       that is done, _geometryProc_ will be truly misleading, since
       a color change probably does not mean a geometry change for the
       widget.

   4.  Change the order of the callback fields in the
       _Tk\_ClassProcs_ structure, making _worldChangedProc_ the
       first of the callbacks listed in the structure.  In the
       existing private _TkClassProcs_ structure, the first callback
       is the _createProc_.  It is not strictly necessary to make
       _worldChangedProc_ the first callback.  However, most widgets
       in Tk \(canvas, entry, scale, text, message, listbox, menu,
       menubuttons, scrollbars on Unix and Mac, and buttons on Unix
       and Mac\) use only this callback.  Making it first in the
       structure \(after the size field, which must be the very first
       entry\) means a little bit less work for widget authors in the
       common case, because they need not include the NULL declaration
       for the _createProc_ slot in the structure.  Compare:

		static Tk_ClassProcs myClassProcs = {
		    sizeof(Tk_ClassProcs), NULL, myWorldChangedProc
		};

	 >     with:

		static Tk_ClassProcs myClassProcs = {
		    sizeof(Tk_ClassProcs), myWorldChangedProc
		};

	 >     Since the _createProc_ is used so infrequently, why require
       that all widget authors explicitly declare it to be NULL?  This
       change just simplifies everybody's life that much more.

Usage of the public API will be very similar to usage of the existing
private API:

	static Tk_ClassProcs myClassProcs = {
	    sizeof(Tk_ClassProcs),
	    myWorldChangedProc
	};

	static int Tk_MyWidgetObjCmd(...) {
	    ...
	    Tk_SetClassProcs(widgetPtr->tkwin,myClassProcs,(ClientData)widgetPtr);
	    ...
	    return TCL_OK;

	}

Portions of Tk that need to use a particular callback, such as
_Tk\_MakeWindowExist_, use code like the following:

	Tk_ClassProcs *thisClassProcs = tkwin->classProcs;
	createProc *procPtr;

	/* Make sure the structure we were given has the createProc field
	 * in it by checking that the size of the structure is at least
	 * big enough to have that slot.
	 */

	if (thisClassProcs->size <= Tk_Offset(Tk_ClassProcs, createProc)) {
	    procPtr = NULL;
	} else {
	    procPtr = thisClassProcs->createProc;

	}

	if (procPtr != NULL) {
	    /* Invoke the createProc for this window. */
	    ...
	} else {
	    /* Use the default Tk window creation mechanism. */
	    ...

	}

# Benefits of this implementation

Benefits of this implementation are as follows:

   1.  Usage of _Tk\_ClassProcs_ and _Tk\_SetClassProcs_ very, very
       closely parallels the usage of the existing private API.  In
       fact, the only difference is a small change in the particular
       fields of the _Tk\_ClassProcs_ structure \(especially, the new
       size field, for version information, and the reordering of the
       callback fields\).

   2.  All instances of "mywidget" reference the same
       _Tk\_ClassProcs_ structure.  This is memory efficient.

   3.  We do not need to explicitly initialize to NULL those fields of
       myClassProcs that we don't use.  The ANSI C specification
       states that static variables \(and members of statically
       declared structures\) that are not explicitly initialized are
       initialized to zero.

   4.  This retains binary compatibility.  The size field of the
       _Tk\_ClassProcs_ structure is set at compile time, so when a
       later version of Tk checks the size field to see if a new
       callback can be used, it will fail.  That is, if extension
       author A compiles the extension against version X of Tk, which
       has three fields in _Tk\_ClassProcs_, the size field of
       myClassProcs will be set to 12 \(assuming 4-byte pointers\).
       When using that extension with Tk version Y, which may have
       four fields in _Tk\_ClassProcs_, the size check for that
       fourth field will fail, since the size field, set to 12, will
       be less than or equal to the offset of the fourth field in the
       structure.

   5.  This retains source compatibility.  Because of \#3 above, unless
       the extension author wants to use the new callbacks, they need
       not worry about their addition, because the new fields will be
       automatically set to zero.

   6.  There is minimal API bloat.  Only one public API is added,
       _Tk\_SetClassProcs_.

   7.  The system is "type safe" with respect to the function
       signatures of the callback functions.  Any type mismatches will
       be caught at compile time.

   8.  If desired, widget authors can directly reference elements of
       the _Tk\_ClassProcs_ structure:

		myClassProcs.createProc = myCreateProc;

# Drawbacks of this implementation

The drawbacks of this implementation are as follows:

   1.  The required value of the size field will seem like a bit of
       black magic to developers new to the system.  The question
       _"Why does this field have to be set to this value?  If it's
       always the same thing, why is it stored at all?"_  Of course,
       experienced programmers will recognize why it has to be set,
       and that in fact, it is not always the same value.  This issue
       can best be addressed by appropriate documentation.

   2.  Extensions that use the existing private _TkClassProcs_ and
       _TkSetClassProcs_ mechanism and which were compiled against
       versions of Tk <= 8.3 will not work with new versions of Tk,
       since the format of the _Tk\_ClassProcs_ structure will
       change.  However, this is the consequence of using private
       structures and API's in your extensions: when those private
       structures and API's change, you have to update your extension
       accordingly.  We cannot allow ourselves to be overly
       constrained by this issue.  The existing mechanism is private,
       period.  Authors that use it do so knowingly and willfully.

# Reference Implementation

<http://sourceforge.net/patch/?func=detailpatch&patch\_id=102213&group\_id=10894>

# Copyright

This document has been placed in the public domain.

Name change from tip/50.tip to tip/50.md.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16

17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184

185
TIP:            50
Title:          Bundle [incr Tcl] with the Core Tcl distribution
Version:        $Revision: 1.12 $
Author:         Kevin Kenny <[email protected]>
Author:         Mark Harrison <[email protected]>
Author:         Jeff Hobbs <[email protected]>
Author:         Andreas Kupries <[email protected]>
Author:         Karl Lehenbauer <[email protected]>
Author:         Michael McLennan <[email protected]>
Author:         Don Porter <[email protected]>
Author:         Brent Welch <[email protected]>
State:          Final
Type:           Informative
Vote:           Done
Created:        27-Jul-2001
Post-History:   

~ Abstract

A "town meeting" discussion in which users were given the opportunity
to question the Tcl Core Team at the 2001 Open Source Convention has
revealed a great popular demand for bundling an object system with the
distribution of the Tcl Core.  This TIP presents a compromise proposal
for including [[incr Tcl]] that was acceptable to all eight TCT members
present.

~ Proposal

   * [[incr Tcl]] [http://tcltk.com/itcl/] shall be bundled with the core
     Tcl distribution.

   * [[incr Tcl]] shall be "included" in the Core in only a weak
     sense.

      > The location of the [[incr Tcl]] source tree shall be left to
        the discretion of the affected maintainers.  (It appears likely
        that most [[incr Tcl]] source will appear in a separate
        ''itcl'' directory parallel to the ''generic'', ''mac'',
        ''unix'' and ''win'' directories in the source.)

      > [[incr Tcl]] shall be built as a separate loadable package, 
        similar to the ''dde'', ''http'', ''msgcat'', and ''registry'' 
        packages.  

      > The ''::itcl'' namespace shall be the only new component 
        included in the global namespace, and shall appear only when 
        a script executes

|               package require Itcl

      > There shall be no ''::class'' command in the Core.

      > The ''::info'' command shall not provide any subcommands
        specific to [[incr Tcl]].

   * [[incr Tcl]] shall not be substantially modified under the scope
     of this TIP.

      > The existing issues surrounding errors thrown from object
        destructors shall not be addressed.

      > The existing use of ''rename'' for object destruction shall
        not be amended.

      > All other limitations of [[incr Tcl]] are initially accepted
        as they are.

      > Of course, additional TIPs could be submitted to modify
        [[incr Tcl]] as desired!

   * The TCT shall assume the role of ''gatekeeper'' for changes to
     the functionality of [[incr Tcl]].

      > Changes that affect user-visible functionality of [[incr Tcl]]
        shall be made through the TIP process.

      > Informational TIPs identifying maintainer areas and assigning
        maintainers to them shall be developed.

   * Nothing in this TIP shall be construed as identifying [[incr
     Tcl]] as a single preferred object system for Tcl.  If the
     community desires other systems such as OTcl, XOTcl, or ObjecTcl
     to stand on an equal footing to [[incr Tcl]], their champions
     can introduce TIPs similar to this one.

   * [[incr Tk]] and [[incr widgets]] are outside the scope of this TIP.

~ Rationale

The lack of a standard object and data abstraction system continues to
hinder Tcl development.

  > "Lets face it, not including any sort of OO system is one of
    the major failings of Tcl. Indexing into global arrays is
    a sad hack when compared to a real OO system."
           ''- Mo DeJong <[email protected]>''

Moreover, the argument that "Tcl is not object oriented" continues to
hamper Tcl marketing.  Including at least one object system with the
Tcl core, so that it is dependably available unless the user has built
from source, would address this objection.

Since an earler proposal ([6]) to incorporate [[incr Tcl]] into the
Core failed to garner the necessary votes, at least in part because
participants were uncertain of the rationales, it seems wise to
discuss the individual points in further detail.

   * All agree that some sort of object system must be bundled with
     the core so that it is dependably available.  [[incr Tcl]]
     appears to be the most popular of the existing systems, as well
     as the most familiar to the current TCT, making it the most
     attractive of several candidates for this role.

   * The original [[incr Tcl]] developers have pointed out that
     bundling in the Core would facilitate [[incr Tcl]] development
     greatly.  While it is a separate loadable package, [[incr Tcl]]
     is intimate with the core, depending on many undocumented
     interfaces to carry out its functions.  Integrating it with the
     Core would make it easier to maintain.

   * Including a ''::class'' command in the Core is not acceptable at
     this time, because it would have the effect of disenfranchising
     the users of other object systems -- who are too numerous to
     ignore.  Moreover, the ability of Tcl to serve as a test platform
     for novel object models must not be compromised.

   * Similarly, integrating [[incr Tcl]] closely with commands such as
     ''::info'' or ''::destroy'' would accord it a privileged status
     that the users of other object systems are reluctant to accept.

   * [[incr Tcl]] is what it is.  It would be inappropriate to demand
     that all the perceived shortcomings of the [[incr Tcl]] system be
     addressed prior to inclusion in the Core.  The TIP process is
     available to make further changes; the system is certainly good
     enough that many thousands of programmers use it daily.

   * If [[incr Tcl]] is to be included in the Core, then common sense
     requires that it be under control of the TIP process.

~ Alternatives

   * Include [[incr Tcl]] in a "batteries included" (BI) distribution.

      > Many people will not opt for the BI distribution ([4]) due to its
        larger size.  It is quite likely that (for example) a Linux
        distribution my include Tcl as a standard component, but place the BI
        on a supplemental disk.

      > Moreover, as mentioned above, the [[incr Tcl]] sources are
        already intimate with the Tcl core; there are great
        maintenance savings to be achieved by combining the source
        distributions. 

   * Integrate [[incr Tcl]] tightly into the Tcl Core.

      > This alternative is unacceptable to a good many users.  A
        number of attendees at the 2001 Open Source Convention
        mentioned specifically that they use alternative object
        systems such as OTcl.  These users would be essentially
        disenfranchised if, for instance, a ''::class'' command were to
        appear in the Core.

~ Implementation

Jeff Hobbs has volunteered to lead the implementation effort with
the assistance of all volunteers who want to help.

~ Notes

Eight members of the Tcl Core Team (Harrison, Hobbs, Kenny, Kupries,
Lehenbauer, McLennan, Porter and Welch) agreed orally to this proposal
at the 2001 Open Source Convention.  Since not all have had the
opportunity to read the formal written version of the proposal, that
vote shall not be considered binding.

~ References

   * http://tcltk.com//itcl

   * [6]

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|
|
|
|
|

|
|

|

|

|

|
|

|

|

|

|

|
|

|
|

|

|

|
|

|

|

|

|

|

|

|
|
|

|

|
|

|
|

|

|

|

|
|

|

|

|

|

|

|

|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185

# TIP 50: Bundle [incr Tcl] with the Core Tcl distribution

	Author:         Kevin Kenny <[email protected]>
	Author:         Mark Harrison <[email protected]>
	Author:         Jeff Hobbs <[email protected]>
	Author:         Andreas Kupries <[email protected]>
	Author:         Karl Lehenbauer <[email protected]>
	Author:         Michael McLennan <[email protected]>
	Author:         Don Porter <[email protected]>
	Author:         Brent Welch <[email protected]>
	State:          Final
	Type:           Informative
	Vote:           Done
	Created:        27-Jul-2001
	Post-History:   
-----

# Abstract

A "town meeting" discussion in which users were given the opportunity
to question the Tcl Core Team at the 2001 Open Source Convention has
revealed a great popular demand for bundling an object system with the
distribution of the Tcl Core.  This TIP presents a compromise proposal
for including [incr Tcl] that was acceptable to all eight TCT members
present.

# Proposal

   * [incr Tcl] <http://tcltk.com/itcl/>  shall be bundled with the core
     Tcl distribution.

   * [incr Tcl] shall be "included" in the Core in only a weak
     sense.

	      > The location of the [incr Tcl] source tree shall be left to
        the discretion of the affected maintainers.  \(It appears likely
        that most [incr Tcl] source will appear in a separate
        _itcl_ directory parallel to the _generic_, _mac_,
        _unix_ and _win_ directories in the source.\)

	      > [incr Tcl] shall be built as a separate loadable package, 
        similar to the _dde_, _http_, _msgcat_, and _registry_ 
        packages.  

	      > The _::itcl_ namespace shall be the only new component 
        included in the global namespace, and shall appear only when 
        a script executes

		               package require Itcl

	      > There shall be no _::class_ command in the Core.

	      > The _::info_ command shall not provide any subcommands
        specific to [incr Tcl].

   * [incr Tcl] shall not be substantially modified under the scope
     of this TIP.

	      > The existing issues surrounding errors thrown from object
        destructors shall not be addressed.

	      > The existing use of _rename_ for object destruction shall
        not be amended.

	      > All other limitations of [incr Tcl] are initially accepted
        as they are.

	      > Of course, additional TIPs could be submitted to modify
        [incr Tcl] as desired!

   * The TCT shall assume the role of _gatekeeper_ for changes to
     the functionality of [incr Tcl].

	      > Changes that affect user-visible functionality of [incr Tcl]
        shall be made through the TIP process.

	      > Informational TIPs identifying maintainer areas and assigning
        maintainers to them shall be developed.

   * Nothing in this TIP shall be construed as identifying [incr
     Tcl] as a single preferred object system for Tcl.  If the
     community desires other systems such as OTcl, XOTcl, or ObjecTcl
     to stand on an equal footing to [incr Tcl], their champions
     can introduce TIPs similar to this one.

   * [incr Tk] and [incr widgets] are outside the scope of this TIP.

# Rationale

The lack of a standard object and data abstraction system continues to
hinder Tcl development.

  > "Lets face it, not including any sort of OO system is one of
    the major failings of Tcl. Indexing into global arrays is
    a sad hack when compared to a real OO system."
           _- Mo DeJong <[email protected]>_

Moreover, the argument that "Tcl is not object oriented" continues to
hamper Tcl marketing.  Including at least one object system with the
Tcl core, so that it is dependably available unless the user has built
from source, would address this objection.

Since an earler proposal \([[6]](6.md)\) to incorporate [incr Tcl] into the
Core failed to garner the necessary votes, at least in part because
participants were uncertain of the rationales, it seems wise to
discuss the individual points in further detail.

   * All agree that some sort of object system must be bundled with
     the core so that it is dependably available.  [incr Tcl]
     appears to be the most popular of the existing systems, as well
     as the most familiar to the current TCT, making it the most
     attractive of several candidates for this role.

   * The original [incr Tcl] developers have pointed out that
     bundling in the Core would facilitate [incr Tcl] development
     greatly.  While it is a separate loadable package, [incr Tcl]
     is intimate with the core, depending on many undocumented
     interfaces to carry out its functions.  Integrating it with the
     Core would make it easier to maintain.

   * Including a _::class_ command in the Core is not acceptable at
     this time, because it would have the effect of disenfranchising
     the users of other object systems -- who are too numerous to
     ignore.  Moreover, the ability of Tcl to serve as a test platform
     for novel object models must not be compromised.

   * Similarly, integrating [incr Tcl] closely with commands such as
     _::info_ or _::destroy_ would accord it a privileged status
     that the users of other object systems are reluctant to accept.

   * [incr Tcl] is what it is.  It would be inappropriate to demand
     that all the perceived shortcomings of the [incr Tcl] system be
     addressed prior to inclusion in the Core.  The TIP process is
     available to make further changes; the system is certainly good
     enough that many thousands of programmers use it daily.

   * If [incr Tcl] is to be included in the Core, then common sense
     requires that it be under control of the TIP process.

# Alternatives

   * Include [incr Tcl] in a "batteries included" \(BI\) distribution.

	      > Many people will not opt for the BI distribution \([[4]](4.md)\) due to its
        larger size.  It is quite likely that \(for example\) a Linux
        distribution my include Tcl as a standard component, but place the BI
        on a supplemental disk.

	      > Moreover, as mentioned above, the [incr Tcl] sources are
        already intimate with the Tcl core; there are great
        maintenance savings to be achieved by combining the source
        distributions. 

   * Integrate [incr Tcl] tightly into the Tcl Core.

	      > This alternative is unacceptable to a good many users.  A
        number of attendees at the 2001 Open Source Convention
        mentioned specifically that they use alternative object
        systems such as OTcl.  These users would be essentially
        disenfranchised if, for instance, a _::class_ command were to
        appear in the Core.

# Implementation

Jeff Hobbs has volunteered to lead the implementation effort with
the assistance of all volunteers who want to help.

# Notes

Eight members of the Tcl Core Team \(Harrison, Hobbs, Kenny, Kupries,
Lehenbauer, McLennan, Porter and Welch\) agreed orally to this proposal
at the 2001 Open Source Convention.  Since not all have had the
opportunity to read the formal written version of the proposal, that
vote shall not be considered binding.

# References

   * <http://tcltk.com//itcl>

   * [[6]](6.md)

# Copyright

This document has been placed in the public domain.

Name change from tip/51.tip to tip/51.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121

122
123

124
125
126
127
128
129
130

131
132
133

134
135
136

137
138
139
140
141
142
143
144
145
146

147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162

163
164
165
166
167
168
169
170
171
172
173
174
175

176
177
178
179
180
181
182
183
184
185

186
187
188
189

190
191
192
193
194
195

196
197
198

199
200
201

202
203
204
205
206

207

208
TIP:            51
Title:          Native Menubutton on Macintosh
Version:	$Revision: 1.4 $
Author:         Mats Bengtsson <[email protected]>
State:          Withdrawn
Type:           Project
Tcl-Version:    8.5
Vote:           Pending
Created:        04-Aug-2001
Post-History:

~ Abstract

This is a replacement for the menubutton on the Macintosh with a
native implementation which is compliant with the Appearance Manager
in Mac OS 8 and later.

~ Rationale

The present (in 8.3.3 and earlier) menubutton on the Macintosh is
implemented using Tk drawing to draw something similar to the native
menubutton on Mac how it looks on Pre Mac OS 8.0 systems, and
therefore fails to give the correct appearance on Mac OS 8.0 systems
and later. This TIP presents a step to increase the native appearance
on the Macintosh (similar to [25].)

#image:51compare Comparison of Native (to left) and Standard Menu Buttons.

~ Reference Implementation

The proposed change is now implemented as a loadable extension (in C)
on Macintosh, and can be downloaded
[http://hem.fyristorg.com/matben/download/MacMenuButton.sit]. This
implementation differs from the other buttons in Mac Tk (button,
radiobutton, checkbutton), which use a mixture of native Apple drawing
code and Tk drawing code, since it uses only native Apple code for
drawing.  This extension requires Tcl/Tk 8.3.2p1 or later due to the
changed stub loading mechanism. The new implementation is not a
complete replacement since it lacks the ''-bitmap'' and ''-image''
options, and a few other things, see below.

The changes necessary are:

    * Replace the ''tkMacMenubutton.c'' file with the new one.

    * Add a MENU resource item, which is included in the shared library,
      but needs to be added to the core.

    * Modifications to ''tkMacFont.c'' (see appendix). Put declaration
      so it can be used from any file. Possibly also add the new
      function to the stub table since it can be practical for other
      extension writers.

    * Need to check for the presence of the Appearance manager:

|if (TkMacHaveAppearance()) 
|   use native (new) menubutton 
|else 
|   use present menubutton

All functionality from the documentation that is applicable is
implemented in the extension, with some exceptions:

    * The ''-image'' and ''-bitmap'' options are not supported, yet.

    * There is no button pressed (SELECTED) flag so it highlights when
      the mouse enters, just as a reminder that it must be fixed.
      (see appendix)

    * Don't know which color to pick for the three pixels in each
      corner.  It is now the ''-background'' color, but the ordinary
      button uses ''-highlightbackground''?

    * The position of the popup menu should be changed in order to
      conform better with standard Mac appearance..

    * Something needs to be done so that we can get Mac native font
      stuffs from a ''Tk_Font'' object; I've included a crude hack in
      the appendix.

    * It is compliant to the Appearance Manager which means that
      foreground and background colors are set via themes and not from
      command switches.

    * Minor differences to comply with the Appearance Manager.

All these deviations are consistent with the look-and-feel of Mac OS
8.0 and on. Existing scripts using menubutton are compatible with the
new menubutton.

Open questions: 

    * Option to use for the color of the corner pixels.

    * If (and how) a SELECTED flag should be added to
      ''tkMenuButton.h'', and code to support it in
      ''tkMenuButton.c''.

    * Implementation of the ''-bitmap'' and ''-image'' options.

    * A ''-compound'' option as described in TIP #11.

~ Copyright

This document has been placed in the public domain.

~ Appendix

    * Addition to ''tkMenuButton.h'':

|#define SELECTED		8

    > Other modifications to tkMenuButton.c must be made to support
      this flag.

    * Addition to ''tkMacFont.c'' (possibly add to exported
      functions):

|/*
| *---------------------------------------------------------------------------
| *

| * GetMacFontAttributes -- 
| *

| *      Takes a Tk_Font and gets the Mac font attributes faceNum, size, and style.
| *      Note that the Mac font size is in pixels while the Tk_Font size is
| *      in points. No need to do any UTF-8 translations since this is
| *      implicit in GetFamilyOrAliasNum().
| *      The code here is essentially a modified TkpGetFontFromAttributes() and
| *      InitFont(), both from tkMacFont.c.
| *

| * Results:
| *      Sets the Mac font attributes.
| *

| * Side effects:
| *      None.
| *

| *---------------------------------------------------------------------------
| */
|void
|GetMacFontAttributes(
|        Tk_Window tkwin,        /* Tk window. (in) */
|        Tk_Font tkFont,         /* Tk font. (in) */
|        short *faceNumPtr,      /* Mac font face id. (out) */
|        short *macSizePtr,      /* Mac font size in pixels. (out) */
|        Style *stylePtr)        /* Mac font style specifier. (out) */
|{

|    int i, j;
|    char *faceName, *fallback;
|    char ***fallbacks;
|    MacFont *fontPtr;
|    const TkFontAttributes *faPtr;
|    int size;           /* Size in points. */
|        
|    /*
|     * This is just a macro to access the attribute struct member.
|     */
|     
|    faPtr = GetFontAttributes(tkFont);
|
|    /*
|     * Algorithm to get the closest font to the one requested.
|     *

|     * try fontname
|     * try all aliases for fontname
|     * foreach fallback for fontname
|     *      try the fallback
|     *      try all aliases for the fallback
|     */
|     
|    *faceNumPtr = 0;
|    faceName = faPtr->family;
|    if (faceName != NULL) {
|        if (GetFamilyOrAliasNum(faceName, faceNumPtr) != 0) {
|            goto found;
|        }

|        fallbacks = TkFontGetFallbacks();
|        for (i = 0; fallbacks[i] != NULL; i++) {
|            for (j = 0; (fallback = fallbacks[i][j]) != NULL; j++) {
|                if (strcasecmp(faceName, fallback) == 0) {
|                    for (j = 0; (fallback = fallbacks[i][j]) != NULL; j++) {
|                        if (GetFamilyOrAliasNum(fallback, faceNumPtr)) {
|                            goto found;
|                        }
|                    }
|                }

|                break;
|            }
|        }
|    }

|    
|    found:    
|    *stylePtr = 0;
|    if (faPtr->weight != TK_FW_NORMAL) {
|        *stylePtr |= bold;
|    }

|    if (faPtr->slant != TK_FS_ROMAN) {
|        *stylePtr |= italic;
|    }

|    if (faPtr->underline) {
|        *stylePtr |= underline;
|    }

|    if (faPtr->size == 0) {
|        size = -GetDefFontSize();
|    } else {
|        size = faPtr->size;
|    }

|    *macSizePtr = (short) TkFontGetPixels(tkwin, size);

|}
<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|
|
|

|

|

|

|
|
|
|

|

|

|

|
|

|

|
|
|

|

|

|

|

|

|

|

|
|

|
|
<
>
|
<
>
|
|
|
|
|
|
<
>
|
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
<
<
<
>
>
>
|
<
<
<
>
>
>
|
|
|
|
|
<
>
|
|
<
>
|
|
<
>
|
|
|
|
<
>
|
>
|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119

120
121

122
123
124
125
126
127
128

129
130
131

132
133
134

135
136
137
138
139
140
141
142
143
144

145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160

161
162
163
164
165
166
167
168
169
170
171
172
173

174
175
176
177
178
179
180
181

182
183
184
185

186
187
188
189
190
191
192
193

194
195
196

197
198
199

200
201
202
203
204

205
206
207
208

# TIP 51: Native Menubutton on Macintosh

	Author:         Mats Bengtsson <[email protected]>
	State:          Withdrawn
	Type:           Project
	Tcl-Version:    8.5
	Vote:           Pending
	Created:        04-Aug-2001
	Post-History:
-----

# Abstract

This is a replacement for the menubutton on the Macintosh with a
native implementation which is compliant with the Appearance Manager
in Mac OS 8 and later.

# Rationale

The present \(in 8.3.3 and earlier\) menubutton on the Macintosh is
implemented using Tk drawing to draw something similar to the native
menubutton on Mac how it looks on Pre Mac OS 8.0 systems, and
therefore fails to give the correct appearance on Mac OS 8.0 systems
and later. This TIP presents a step to increase the native appearance
on the Macintosh \(similar to [[25]](25.md).\)

![Comparison of Native (to left) and Standard Menu Buttons.](../assets/51compare.gif)

# Reference Implementation

The proposed change is now implemented as a loadable extension \(in C\)
on Macintosh, and can be downloaded
<http://hem.fyristorg.com/matben/download/MacMenuButton.sit> . This
implementation differs from the other buttons in Mac Tk \(button,
radiobutton, checkbutton\), which use a mixture of native Apple drawing
code and Tk drawing code, since it uses only native Apple code for
drawing.  This extension requires Tcl/Tk 8.3.2p1 or later due to the
changed stub loading mechanism. The new implementation is not a
complete replacement since it lacks the _-bitmap_ and _-image_
options, and a few other things, see below.

The changes necessary are:

    * Replace the _tkMacMenubutton.c_ file with the new one.

    * Add a MENU resource item, which is included in the shared library,
      but needs to be added to the core.

    * Modifications to _tkMacFont.c_ \(see appendix\). Put declaration
      so it can be used from any file. Possibly also add the new
      function to the stub table since it can be practical for other
      extension writers.

    * Need to check for the presence of the Appearance manager:

		if (TkMacHaveAppearance()) 
		   use native (new) menubutton 
		else 
		   use present menubutton

All functionality from the documentation that is applicable is
implemented in the extension, with some exceptions:

    * The _-image_ and _-bitmap_ options are not supported, yet.

    * There is no button pressed \(SELECTED\) flag so it highlights when
      the mouse enters, just as a reminder that it must be fixed.
      \(see appendix\)

    * Don't know which color to pick for the three pixels in each
      corner.  It is now the _-background_ color, but the ordinary
      button uses _-highlightbackground_?

    * The position of the popup menu should be changed in order to
      conform better with standard Mac appearance..

    * Something needs to be done so that we can get Mac native font
      stuffs from a _Tk\_Font_ object; I've included a crude hack in
      the appendix.

    * It is compliant to the Appearance Manager which means that
      foreground and background colors are set via themes and not from
      command switches.

    * Minor differences to comply with the Appearance Manager.

All these deviations are consistent with the look-and-feel of Mac OS
8.0 and on. Existing scripts using menubutton are compatible with the
new menubutton.

Open questions: 

    * Option to use for the color of the corner pixels.

    * If \(and how\) a SELECTED flag should be added to
      _tkMenuButton.h_, and code to support it in
      _tkMenuButton.c_.

    * Implementation of the _-bitmap_ and _-image_ options.

    * A _-compound_ option as described in TIP \#11.

# Copyright

This document has been placed in the public domain.

# Appendix

    * Addition to _tkMenuButton.h_:

		#define SELECTED		8

	    > Other modifications to tkMenuButton.c must be made to support
      this flag.

    * Addition to _tkMacFont.c_ \(possibly add to exported
      functions\):

		/*
		 *---------------------------------------------------------------------------

		 *
		 * GetMacFontAttributes -- 

		 *
		 *      Takes a Tk_Font and gets the Mac font attributes faceNum, size, and style.
		 *      Note that the Mac font size is in pixels while the Tk_Font size is
		 *      in points. No need to do any UTF-8 translations since this is
		 *      implicit in GetFamilyOrAliasNum().
		 *      The code here is essentially a modified TkpGetFontFromAttributes() and
		 *      InitFont(), both from tkMacFont.c.

		 *
		 * Results:
		 *      Sets the Mac font attributes.

		 *
		 * Side effects:
		 *      None.

		 *
		 *---------------------------------------------------------------------------
		 */
		void
		GetMacFontAttributes(
		        Tk_Window tkwin,        /* Tk window. (in) */
		        Tk_Font tkFont,         /* Tk font. (in) */
		        short *faceNumPtr,      /* Mac font face id. (out) */
		        short *macSizePtr,      /* Mac font size in pixels. (out) */
		        Style *stylePtr)        /* Mac font style specifier. (out) */

		{
		    int i, j;
		    char *faceName, *fallback;
		    char ***fallbacks;
		    MacFont *fontPtr;
		    const TkFontAttributes *faPtr;
		    int size;           /* Size in points. */

		    /*
		     * This is just a macro to access the attribute struct member.
		     */

		    faPtr = GetFontAttributes(tkFont);

		    /*
		     * Algorithm to get the closest font to the one requested.

		     *
		     * try fontname
		     * try all aliases for fontname
		     * foreach fallback for fontname
		     *      try the fallback
		     *      try all aliases for the fallback
		     */

		    *faceNumPtr = 0;
		    faceName = faPtr->family;
		    if (faceName != NULL) {
		        if (GetFamilyOrAliasNum(faceName, faceNumPtr) != 0) {
		            goto found;

		        }
		        fallbacks = TkFontGetFallbacks();
		        for (i = 0; fallbacks[i] != NULL; i++) {
		            for (j = 0; (fallback = fallbacks[i][j]) != NULL; j++) {
		                if (strcasecmp(faceName, fallback) == 0) {
		                    for (j = 0; (fallback = fallbacks[i][j]) != NULL; j++) {
		                        if (GetFamilyOrAliasNum(fallback, faceNumPtr)) {
		                            goto found;

		                        }
		                    }
		                }
		                break;

		            }
		        }
		    }

		    found:    
		    *stylePtr = 0;
		    if (faPtr->weight != TK_FW_NORMAL) {
		        *stylePtr |= bold;

		    }
		    if (faPtr->slant != TK_FS_ROMAN) {
		        *stylePtr |= italic;

		    }
		    if (faPtr->underline) {
		        *stylePtr |= underline;

		    }
		    if (faPtr->size == 0) {
		        size = -GetDefFontSize();
		    } else {
		        size = faPtr->size;

		    }
		    *macSizePtr = (short) TkFontGetPixels(tkwin, size);
		}

Name change from tip/52.tip to tip/52.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37

38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55

56
57
58
59
60
61

62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83

84
85
86
87
88
89
90
91

92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198

TIP:            52
Title:          Hierarchical Namespace Lookup of Commands and Variables
Version:        $Revision: 1.6 $
Author:         David Cuthbert <[email protected]>
Author:         Andreas Kupries <[email protected]>
State:          Withdrawn
Type:           Project
Vote:           Pending
Created:        09-Aug-2001
Post-History:   
Discussions-To: news:comp.lang.tcl
Keywords:       namespace,lookup,hierarchy
Tcl-Version:    8.5

~ Abstract

This TIP proposes to change the command and variable namespace lookup
system so that the full hierarchy of namespaces is parsed, rather than
just the current namespace followed by the global namespace.  This is
primarily intended to rectify problems often encountered with the use
of [[incr Tcl]] (ITcl) and namespaces.  In addition, package
encapsulation can be enhanced with judicious application of this
feature.

~ Rationale

Currently, the following code is invalid in Tcl/ITcl:

|package require Itcl
|
|namespace eval SampleNS {
|    proc Hello {} { puts "Hello world!" }
|
|    ::itcl::class X {
|        public constructor {} {} { Hello }
|    }
|}

|
|SampleNS::X x1  ;# Error: invalid command name "Hello"

This is due to the fact that ITcl classes double as namespaces.
Therefore, the lookup of ''Hello'' takes place first in
''::SampleNS::X'', followed by ''::'' (the global namespace).

The current workaround - to reopen the class' namespace and issue a
''namespace import'' directive - is of limited value since ''namespace
import'' is not capable of bringing in names defined later on.  The
following code illustrates this point:

|package require Itcl
|
|namespace eval SampleNS {
|    ::itcl::class X1 {
|        public method GetSibling {} { return [X2 \#auto] }
|    }

|    namespace eval X1 { namespace import ::SampleNS }
|
|    # Further down, or perhaps in a separate file source later:
|
|    ::itcl::class X2 { }
|}

|
|set x [SampleNS::X1 \#auto]
|$x GetSibling ;# Error: invalid command name "X2"

Non-ITcl code can also make use of hierarchical namespaces to better
encapsulate support procedures.  In this example, the child namespace
''private'' illustrates that the ''GetUniqueId'' procedure should not
be used outside of the package; however, ''GetUniqueId'' still has
access to the procedures and variables in the package's main
namespace:

|# MyPackage
|
|namespace eval MyPackage {
|    variable nextId 0
|
|    namespace eval private {
|        proc GetUniqueId {} {
|            variable nextId
|            return "MyPackage.[incr nextId]"
|        }
|    }

|
|    proc CreateObject {} {
|        set name ::[private::GetUniqueId]
|        proc $name args { body }
|        return $name
|    }
|}

~ Specification

Currently, the ''NAME RESOLUTION'' section of the ''namespace''
documentation states:

 > If the name does not start with a :: (i.e., is ''relative''), Tcl
   follows a fixed rule for looking it up: Command and variable names
   are always resolved by looking first in the current namespace, and
   then in the global namespace.  Namespace names, on the other hand,
   are always resolved by looking in only the current namespace.

The proposed change to this is as follows:

 > If the name does not start with a :: (i.e., is ''relative''), Tcl
   follows a fixed rule for looking it up: Command and variable names
   are always resolved by traversing the namespace hierarchy - that
   is, the current namespace is examined first, followed by the
   parent, the parent's parent, and so on, until (finally) the global
   namespace is examined.  Namespace names, on the other hand, are
   always resolved by looking in only the current namespace.

By keeping the current behaviour for namespace names, this TIP affects
only completely unqualified commands and variables (i.e. those that do
not contain ::).  Changing the behaviour of partially qualified names
(those that are relative ''and'' contain ::) is often unintuitive and
can lead to unexpected errors.

~ Consequences

 1. ITcl classes and child namespaces can refer to command and
    variable names in their parent hierarchy without requiring the
    names to be fully qualified.  This improves the intuitiveness and
    readability of Tcl code.  In addition, it can reduce the
    brittleness of the code should parent namespace names undergo a
    change (e.g., ''namespace eval scriptics.com'' to ''namespace eval
    ajubasolutions.com'').

 2. Currently well-defined behaviour is modified.  This can break
    existing code if the following conditions are met:

 > * The code employs the use of namespaces with a depth greater than
     one below the global namespace.

 > * The code creates a variable or procedure in a parent namespace
     with the same name as a variable or procedure in the global
     namespace.

 > * The code in the child namespace uses unscoped names to refer to
     commands and/or variables in the global namespace.

 > A cursory examination of existing Tcl code available on the
   Internet revealed no code which used deeply nested namespaces.

 3. Existing well-defined behaviour of the internal Tcl function
    ''TclGetNamespaceForQualName'' is modified.  Under the sample
    implementation, the ''altNsPtrPtr'' parameter (which currently
    returns a pointer to the global namespace if a name was found
    there) always returns NULL.  It is up to the calling functions
    (e.g., Tcl_FindCommand and Tcl_FindNamespaceVar) to traverse the
    hierarchy.  Although the Tcl and Tk code-base can be modified to
    accommodate this, extensions which depend on this internal
    function may be broken.

~ Namespace History

Namespaces were originally developed by Michael McLennan for ITcl, and
apparently had this hierarchical resolution feature.  When they were
adopted into Tcl, an optimisation was made which led to the current
behaviour.

This TIP argues for the reversal of this decision based on experiences
with the new behaviour.

~ See Also

 * Tcl manual page ''namespace''.

 * Tcl source code file ''tcl8.4a3/generic/tclNamesp.c''.

 * Sample implementation at http://www.kanga.org/tclnamespace/

~ Comments

 * Andreas Kupries:

 > Related information: SF entry [[ #218101 ]] "no man page for library procedures Tcl_AddInterpResolver Tcl" [http://sourceforge.net/tracker/?func=detail&aid=218101&group_id=10894&atid=110894]

 > Not addressed in this TIP: Impact on speed of the interpreter.
(Seeking out mail on Tcl core where author of talks about this)

~ Notice of Withdrawal

This TIP was Withdrawn by the TIP Editor following discussion on the
tcl-core mailing list.  The following is a summary of reasons for
withdrawal:

 > Insufficiently subtle.  52 will break any code that assumes the
   current behaviour (and you can bet someone will have that
   assumption) and 142 doesn't let two namespaces have different
   search paths (unless the variable is always interpreted locally,
   which just creates bizarre variable name magic.)

~ Copyright

Copyright � 2001 by David Cuthbert.  Distribution in whole or part,
with or without annotations, is unlimited.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|
|
|
|
|
|
|
<
<
>
>
|
|

|
|

|
|

|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
|

|
|

|
|
|
|
|
|
|
|
|
<
<
>
>
|
|
|
|
|
<
<
|
>
>
|

|

|

|

|

|
|
|

|

|
|

|

|

|

|

|
|

|
|

|

|

|

|

|

|

|

|
|

|

|
|
|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34

35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53

54
55
56
57
58
59

60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80

81
82
83
84
85
86
87

88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198

# TIP 52: Hierarchical Namespace Lookup of Commands and Variables

	Author:         David Cuthbert <[email protected]>
	Author:         Andreas Kupries <[email protected]>
	State:          Withdrawn
	Type:           Project
	Vote:           Pending
	Created:        09-Aug-2001
	Post-History:   
	Discussions-To: news:comp.lang.tcl
	Keywords:       namespace,lookup,hierarchy
	Tcl-Version:    8.5
-----

# Abstract

This TIP proposes to change the command and variable namespace lookup
system so that the full hierarchy of namespaces is parsed, rather than
just the current namespace followed by the global namespace.  This is
primarily intended to rectify problems often encountered with the use
of [incr Tcl] \(ITcl\) and namespaces.  In addition, package
encapsulation can be enhanced with judicious application of this
feature.

# Rationale

Currently, the following code is invalid in Tcl/ITcl:

	package require Itcl

	namespace eval SampleNS {
	    proc Hello {} { puts "Hello world!" }

	    ::itcl::class X {
	        public constructor {} {} { Hello }

	    }
	}

	SampleNS::X x1  ;# Error: invalid command name "Hello"

This is due to the fact that ITcl classes double as namespaces.
Therefore, the lookup of _Hello_ takes place first in
_::SampleNS::X_, followed by _::_ \(the global namespace\).

The current workaround - to reopen the class' namespace and issue a
_namespace import_ directive - is of limited value since _namespace
import_ is not capable of bringing in names defined later on.  The
following code illustrates this point:

	package require Itcl

	namespace eval SampleNS {
	    ::itcl::class X1 {
	        public method GetSibling {} { return [X2 \#auto] }

	    }
	    namespace eval X1 { namespace import ::SampleNS }

	    # Further down, or perhaps in a separate file source later:

	    ::itcl::class X2 { }

	}

	set x [SampleNS::X1 \#auto]
	$x GetSibling ;# Error: invalid command name "X2"

Non-ITcl code can also make use of hierarchical namespaces to better
encapsulate support procedures.  In this example, the child namespace
_private_ illustrates that the _GetUniqueId_ procedure should not
be used outside of the package; however, _GetUniqueId_ still has
access to the procedures and variables in the package's main
namespace:

	# MyPackage

	namespace eval MyPackage {
	    variable nextId 0

	    namespace eval private {
	        proc GetUniqueId {} {
	            variable nextId
	            return "MyPackage.[incr nextId]"

	        }
	    }

	    proc CreateObject {} {
	        set name ::[private::GetUniqueId]
	        proc $name args { body }
	        return $name

	    }
	}

# Specification

Currently, the _NAME RESOLUTION_ section of the _namespace_
documentation states:

 > If the name does not start with a :: \(i.e., is _relative_\), Tcl
   follows a fixed rule for looking it up: Command and variable names
   are always resolved by looking first in the current namespace, and
   then in the global namespace.  Namespace names, on the other hand,
   are always resolved by looking in only the current namespace.

The proposed change to this is as follows:

 > If the name does not start with a :: \(i.e., is _relative_\), Tcl
   follows a fixed rule for looking it up: Command and variable names
   are always resolved by traversing the namespace hierarchy - that
   is, the current namespace is examined first, followed by the
   parent, the parent's parent, and so on, until \(finally\) the global
   namespace is examined.  Namespace names, on the other hand, are
   always resolved by looking in only the current namespace.

By keeping the current behaviour for namespace names, this TIP affects
only completely unqualified commands and variables \(i.e. those that do
not contain ::\).  Changing the behaviour of partially qualified names
\(those that are relative _and_ contain ::\) is often unintuitive and
can lead to unexpected errors.

# Consequences

 1. ITcl classes and child namespaces can refer to command and
    variable names in their parent hierarchy without requiring the
    names to be fully qualified.  This improves the intuitiveness and
    readability of Tcl code.  In addition, it can reduce the
    brittleness of the code should parent namespace names undergo a
    change \(e.g., _namespace eval scriptics.com_ to _namespace eval
    ajubasolutions.com_\).

 2. Currently well-defined behaviour is modified.  This can break
    existing code if the following conditions are met:

	 > \* The code employs the use of namespaces with a depth greater than
     one below the global namespace.

	 > \* The code creates a variable or procedure in a parent namespace
     with the same name as a variable or procedure in the global
     namespace.

	 > \* The code in the child namespace uses unscoped names to refer to
     commands and/or variables in the global namespace.

	 > A cursory examination of existing Tcl code available on the
   Internet revealed no code which used deeply nested namespaces.

 3. Existing well-defined behaviour of the internal Tcl function
    _TclGetNamespaceForQualName_ is modified.  Under the sample
    implementation, the _altNsPtrPtr_ parameter \(which currently
    returns a pointer to the global namespace if a name was found
    there\) always returns NULL.  It is up to the calling functions
    \(e.g., Tcl\_FindCommand and Tcl\_FindNamespaceVar\) to traverse the
    hierarchy.  Although the Tcl and Tk code-base can be modified to
    accommodate this, extensions which depend on this internal
    function may be broken.

# Namespace History

Namespaces were originally developed by Michael McLennan for ITcl, and
apparently had this hierarchical resolution feature.  When they were
adopted into Tcl, an optimisation was made which led to the current
behaviour.

This TIP argues for the reversal of this decision based on experiences
with the new behaviour.

# See Also

 * Tcl manual page _namespace_.

 * Tcl source code file _tcl8.4a3/generic/tclNamesp.c_.

 * Sample implementation at <http://www.kanga.org/tclnamespace/>

# Comments

 * Andreas Kupries:

	 > Related information: SF entry [ #218101 ] "no man page for library procedures Tcl\_AddInterpResolver Tcl" <http://sourceforge.net/tracker/?func=detail&aid=218101&group_id=10894&atid=110894> 

	 > Not addressed in this TIP: Impact on speed of the interpreter.
\(Seeking out mail on Tcl core where author of talks about this\)

# Notice of Withdrawal

This TIP was Withdrawn by the TIP Editor following discussion on the
tcl-core mailing list.  The following is a summary of reasons for
withdrawal:

 > Insufficiently subtle.  52 will break any code that assumes the
   current behaviour \(and you can bet someone will have that
   assumption\) and 142 doesn't let two namespaces have different
   search paths \(unless the variable is always interpreted locally,
   which just creates bizarre variable name magic.\)

# Copyright

Copyright © 2001 by David Cuthbert.  Distribution in whole or part,
with or without annotations, is unlimited.

Name change from tip/53.tip to tip/53.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93

TIP:            53
Title:          Addition of 'assert' Command
Version:        $Revision: 1.4 $
Author:         Gerald W. Lester <[email protected]>
Author:         Kevin Kenny <[email protected]>
State:          Withdrawn
Type:           Project
Vote:           Pending
Created:        14-Aug-2001
Post-History:   
Keywords:       bytecode,compiler
Tcl-Version:    8.4

~ Abstract

This TIP proposes the addition of an ''assert'' command and supporting
infrastructure to the Tcl core.

~ Rationale

Many languages, including other scripting languages, have assertion
checking features that can be used to assist in validating program
correctness.  Typically, these assertion checking features can be
"compiled out" of production systems so as not to impact performance. 
To have a similar effect in Tcl, the assertion checking features must
be implemented at the byte code compiler level.

If, doing byte code compilation, an assert command is encountered the
byte code stream generated will be dependent on the value of the
''assert_enabled'' command line option.  If the option is true, a byte
code stream will be emitted to implement the assert command.  If the
option is not true, no byte code will be emitted.

Similarly, if the interpreter encounters an ''assert'' command (either
compiled or uncompiled), it will only execute it if the
''assert_enabled'' command line option is true.

It is acceptable for the compiler to throw an error if the
''booleanExpression'' is not brace quoted.

~ Tcl-Level Specification

The manual entry for the ''assert'' command is included here:

----

~NAME

 > assert - Assert a run time validation condition

~SYNOPSIS

|   assert booleanExpression messageText

~DESCRIPTION

 > This command has no effect if the assert_enabled command line
   option is not true at both compile and run time.  If the
   ''assert_enabled'' command line option is true at both compile and
   run time, the following behavior will occur:

 > 1. The ''booleanExpression'' will be evaluated

 > 2. If the ''booleanExpression'' evaluates to a true value,
      ''assert::failed'' will be called at the global level with
      ''messageText'' as its one and only parameter.

 > The default implementation of ''assert::failed'' will write
   ''messageText'' to standard out and ''exit'' with a status code of
   1.

----

~ Remarks

This TIP has been withdrawn because of other changes, both inside and
outside the Tcl core.

 1. The bytecode compiler (8.4a4) contains code that recognizes a
    no-op procedure of the form ''proc no-op args {}'' and generates
    no bytecode if such a procedure is called with arguments that have
    no side effects.

 2. The ''control'' package within tcllib implements a
    ''::control::assert'' procedure that provides all the requested
    functionality.

These two, taken together, provide an implementation of the requested
functionality that is acceptable to the original author of this TIP.

~ Copyright

This TIP is in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

|
|
|

|
|

|

|
|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93

# TIP 53: Addition of 'assert' Command

	Author:         Gerald W. Lester <[email protected]>
	Author:         Kevin Kenny <[email protected]>
	State:          Withdrawn
	Type:           Project
	Vote:           Pending
	Created:        14-Aug-2001
	Post-History:   
	Keywords:       bytecode,compiler
	Tcl-Version:    8.4
-----

# Abstract

This TIP proposes the addition of an _assert_ command and supporting
infrastructure to the Tcl core.

# Rationale

Many languages, including other scripting languages, have assertion
checking features that can be used to assist in validating program
correctness.  Typically, these assertion checking features can be
"compiled out" of production systems so as not to impact performance. 
To have a similar effect in Tcl, the assertion checking features must
be implemented at the byte code compiler level.

If, doing byte code compilation, an assert command is encountered the
byte code stream generated will be dependent on the value of the
_assert\_enabled_ command line option.  If the option is true, a byte
code stream will be emitted to implement the assert command.  If the
option is not true, no byte code will be emitted.

Similarly, if the interpreter encounters an _assert_ command \(either
compiled or uncompiled\), it will only execute it if the
_assert\_enabled_ command line option is true.

It is acceptable for the compiler to throw an error if the
_booleanExpression_ is not brace quoted.

# Tcl-Level Specification

The manual entry for the _assert_ command is included here:

----

# NAME

 > assert - Assert a run time validation condition

# SYNOPSIS

	   assert booleanExpression messageText

# DESCRIPTION

 > This command has no effect if the assert\_enabled command line
   option is not true at both compile and run time.  If the
   _assert\_enabled_ command line option is true at both compile and
   run time, the following behavior will occur:

 > 1. The _booleanExpression_ will be evaluated

 > 2. If the _booleanExpression_ evaluates to a true value,
      _assert::failed_ will be called at the global level with
      _messageText_ as its one and only parameter.

 > The default implementation of _assert::failed_ will write
   _messageText_ to standard out and _exit_ with a status code of
   1.

----

# Remarks

This TIP has been withdrawn because of other changes, both inside and
outside the Tcl core.

 1. The bytecode compiler \(8.4a4\) contains code that recognizes a
    no-op procedure of the form _proc no-op args \{\}_ and generates
    no bytecode if such a procedure is called with arguments that have
    no side effects.

 2. The _control_ package within tcllib implements a
    _::control::assert_ procedure that provides all the requested
    functionality.

These two, taken together, provide an implementation of the requested
functionality that is acceptable to the original author of this TIP.

# Copyright

This TIP is in the public domain.

Name change from tip/54.tip to tip/54.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227

TIP:            54
Title:          Using PURLs to Unite the Tcl Webspace
Version:        $Revision: 1.8 $
Author:         Andreas Kupries <[email protected]>
Author:         Jeff Hobbs <[email protected]>
State:          Withdrawn
Type:           Process
Vote:           Pending
Created:        16-Aug-2001
Post-History:   

~ Abstract

This TIP proposes the use of PURLs to unify the scattered landscape of
Tcl URLs into a coherent set of information about the language, the
community, extensions, etc.

~ Background & Rationale

One of the recurring themes in the community in general (and
news:comp.lang.tcl in particular) is the lack of central website
people can turn to for an introduction to the language, the community,
search for extensions and packages, et cetera.

Most of the solutions proposed so far have the distinctive
disadvantage of not being able to use the existing sites and bind them
into a whole. This is further aggravated by the fact that the
'natural' domain names, like for example http://www.tcl.com/ and
http://www.tcl.org are already taken by other entities, commercial and
not, and thus not available anymore.  We do have control of the http://www.tcl-tk.net/ domain, provided by David Welton.

Instead of giving up at this point I propose to use PURLs a.k.a.
''Persistent URLs'' to construct a virtual website (the ''Tcl space'')
out of all the existing independent efforts. See http://www.purl.org/
for more explanations of PURLs.

Note that PURLs not only can refer to single URLs but to entire
sites. The latter is done through a technique called 'partial
redirection'. This ... is emphasized here because partially redirected
PURL have to be used with a trailing slash whereas PURLs referring to
single URL must not have a trailing slash.

In the lists below partially redirected PURLs are indicated by a
trailing slash.

A restriction we face is that PURLs are case insensitive. This means
that the names we will have to come up with have to be unique even
with case removed.

One of the most important features is the persistency. In real life
however organizations, people, websites, etc. can disappear. According
to http://www.purl.org/OCLC/PURL/FAQ#toc3.14 the PURL stays in
existence but can be redirected to a page detailing the history of the
purl. This would include the decommission. We could also do our own
scheme and redirect the purl to a page explaining the history in a
more Tcl-specific manner (like: Company went out of business, was
acquired, etc.).

~ Specification

This TIP is driven by several conflicting needs:

   * The names will be persistent, so give them some thought before
     creating them; they cannot be undone. This also implies that we
     to set up a simple and minimal structure first so as not to block
     future enhancements and flexibility.

   * Define the structure now before the URN space gets as scattered
     as the URL space is. Note that this process has already
     begun. The PURL resolver at http://www.purl.org/ currently has
     registered 24 Tcl-related PURLs which are not bound together in
     the framework proposed here. Action is necessary to prevent
     further confusion.

Of the existing PURLs the PURL domain ''/tcl'' created by Don Libes is
the most promising one for the unification of the Tcl space. Six of
the 24 aforementioned PURLs are defined below this domain too,
providing a (good) framework on which to build.

The existing PURLs and sub-domains in the ''/tcl'' domain are:

   * ''expect''	- reference to the homepage of the expect extension

   * ''faq''	- reference to the main FAQ

   * ''faqs''	- introduction to the available FAQ documents.

   * ''home/''	- refers to the Tcl Developers Xchange

   * ''tip/''	- refers to the TIP archives

   * ''wiki''	- refers to the entry page of the Tcl'ers WIKI

With the exception of ''expect'' all of these are general classes
and/or refer to important sites. They are used as is, except for
''expect'' which has to be redirected into the proposed sub-domain
''package''.

The following new sub-domains covering the most important general
classes of information and/or websites are proposed here. Please note
that the examples used in the list below are using purely informal
everyday names to refer to entities in the proposed domain. These
examples should not be seen as suggestions for the concrete naming
scheme used by the domain.

   * ''announce'' - Direct reference to a page explaining how to
     announce packages, applications and other tcl-related news and
     linking to the relevant newsgroups, mail archives and submission
     addresses. This includes, but is not restricted to:

   > * A link to the newsgroup ''comp.lang.tcl.announce''.

   > * A link to the eGroups/Yahoo archive of the c.l.t.a newsgroup.

   > * The submission address of c.l.t.a. to directly submit via email
       announcements.

   * ''newsletter'' - Direct reference to an archive of ''Tcl-URL!''.

   * ''package/'' - Sub-domain to contain references to the homepages
     of the known packages. This TIP makes no distinction between
     C-level extensions and script libraries. From the point of view
     of the core these are all packages to be required.

   > Examples of packages are ''Expect'', ''tcllib'', etc.

   * ''application/'' - Sub-domain to contain references to the
     homepages of applications related to Tcl, written in Tcl or using
     it internally.

   > Examples of applications are ''frink'', ''tclHttpd'',
     ''AOLServer'', etc.

   * ''person/'' - Sub-domain to contain references to the homepages of
     people active in the community, as far as they are interested in
     such a reference. References in this domain shall be personal and
     not organization-related. The latter will go into their own
     domain.

   > Examples of people are ''Larry Virden'', ''Cameron Laird'', etc.

   * ''org/'' - Sub-domain to contain references to organizations
     important to the Tcl community.

   > Examples are the Tcl Core Team, the Tcl Core Maintainers,
     Tcl-based based companies (PhaseIt, ActiveState), companies and
     organizations using Tcl (NIST, CAS), etc.

~ Management

The ''/tcl'' domain was created by Don Libes which made him
automatically the maintainer of the domain
[http://www.purl.org/maint/search_user.pl.cgi?userid=^LIBES$]. He has
already extended the maintainership to the entity TCLGROUP
[http://www.purl.org/maint/search_group.pl.cgi?groupid=^TCLGROUP$],
currently consisting of

   * Gordon Johnstone <[email protected]>

   * Jeffrey Hobbs <[email protected]>

   * Don Libes <[email protected]>

   * Andreas Kupries <[email protected]>

   * Larry Virden <[email protected]>

   * Don G. Porter <[email protected]>

   * Jean-Claude Wippler <[email protected]>

For the future I propose that

   * High-level changes to the Tcl space, like new sub-domains, have to
     go through the TCT and the TIP process for approval. This is also
     in line with [0] declaring the responsibility of the TCT for the
     Tcl webspace.

   * The day-to-day routine of adding new packages, persons,
     organizations, etc. is delegated to a new group, the ''Tcl
     Namespace Maintainers''.

   > Initially this group would consist of the people mentioned above,
     with membership open to volunteers from the community.

~ Discussion

   * All of the newly proposed sub-domains will be simple listings
     mapping from the names of the entities contained in them to the
     proper locations. Further categorization of the entities by
     topic, gender or other attributes is out of the scope of this
     TIP. This type of categorization rather is in the domain of
     general and specialized catalogs which can be set up later and
     then bound into the unified webspace proposed here.

   > Note that such catalogs can and should make use of the proposed
     domains to reduce the effort necessary by them to stay current
     with respect to the location of the referenced entities (people,
     packages, etc).

   * The first pre-draft of this TIP contained definitions for the
     names to use in the various domains. These definitions have been
     removed on the grounds that their format and other issues like
     resolution of naming conflicts, order of precedence, etc. are
     best handled in one or more separate documents. The role of this
     TIP is to lay down a framework within which the community can
     operate and not to fill in every conceivable detail.

   > These details can be discussed and decided upon by the group of
     maintainers proposed in the last section.

   * This proposal makes the Tcl community dependent on an external
     entity, namely the maintainers of http://www.purl.org/. This is
     considered acceptable.

~ Example

The following examples show how to use PURLs, using some of the
already existing ones:

    * http://www.purl.org/tcl/tip/ refers to the TIP archive.

    * http://www.purl.org/tcl/wiki/ refers to the Tcl'ers Wiki.

~ Copyright

This document is in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|
|

|
|
|

|
|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|
|

|

|

|

|
|
|

|

|

|

|

|

|

|
|

|

|

|

|
|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227

# TIP 54: Using PURLs to Unite the Tcl Webspace

	Author:         Andreas Kupries <[email protected]>
	Author:         Jeff Hobbs <[email protected]>
	State:          Withdrawn
	Type:           Process
	Vote:           Pending
	Created:        16-Aug-2001
	Post-History:   
-----

# Abstract

This TIP proposes the use of PURLs to unify the scattered landscape of
Tcl URLs into a coherent set of information about the language, the
community, extensions, etc.

# Background & Rationale

One of the recurring themes in the community in general \(and
news:comp.lang.tcl in particular\) is the lack of central website
people can turn to for an introduction to the language, the community,
search for extensions and packages, et cetera.

Most of the solutions proposed so far have the distinctive
disadvantage of not being able to use the existing sites and bind them
into a whole. This is further aggravated by the fact that the
'natural' domain names, like for example <http://www.tcl.com/> and
<http://www.tcl.org> are already taken by other entities, commercial and
not, and thus not available anymore.  We do have control of the <http://www.tcl-tk.net/> domain, provided by David Welton.

Instead of giving up at this point I propose to use PURLs a.k.a.
_Persistent URLs_ to construct a virtual website \(the _Tcl space_\)
out of all the existing independent efforts. See <http://www.purl.org/>
for more explanations of PURLs.

Note that PURLs not only can refer to single URLs but to entire
sites. The latter is done through a technique called 'partial
redirection'. This ... is emphasized here because partially redirected
PURL have to be used with a trailing slash whereas PURLs referring to
single URL must not have a trailing slash.

In the lists below partially redirected PURLs are indicated by a
trailing slash.

A restriction we face is that PURLs are case insensitive. This means
that the names we will have to come up with have to be unique even
with case removed.

One of the most important features is the persistency. In real life
however organizations, people, websites, etc. can disappear. According
to <http://www.purl.org/OCLC/PURL/FAQ\#toc3.14> the PURL stays in
existence but can be redirected to a page detailing the history of the
purl. This would include the decommission. We could also do our own
scheme and redirect the purl to a page explaining the history in a
more Tcl-specific manner \(like: Company went out of business, was
acquired, etc.\).

# Specification

This TIP is driven by several conflicting needs:

   * The names will be persistent, so give them some thought before
     creating them; they cannot be undone. This also implies that we
     to set up a simple and minimal structure first so as not to block
     future enhancements and flexibility.

   * Define the structure now before the URN space gets as scattered
     as the URL space is. Note that this process has already
     begun. The PURL resolver at <http://www.purl.org/> currently has
     registered 24 Tcl-related PURLs which are not bound together in
     the framework proposed here. Action is necessary to prevent
     further confusion.

Of the existing PURLs the PURL domain _/tcl_ created by Don Libes is
the most promising one for the unification of the Tcl space. Six of
the 24 aforementioned PURLs are defined below this domain too,
providing a \(good\) framework on which to build.

The existing PURLs and sub-domains in the _/tcl_ domain are:

   * _expect_	- reference to the homepage of the expect extension

   * _faq_	- reference to the main FAQ

   * _faqs_	- introduction to the available FAQ documents.

   * _home/_	- refers to the Tcl Developers Xchange

   * _tip/_	- refers to the TIP archives

   * _wiki_	- refers to the entry page of the Tcl'ers WIKI

With the exception of _expect_ all of these are general classes
and/or refer to important sites. They are used as is, except for
_expect_ which has to be redirected into the proposed sub-domain
_package_.

The following new sub-domains covering the most important general
classes of information and/or websites are proposed here. Please note
that the examples used in the list below are using purely informal
everyday names to refer to entities in the proposed domain. These
examples should not be seen as suggestions for the concrete naming
scheme used by the domain.

   * _announce_ - Direct reference to a page explaining how to
     announce packages, applications and other tcl-related news and
     linking to the relevant newsgroups, mail archives and submission
     addresses. This includes, but is not restricted to:

	   > \* A link to the newsgroup _comp.lang.tcl.announce_.

	   > \* A link to the eGroups/Yahoo archive of the c.l.t.a newsgroup.

	   > \* The submission address of c.l.t.a. to directly submit via email
       announcements.

   * _newsletter_ - Direct reference to an archive of _Tcl-URL!_.

   * _package/_ - Sub-domain to contain references to the homepages
     of the known packages. This TIP makes no distinction between
     C-level extensions and script libraries. From the point of view
     of the core these are all packages to be required.

	   > Examples of packages are _Expect_, _tcllib_, etc.

   * _application/_ - Sub-domain to contain references to the
     homepages of applications related to Tcl, written in Tcl or using
     it internally.

	   > Examples of applications are _frink_, _tclHttpd_,
     _AOLServer_, etc.

   * _person/_ - Sub-domain to contain references to the homepages of
     people active in the community, as far as they are interested in
     such a reference. References in this domain shall be personal and
     not organization-related. The latter will go into their own
     domain.

	   > Examples of people are _Larry Virden_, _Cameron Laird_, etc.

   * _org/_ - Sub-domain to contain references to organizations
     important to the Tcl community.

	   > Examples are the Tcl Core Team, the Tcl Core Maintainers,
     Tcl-based based companies \(PhaseIt, ActiveState\), companies and
     organizations using Tcl \(NIST, CAS\), etc.

# Management

The _/tcl_ domain was created by Don Libes which made him
automatically the maintainer of the domain
<http://www.purl.org/maint/search_user.pl.cgi?userid=^LIBES$> . He has
already extended the maintainership to the entity TCLGROUP
<http://www.purl.org/maint/search_group.pl.cgi?groupid=^TCLGROUP$> ,
currently consisting of

   * Gordon Johnstone <[email protected]>

   * Jeffrey Hobbs <[email protected]>

   * Don Libes <[email protected]>

   * Andreas Kupries <andreas\[email protected]>

   * Larry Virden <[email protected]>

   * Don G. Porter <[email protected]>

   * Jean-Claude Wippler <[email protected]>

For the future I propose that

   * High-level changes to the Tcl space, like new sub-domains, have to
     go through the TCT and the TIP process for approval. This is also
     in line with [[0]](0.md) declaring the responsibility of the TCT for the
     Tcl webspace.

   * The day-to-day routine of adding new packages, persons,
     organizations, etc. is delegated to a new group, the _Tcl
     Namespace Maintainers_.

	   > Initially this group would consist of the people mentioned above,
     with membership open to volunteers from the community.

# Discussion

   * All of the newly proposed sub-domains will be simple listings
     mapping from the names of the entities contained in them to the
     proper locations. Further categorization of the entities by
     topic, gender or other attributes is out of the scope of this
     TIP. This type of categorization rather is in the domain of
     general and specialized catalogs which can be set up later and
     then bound into the unified webspace proposed here.

	   > Note that such catalogs can and should make use of the proposed
     domains to reduce the effort necessary by them to stay current
     with respect to the location of the referenced entities \(people,
     packages, etc\).

   * The first pre-draft of this TIP contained definitions for the
     names to use in the various domains. These definitions have been
     removed on the grounds that their format and other issues like
     resolution of naming conflicts, order of precedence, etc. are
     best handled in one or more separate documents. The role of this
     TIP is to lay down a framework within which the community can
     operate and not to fill in every conceivable detail.

	   > These details can be discussed and decided upon by the group of
     maintainers proposed in the last section.

   * This proposal makes the Tcl community dependent on an external
     entity, namely the maintainers of <http://www.purl.org/.> This is
     considered acceptable.

# Example

The following examples show how to use PURLs, using some of the
already existing ones:

    * <http://www.purl.org/tcl/tip/> refers to the TIP archive.

    * <http://www.purl.org/tcl/wiki/> refers to the Tcl'ers Wiki.

# Copyright

This document is in the public domain.

Name change from tip/55.tip to tip/55.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
TIP:            55
Title:          Package Format for Tcl Extensions
Version:        $Revision: 1.18 $
Author:         Steve Cassidy <[email protected]>
Author:         Larry W. Virden <[email protected]>
State:          Draft
Type:           Informative
Vote:           No voting
Created:        16-Aug-2001
Post-History:   

~ Abstract

This document specifies the contents of a binary distribution of a Tcl
package, especially directory structure and required files, suitable
for automated installation into an existing Tcl installation.

~ Rationale

There is currently no standard way of distributing or installing a Tcl
extension package.  The TEA document defines a standard interface to
''building'' packages and includes an ''install'' target but
presumes that the packages is being installed on the same machine as
it was built. This TIP defines a directory structure and assorted
files for the binary distribution of a package which can be placed
into an archive (for example zip or tar file) and transferred for
installation on another machine.  A basic mechanism for installation of
packages is also described.

~ Definitions

The following definitions are excerpted from [78]:

 package: A collection of files providing additional functionality to
   a user of a Tcl interpreter when loaded into said interpreter.

 > Some files in a package implement the provided functionality
   whereas other files contain metadata required by the package
   management of Tcl to be able to use the package.

 distribution: An encapsulation of one or more ''packages'' for
   transport between places, machines, organizations, and people.

 shared library: A piece of binary code that provides a set of
   operations and data structures like a normal library, but which does
   not need to be physically incorporated into the executables that
   use it until they are actually executed. This is the normal way to
   distribute binary code for a Tcl package such that it can be
   incorporated into a Tcl interpreter with the ''load'' command. On
   Windows, shared libraries are known as DLLs, on the Macintosh ...

~ References

Much of the required structure for an installable distribution is
defined by the requirements of Tcl's existing package loading methods.
The structure of an installable distribution should largely mirror
the structure of an installed package where possible.

The R system (a statistical package [http://www.r-project.org/]) has a
well defined package format which enables automatic installation of new
packages and integration of documentation and demonstration programs
for these with that of the main R system.

A number of packaging and installation systems (for example, Debian
[http://www.debian.org] and RPM [http://www.redhat.com]) have been
developed by the Linux community which provide an interesting range of
facilities.  These systems commonly provide facilities for pre and post
installation scripts and pre and post removal scripts to help set up
and shut down packages.  Also included are detailed dependency
relations between packages which can be used by an installer to ensure
that a package will work once it is installed or warn of potential
conflicts after installation.

A significant part of this proposal is the proposed format of the
package metadata which derives from other metadata standardisation
efforts, mainly the Dublin Core [http://purl.org/dc/] and the Resource
Description Framework [http://www.w3.org/RDF].

~ Requirements

The simplest case of a Tcl package is one that contains only Tcl code;
these will be considered first, and the additional issues raised by
packages containing compiled code will be dealt with later.

The minimum contents of a Tcl only package are defined by the
requirements of [[package require xyzzy]].  The package needs to be
placed in a directory on the ''auto_path'' and must contain one or more
''.tcl'' files which implement the functionality provided by the
package.

In addition to these files, it is useful to include documentation for
the commands implemented by the package and some additional metadata
about the author etc.  Distributions might also optionally include
demonstration scripts and applications illustrating their use, these
could either be incorporated into the documentation or included as
stand-alone Tcl files.

Distributions which include shared libraries add an additional layer of
complexity since these will only run on the platforms for which they
have been compiled.  There are two clear options here: either
distributions are platform specific, intended for installation on one
platform alone, or the structure of the distribution is extended to
allow the option of including multiple shared libraries.  The latter
option would allow a single installation to serve multiple platforms
and so should be preferred although this TIP will not ''require'' a
distribution to support multiple platforms.

~ Proposed Directory Structure

The following directory structure is proposed for an installable
distribution:

|  packagename$version
|      + DESCRIPTION.txt  -- Metadata, description of the package
|      + doc/             -- documentation
|      + examples/        -- example scripts and applications
|      + $architecture/   -- shared library directories
|      + pkgIndex.tcl     -- package index file (optional)

In addition, a distribution may include any additional files or
directories required for its operation.

''DESCRIPTION'' is a file containing metadata about the
package(s) contained in the distribution. Its format will be described
in a later section of this document.

The file ''pkgIndex.tcl'' currently required by the package-loading
mechanism of the Tcl core is ''optionally'' distributed. In most cases,
it will be generated by the installer; all the information which is
necessary to do this is part of the distribution.  Distribution authors
should only include ''pkgIndex.tcl'' if special features of their
distribution mean that the generated file would not work.

If the ''pkgIndex.tcl'' file is included in the distribution it should
load files from their locations within the distribution directory
structure. For example, Tcl files should be loaded from the ''tcl''
directory.

''doc/'' directory contains documentation in an accepted format.
Currently Tcl documentation is delivered either in source form (nroff
or TMML) or as HTML files.  Given the lack of a standard cross platform
solution, this TIP does not require a specific format; however, the
inclusion of either a text or HTML formatted help file is strongly
encouraged.  If HTML formatted help is included the main file should be
named ''index.html'' or ''index.htm'' so that it can be linked to a
central web page.  If only plain text documentation is included there
should be a file called ''readme.txt'' (in either upper or lower case)
which will serve as the top level documentation file.

''examples/'' directory contains one or more Tcl files giving examples
of the use of this package. These should be complete scripts
suitable for either sourcing in tclsh/wish or running from the command
line. The examples should be self contained and any external data
should be included in files in this directory or a sub-directory.  This
directory should contain a file ''readme.txt'' which explains how to
run the examples and provides a commentary on what they do.

''$architecture'' directories contain shared libraries for various
platforms. The special architecture ''tcl'' is used for Tcl script
files. They either implement the package or contain companion procedure
definitions to the shared libraries of the package.

The distribution need not provide all possible combinations of
architectures and may only provide one shared library.  This structure
is proposed to allow shared libraries to co-exist in a multi-platform
environment and to allow binary packages to be distributed in
multi-platform distributions.  The architectures included in the
distribution should be named in the DESCRIPTION.txt file.

The possible values of $architecture and methods for generating them
are discussed in a later section.

~ Metadata

This section defines the metadata describing the package contained in
the distribution in a format-neutral way. The model for this data is
that provided by the Resource Description Framework (RDF
[http://www.w3.org/rdf]) which defines a triple based data model.  The
RDF model defines objects, their properties and relationships between
them.  In addition, where possible, element names are taken from the
Dublin Core Metadata Element Set
[http://dublincore.org/documents/1999/07/02/dces/] which defines a
standard set of element names for metadata. Dublin Core names are
marked with DC in parentheses in the following list.

In a package description, the object being described is the package
itself, hence the element names are all intended to describe
packages. Other objects might be described including people and
organisations. The package description should not include these
objects but a package repository might store them separately keyed on
the values stored in this description (e.g. email addresses of creators).

 * ''Identifier'' (DC)

 > This element is a string containing the name of the distributed
   package. The name may consist only of alphanumeric characters,
   colons, dashes and underscores.  This name should correspond to the
   name of
   the package defined by this distribution (that is, the code should
   contain ''package provide xyzzy'' where ''xyzzy'' is the value of
   this element.

 > Care must be taken to make this name unique among the package names
   in the archive. To overcome this, namespace style names separated by
   double colons should be used.

 > Examples: xyzzy, tcllib, xml::soap, cassidy::wonderful-package_2

 * ''Version''

 > This element is a string containing the version of the
   package. It consists of 4 components separated by full stops. The
   components are ''major version'', ''minor version'', ''maturity''
   and ''level''; and are written in this order.

 > The major and minor version components are integer numbers greater
   than or equal to zero.

 > The component ''maturity'' is restricted to the values a, b.
   The represent the maturity states ''alpha'', ''beta''
   respectively. For a production release, this component can be omitted.

 > The ''level'' component allows a more fine-grained differentiation
   of maturity levels.  When a package has maturity ''production'' the
   ''level'' component is often called the ''patchlevel'' of the package.
   If the ''level'' component is zero, it may be omitted.

 > The period each side of the ''maturity'' component may be omitted.

 > Valid version numbers can be decoded via the following regular
   expression:

|regexp {([0-9]+)\.([0-9]+)\.?([ab])?\.?([0-9]*)} $ver => major minor maturity level

 > Examples: 8.4.0  8.4a1 2.5.b.5

 * ''Title'' (DC)

 > This element is a free form string containing a one sentence
   description of the package contained in the distribution.

 > Example: Installer Tools for Tcl Packages

 * ''Creator'' (DC)

 > This element is a string containing the name of the person,
   organisation or service responsible for the creation of the
   package optionally followed by the email address of the author in
   angle brackets [http://www.faqs.org/rfcs/rfc2822.html]. More detail
   about an author can be provided in a separate object in the RDF
   description and if this is provided the email address should be used
   as the value of the Name field in that object.

 > If there is more than one author this field may appear multiple
   times.

 > Email addresses may be obfuscated to avoid spam harvesters.

 > Example: Steve Cassidy <Steve.Cassidy at mq dot edu dot au>

 * ''Contributor'' (DC)

 > This element is a string analogous to the Creator element which
   contains the name of a contributor to the package.

 * ''Rights'' (DC)

 > Typically, a Rights element will contain a rights management
   statement for the resource, or reference a service providing such
   information.  This will usually be a reference to the license under
   which the package is distributed. This can be a free form string
   naming the license or a URL referring to a document containing the
   text of the license.

 > If the Rights element is absent, no assumptions can be made
   about the status of these and other rights with respect to
   the resource.

 > Examples: BSD, http://www.opensource.org/licenses/artistic-license.html

 * ''URL''

 > This element is a string containing an url referring to a
   document or site at which the information about the package can be
   found. This url is ''not'' the location of the distribution, as this
   might be part of a larger repository separate from the package site.

 > Example: http://www.shlrc.mq.edu.au/~steve/tcl/

 * ''Available'' (DC)

 > This element is the release data of the package in the form YYYY-MM-DD.

 > YYYY is a four-digit integer number greater than zero denoting the
   year the distribution was released.

 > MM is a two-digit integer number greater than zero and less than
   13. It is padded with zero at the front if it less than 10. It
   denotes the month the distribution was released. The number 1
   represents January, 2 represents February; and 12 represents December.

 > DD is a two-digit integer number greater than zero and less than 32.
   It is and padded with zero at the front if less than 10. It denotes
   the day in the month the distribution was released.

 > A valid data string can be obtained with the Tcl command
   [[clock format [clock seconds] -format "%Y-%m-%d"]]

 > Example: 2002-01-23

 > (The DC element is Date but it can be refined to Created,
   Available, Applies)

 * ''Description'' (DC)

 > This element is a free form string briefly describing the package.

 * ''Architecture''

 > This element is a string describing one of the architectures
  included in the distribution. As a distribution is allowed to
  contain the files for several architectures, this element may
  appear multiple times and should correspond to a directory in the
  distribution.

 * ''Require''

 > Names a package that must be installed for this package to operate
   properly. This should have the same format as the ''package
   require'' command, eg. ''?-exact? package ?version?''.

 > Example: http 2.0

 * ''Recommend''

 > Declares a strong, but not absolute dependency on another package.
   In most cases this package should be installed unless the user has
   specific reasons not to install them.

 * ''Suggest''

 > Declares a package which would enhance the functionality of this
   package but which is not a requirement for the basic functionality
   of the package.

 * ''Conflict''

 > Names a package with which can't be installed alongside this
   package. The syntax is the same as for Require.  If a conflicting
   package is present on the system, an installer might offer an option
   of removing it or not installing this package.

 * ''Subject'' (DC)

 > The topic or content of the package expressed as a set of Keywords.
   At some future time, a set of canonical keywords may be established
   by a repository manager.

The following Dublin Core elements were not included in the standard
set above but may be used in a package description if appropriate.

 * ''Publisher''

 > An entity responsible for making the package available.

 * ''Type''

 > The nature or genre of the content of the resource. For a Tcl
   package the value of this element would be Software if the DCMI
   Type Vocabulary
   [http://au.dublincore.org/documents/2000/07/11/dcmi-type-vocabulary/]
   was used.  A more useful set of types might be developed in the
   future for Tcl packages.

 * ''Format''

 > The physical or digital manifestation of the resource.  This might
   be used by archive maintainers to specify the format of a package
   archive, eg. zip, tar etc.

 * ''Source''

 > A Reference to a resource from which the present resource is derived.

 * ''Language''

 > A language of the intellectual content of the resource.  Could be
   used if multi-language packages are available. Should use the two
   letter language code defined by RFC 1766, eg. 'fr' for French, 'en'
   for English.

~ Encoding of the Metadata

The primary means of storing RDF data is using XML but it can be stored
in many other formats.  This TIP prescribes a simple text based
encoding according to the RFC 2822 format which is described in this
section. Data stored in this format can be converted to XML format for
use by other tools, similarly XML formatted descriptions can be
converted into this text format without loss of information.

The text format description is stored in the file ''DESCRIPTION.txt''.
The XML formatted version of the data may be stored in the file
''DESCRIPTION.rdf'' within the archive and may be automatically
generated if not present.

The general format of this file is that of a RFC 2822 mail message,
without body and using custom headers. The available headers are the
case-independent logical names from the preceding section but may be
augmented by other fields defined by repository maintainers or other
applications. The headers are allowed appear in any order.

Example:

|  Identifier: stemmer
|  Version: 1.0.0
|  Title: A stemmer for English.
|  Creator: Steve Cassidy <[email protected]>
|  Description:   Provides a procedure to remove any prefixes or suffixes on
|         a word to give the word stem. Uses Porter's algorithm to do this
|         in an intelligent manner with an accuracy of around 80%.
|  Rights: BSD
|  URL: http://www.shlrc.mq.edu.au/emu/tcl/
|  Available: 2001-08-16
|  Architecture: tcl
|  Subject: linguistics
|  Subject: text

~ Combination Distributions

It is often useful to combine a number of related packages so that they
can be installed together to provide a certain kind of functionality,
for example, web page production tools or database access.  Perl uses
the term ''Bundle'' to refer to such a group of related packages.
There are two alternative mechanisms for distribution of such a package
within the mechanisms suggested here.
Firstly, since a distribution may contain more than one package, the
set of files making up the various packages could be combined together
and described by a single DESCRIPTION.txt file.  This is similar to the
way that tcllib is currently distributed.  The disadvantage would be
that all of the Tcl files implementing these packages would have to
reside in the same directory which could cause name clashes.

The second alternative is to create a distribution consisting of only a
DESCRIPTION.txt file to describe which Requires the component packages
causing them to be installed from the repository. For example, tcllib
might be described as follows:

|  Identifier: tcllib
|  Version: 1.0.0
|  Title: The Standard Tcl Library
|  Description:  This package is intended to be a collection of Tcl
|             packages that provide utility functions useful to a large
|             collection of Tcl programmers.
|  Rights: BSD
|  URL: http://sourceforge.net/projects/tcllib
|  Contributor: Andreas Kupries  <andreas_kupries at users dot sourceforge dot net>
|  Contributor: Don Porter <dgp at users dot sourceforge dot net>
|  Require: base64
|  Require: cmdline
|  Require: csv
|  ...

Installing tcllib would cause the installer to fetch base64, cmdline,
csv etc from the repository and install them in order to satisfy the
tcllib requirement.  A new pkgIndex.tcl file could be constructed to
load all of these packages if ''[[package require tcllib]]'' was called.

~ Architecture

Possible values for $architecture in the directory structure include:

 * the value of
   ''tcl_platform(platform)'': windows, unix, macintosh

 * a composite of tcl_platform
   values: ''$tcl_platform(machine)-$tcl_platform(os)-$tcl_platform(osVersion)''

 * a canonical system name as returned by
   ''config.guess'': ''i686-pc-linux-gnu''

~ Installing Packages

A package structured according to this TIP can be installed using the following
steps:

  1. Download the package archive (eg. zip file)

  2. Locate a writable directory included on $auto_path (or ask for a
     installation directory)

  3. Unpack the archive in the desired location.

  4. Run pkg_mkIndex with appropriate arguments to generate a
     pkgIndex.tcl file if none is present. Arguments will include the
     appropriate Architecture directories for the platform.

  5. ''(optional)'' link help files and demos to the central index.

~ Alternatives

Alternatives might be considered for the package DESCRIPTION.txt file, for
the documentation directory and for the location of shared libraries.

An alternative for package description file is to include an
alternative package description, for example the XML based ``ppd''
format used to describe Perl packages on the ActiveState Perl package
repository. The main motivation for the simple format proposed is that
it is trivial for authors to write and trivial for programs to read and
can be transformed into standards based RDF XML.  The use of the DC
element names means that search engines etc. will be able to usefully
index the packages in a repository.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|
|

|
|

|

|
|
|

|

|

|
|
|
|
|
|

|
|

|
|

|

|

|

|
|
|

|

|

|

|

|
|

|

|
|

|

|

|

|

|
|

|

|

|

|

|
|

|

|
|

|
|
|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|
|

|

|

|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|

|

|

|

|
|

|

|

|

|
|

|

|

|

|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511

# TIP 55: Package Format for Tcl Extensions

	Author:         Steve Cassidy <[email protected]>
	Author:         Larry W. Virden <[email protected]>
	State:          Draft
	Type:           Informative
	Vote:           No voting
	Created:        16-Aug-2001
	Post-History:   
-----

# Abstract

This document specifies the contents of a binary distribution of a Tcl
package, especially directory structure and required files, suitable
for automated installation into an existing Tcl installation.

# Rationale

There is currently no standard way of distributing or installing a Tcl
extension package.  The TEA document defines a standard interface to
_building_ packages and includes an _install_ target but
presumes that the packages is being installed on the same machine as
it was built. This TIP defines a directory structure and assorted
files for the binary distribution of a package which can be placed
into an archive \(for example zip or tar file\) and transferred for
installation on another machine.  A basic mechanism for installation of
packages is also described.

# Definitions

The following definitions are excerpted from [[78]](78.md):

 package: A collection of files providing additional functionality to
   a user of a Tcl interpreter when loaded into said interpreter.

 > Some files in a package implement the provided functionality
   whereas other files contain metadata required by the package
   management of Tcl to be able to use the package.

 distribution: An encapsulation of one or more _packages_ for
   transport between places, machines, organizations, and people.

 shared library: A piece of binary code that provides a set of
   operations and data structures like a normal library, but which does
   not need to be physically incorporated into the executables that
   use it until they are actually executed. This is the normal way to
   distribute binary code for a Tcl package such that it can be
   incorporated into a Tcl interpreter with the _load_ command. On
   Windows, shared libraries are known as DLLs, on the Macintosh ...

# References

Much of the required structure for an installable distribution is
defined by the requirements of Tcl's existing package loading methods.
The structure of an installable distribution should largely mirror
the structure of an installed package where possible.

The R system \(a statistical package <http://www.r-project.org/> \) has a
well defined package format which enables automatic installation of new
packages and integration of documentation and demonstration programs
for these with that of the main R system.

A number of packaging and installation systems \(for example, Debian
<http://www.debian.org>  and RPM <http://www.redhat.com> \) have been
developed by the Linux community which provide an interesting range of
facilities.  These systems commonly provide facilities for pre and post
installation scripts and pre and post removal scripts to help set up
and shut down packages.  Also included are detailed dependency
relations between packages which can be used by an installer to ensure
that a package will work once it is installed or warn of potential
conflicts after installation.

A significant part of this proposal is the proposed format of the
package metadata which derives from other metadata standardisation
efforts, mainly the Dublin Core <http://purl.org/dc/>  and the Resource
Description Framework <http://www.w3.org/RDF> .

# Requirements

The simplest case of a Tcl package is one that contains only Tcl code;
these will be considered first, and the additional issues raised by
packages containing compiled code will be dealt with later.

The minimum contents of a Tcl only package are defined by the
requirements of [package require xyzzy].  The package needs to be
placed in a directory on the _auto\_path_ and must contain one or more
_.tcl_ files which implement the functionality provided by the
package.

In addition to these files, it is useful to include documentation for
the commands implemented by the package and some additional metadata
about the author etc.  Distributions might also optionally include
demonstration scripts and applications illustrating their use, these
could either be incorporated into the documentation or included as
stand-alone Tcl files.

Distributions which include shared libraries add an additional layer of
complexity since these will only run on the platforms for which they
have been compiled.  There are two clear options here: either
distributions are platform specific, intended for installation on one
platform alone, or the structure of the distribution is extended to
allow the option of including multiple shared libraries.  The latter
option would allow a single installation to serve multiple platforms
and so should be preferred although this TIP will not _require_ a
distribution to support multiple platforms.

# Proposed Directory Structure

The following directory structure is proposed for an installable
distribution:

	  packagename$version
	      + DESCRIPTION.txt  -- Metadata, description of the package
	      + doc/             -- documentation
	      + examples/        -- example scripts and applications
	      + $architecture/   -- shared library directories
	      + pkgIndex.tcl     -- package index file (optional)

In addition, a distribution may include any additional files or
directories required for its operation.

_DESCRIPTION_ is a file containing metadata about the
package\(s\) contained in the distribution. Its format will be described
in a later section of this document.

The file _pkgIndex.tcl_ currently required by the package-loading
mechanism of the Tcl core is _optionally_ distributed. In most cases,
it will be generated by the installer; all the information which is
necessary to do this is part of the distribution.  Distribution authors
should only include _pkgIndex.tcl_ if special features of their
distribution mean that the generated file would not work.

If the _pkgIndex.tcl_ file is included in the distribution it should
load files from their locations within the distribution directory
structure. For example, Tcl files should be loaded from the _tcl_
directory.

_doc/_ directory contains documentation in an accepted format.
Currently Tcl documentation is delivered either in source form \(nroff
or TMML\) or as HTML files.  Given the lack of a standard cross platform
solution, this TIP does not require a specific format; however, the
inclusion of either a text or HTML formatted help file is strongly
encouraged.  If HTML formatted help is included the main file should be
named _index.html_ or _index.htm_ so that it can be linked to a
central web page.  If only plain text documentation is included there
should be a file called _readme.txt_ \(in either upper or lower case\)
which will serve as the top level documentation file.

_examples/_ directory contains one or more Tcl files giving examples
of the use of this package. These should be complete scripts
suitable for either sourcing in tclsh/wish or running from the command
line. The examples should be self contained and any external data
should be included in files in this directory or a sub-directory.  This
directory should contain a file _readme.txt_ which explains how to
run the examples and provides a commentary on what they do.

_$architecture_ directories contain shared libraries for various
platforms. The special architecture _tcl_ is used for Tcl script
files. They either implement the package or contain companion procedure
definitions to the shared libraries of the package.

The distribution need not provide all possible combinations of
architectures and may only provide one shared library.  This structure
is proposed to allow shared libraries to co-exist in a multi-platform
environment and to allow binary packages to be distributed in
multi-platform distributions.  The architectures included in the
distribution should be named in the DESCRIPTION.txt file.

The possible values of $architecture and methods for generating them
are discussed in a later section.

# Metadata

This section defines the metadata describing the package contained in
the distribution in a format-neutral way. The model for this data is
that provided by the Resource Description Framework \(RDF
<http://www.w3.org/rdf> \) which defines a triple based data model.  The
RDF model defines objects, their properties and relationships between
them.  In addition, where possible, element names are taken from the
Dublin Core Metadata Element Set
<http://dublincore.org/documents/1999/07/02/dces/>  which defines a
standard set of element names for metadata. Dublin Core names are
marked with DC in parentheses in the following list.

In a package description, the object being described is the package
itself, hence the element names are all intended to describe
packages. Other objects might be described including people and
organisations. The package description should not include these
objects but a package repository might store them separately keyed on
the values stored in this description \(e.g. email addresses of creators\).

 * _Identifier_ \(DC\)

	 > This element is a string containing the name of the distributed
   package. The name may consist only of alphanumeric characters,
   colons, dashes and underscores.  This name should correspond to the
   name of
   the package defined by this distribution \(that is, the code should
   contain _package provide xyzzy_ where _xyzzy_ is the value of
   this element.

	 > Care must be taken to make this name unique among the package names
   in the archive. To overcome this, namespace style names separated by
   double colons should be used.

	 > Examples: xyzzy, tcllib, xml::soap, cassidy::wonderful-package\_2

 * _Version_

	 > This element is a string containing the version of the
   package. It consists of 4 components separated by full stops. The
   components are _major version_, _minor version_, _maturity_
   and _level_; and are written in this order.

	 > The major and minor version components are integer numbers greater
   than or equal to zero.

	 > The component _maturity_ is restricted to the values a, b.
   The represent the maturity states _alpha_, _beta_
   respectively. For a production release, this component can be omitted.

	 > The _level_ component allows a more fine-grained differentiation
   of maturity levels.  When a package has maturity _production_ the
   _level_ component is often called the _patchlevel_ of the package.
   If the _level_ component is zero, it may be omitted.

	 > The period each side of the _maturity_ component may be omitted.

	 > Valid version numbers can be decoded via the following regular
   expression:

		regexp {([0-9]+)\.([0-9]+)\.?([ab])?\.?([0-9]*)} $ver => major minor maturity level

	 > Examples: 8.4.0  8.4a1 2.5.b.5

 * _Title_ \(DC\)

	 > This element is a free form string containing a one sentence
   description of the package contained in the distribution.

	 > Example: Installer Tools for Tcl Packages

 * _Creator_ \(DC\)

	 > This element is a string containing the name of the person,
   organisation or service responsible for the creation of the
   package optionally followed by the email address of the author in
   angle brackets <http://www.faqs.org/rfcs/rfc2822.html> . More detail
   about an author can be provided in a separate object in the RDF
   description and if this is provided the email address should be used
   as the value of the Name field in that object.

	 > If there is more than one author this field may appear multiple
   times.

	 > Email addresses may be obfuscated to avoid spam harvesters.

	 > Example: Steve Cassidy <Steve.Cassidy at mq dot edu dot au>

 * _Contributor_ \(DC\)

	 > This element is a string analogous to the Creator element which
   contains the name of a contributor to the package.

 * _Rights_ \(DC\)

	 > Typically, a Rights element will contain a rights management
   statement for the resource, or reference a service providing such
   information.  This will usually be a reference to the license under
   which the package is distributed. This can be a free form string
   naming the license or a URL referring to a document containing the
   text of the license.

	 > If the Rights element is absent, no assumptions can be made
   about the status of these and other rights with respect to
   the resource.

	 > Examples: BSD, <http://www.opensource.org/licenses/artistic-license.html>

 * _URL_

	 > This element is a string containing an url referring to a
   document or site at which the information about the package can be
   found. This url is _not_ the location of the distribution, as this
   might be part of a larger repository separate from the package site.

	 > Example: <http://www.shlrc.mq.edu.au/~steve/tcl/>

 * _Available_ \(DC\)

	 > This element is the release data of the package in the form YYYY-MM-DD.

	 > YYYY is a four-digit integer number greater than zero denoting the
   year the distribution was released.

	 > MM is a two-digit integer number greater than zero and less than
   13. It is padded with zero at the front if it less than 10. It
   denotes the month the distribution was released. The number 1
   represents January, 2 represents February; and 12 represents December.

	 > DD is a two-digit integer number greater than zero and less than 32.
   It is and padded with zero at the front if less than 10. It denotes
   the day in the month the distribution was released.

	 > A valid data string can be obtained with the Tcl command
   [clock format [clock seconds] -format "%Y-%m-%d"]

	 > Example: 2002-01-23

	 > \(The DC element is Date but it can be refined to Created,
   Available, Applies\)

 * _Description_ \(DC\)

	 > This element is a free form string briefly describing the package.

 * _Architecture_

	 > This element is a string describing one of the architectures
  included in the distribution. As a distribution is allowed to
  contain the files for several architectures, this element may
  appear multiple times and should correspond to a directory in the
  distribution.

 * _Require_

	 > Names a package that must be installed for this package to operate
   properly. This should have the same format as the _package
   require_ command, eg. _?-exact? package ?version?_.

	 > Example: http 2.0

 * _Recommend_

	 > Declares a strong, but not absolute dependency on another package.
   In most cases this package should be installed unless the user has
   specific reasons not to install them.

 * _Suggest_

	 > Declares a package which would enhance the functionality of this
   package but which is not a requirement for the basic functionality
   of the package.

 * _Conflict_

	 > Names a package with which can't be installed alongside this
   package. The syntax is the same as for Require.  If a conflicting
   package is present on the system, an installer might offer an option
   of removing it or not installing this package.

 * _Subject_ \(DC\)

	 > The topic or content of the package expressed as a set of Keywords.
   At some future time, a set of canonical keywords may be established
   by a repository manager.

The following Dublin Core elements were not included in the standard
set above but may be used in a package description if appropriate.

 * _Publisher_

	 > An entity responsible for making the package available.

 * _Type_

	 > The nature or genre of the content of the resource. For a Tcl
   package the value of this element would be Software if the DCMI
   Type Vocabulary
   <http://au.dublincore.org/documents/2000/07/11/dcmi-type-vocabulary/> 
   was used.  A more useful set of types might be developed in the
   future for Tcl packages.

 * _Format_

	 > The physical or digital manifestation of the resource.  This might
   be used by archive maintainers to specify the format of a package
   archive, eg. zip, tar etc.

 * _Source_

	 > A Reference to a resource from which the present resource is derived.

 * _Language_

	 > A language of the intellectual content of the resource.  Could be
   used if multi-language packages are available. Should use the two
   letter language code defined by RFC 1766, eg. 'fr' for French, 'en'
   for English.

# Encoding of the Metadata

The primary means of storing RDF data is using XML but it can be stored
in many other formats.  This TIP prescribes a simple text based
encoding according to the RFC 2822 format which is described in this
section. Data stored in this format can be converted to XML format for
use by other tools, similarly XML formatted descriptions can be
converted into this text format without loss of information.

The text format description is stored in the file _DESCRIPTION.txt_.
The XML formatted version of the data may be stored in the file
_DESCRIPTION.rdf_ within the archive and may be automatically
generated if not present.

The general format of this file is that of a RFC 2822 mail message,
without body and using custom headers. The available headers are the
case-independent logical names from the preceding section but may be
augmented by other fields defined by repository maintainers or other
applications. The headers are allowed appear in any order.

Example:

	  Identifier: stemmer
	  Version: 1.0.0
	  Title: A stemmer for English.
	  Creator: Steve Cassidy <[email protected]>
	  Description:   Provides a procedure to remove any prefixes or suffixes on
	         a word to give the word stem. Uses Porter's algorithm to do this
	         in an intelligent manner with an accuracy of around 80%.
	  Rights: BSD
	  URL: http://www.shlrc.mq.edu.au/emu/tcl/
	  Available: 2001-08-16
	  Architecture: tcl
	  Subject: linguistics
	  Subject: text

# Combination Distributions

It is often useful to combine a number of related packages so that they
can be installed together to provide a certain kind of functionality,
for example, web page production tools or database access.  Perl uses
the term _Bundle_ to refer to such a group of related packages.
There are two alternative mechanisms for distribution of such a package
within the mechanisms suggested here.
Firstly, since a distribution may contain more than one package, the
set of files making up the various packages could be combined together
and described by a single DESCRIPTION.txt file.  This is similar to the
way that tcllib is currently distributed.  The disadvantage would be
that all of the Tcl files implementing these packages would have to
reside in the same directory which could cause name clashes.

The second alternative is to create a distribution consisting of only a
DESCRIPTION.txt file to describe which Requires the component packages
causing them to be installed from the repository. For example, tcllib
might be described as follows:

	  Identifier: tcllib
	  Version: 1.0.0
	  Title: The Standard Tcl Library
	  Description:  This package is intended to be a collection of Tcl
	             packages that provide utility functions useful to a large
	             collection of Tcl programmers.
	  Rights: BSD
	  URL: http://sourceforge.net/projects/tcllib
	  Contributor: Andreas Kupries  <andreas_kupries at users dot sourceforge dot net>
	  Contributor: Don Porter <dgp at users dot sourceforge dot net>
	  Require: base64
	  Require: cmdline
	  Require: csv
	  ...

Installing tcllib would cause the installer to fetch base64, cmdline,
csv etc from the repository and install them in order to satisfy the
tcllib requirement.  A new pkgIndex.tcl file could be constructed to
load all of these packages if _[package require tcllib]_ was called.

# Architecture

Possible values for $architecture in the directory structure include:

 * the value of
   _tcl\_platform\(platform\)_: windows, unix, macintosh

 * a composite of tcl\_platform
   values: _$tcl\_platform\(machine\)-$tcl\_platform\(os\)-$tcl\_platform\(osVersion\)_

 * a canonical system name as returned by
   _config.guess_: _i686-pc-linux-gnu_

# Installing Packages

A package structured according to this TIP can be installed using the following
steps:

  1. Download the package archive \(eg. zip file\)

  2. Locate a writable directory included on $auto\_path \(or ask for a
     installation directory\)

  3. Unpack the archive in the desired location.

  4. Run pkg\_mkIndex with appropriate arguments to generate a
     pkgIndex.tcl file if none is present. Arguments will include the
     appropriate Architecture directories for the platform.

  5. _\(optional\)_ link help files and demos to the central index.

# Alternatives

Alternatives might be considered for the package DESCRIPTION.txt file, for
the documentation directory and for the location of shared libraries.

An alternative for package description file is to include an
alternative package description, for example the XML based \`\`ppd_
format used to describe Perl packages on the ActiveState Perl package
repository. The main motivation for the simple format proposed is that
it is trivial for authors to write and trivial for programs to read and
can be transformed into standards based RDF XML.  The use of the DC
element names means that search engines etc. will be able to usefully
index the packages in a repository.

︙ ︙ 
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557

The alternative to having shared libraries in specific directories is
to have separate packages for each new platform. This has the
advantage of making the packages smaller and more closely correspond
to the existing directory structure of an installed package.  The
main motivation for the suggested directory structure is to allow
multi-platform packages or to facilitate multi-platform installations.

~ Supporting Tools

The standards outlined in this TIP should be supported by Tcl scripts
to:

 * Generate empty package templates for new projects.

 * Validate package directories or archive files.

 * Read and write the DESCRIPTION.txt file and provide a standard
   interface to the information it contains. Convert between RFC 2822
   and XML formats.

 * Install a package from an appropriately structured archive.

In addition, the TEA standard should be extended with a ''package''
makefile target which will act like the current ''install'' target but
which will copy files to a local directory and optionally build an
archive of the package for distribution.

~ Copyright

This document has been placed in the public domain.

|

|
|

|

>
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
The alternative to having shared libraries in specific directories is
to have separate packages for each new platform. This has the
advantage of making the packages smaller and more closely correspond
to the existing directory structure of an installed package.  The
main motivation for the suggested directory structure is to allow
multi-platform packages or to facilitate multi-platform installations.

# Supporting Tools

The standards outlined in this TIP should be supported by Tcl scripts
to:

 * Generate empty package templates for new projects.

 * Validate package directories or archive files.

 * Read and write the DESCRIPTION.txt file and provide a standard
   interface to the information it contains. Convert between RFC 2822
   and XML formats.

 * Install a package from an appropriately structured archive.

In addition, the TEA standard should be extended with a _package_
makefile target which will act like the current _install_ target but
which will copy files to a local directory and optionally build an
archive of the package for distribution.

# Copyright

This document has been placed in the public domain.

Name change from tip/56.tip to tip/56.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53

TIP:		56
Title:		Standardize Call Interface to Tcl_Eval* Functions
State:		Final
Type:		Project
Tcl-Version:	8.4
Vote:		Done
Post-History:	
Version:	$Revision: 1.4 $
Author:		Miguel Sofer <[email protected]>
Created:	28-Aug-2001

~ Abstract

This TIP replaces ''Tcl_EvalTokens'' with ''Tcl_EvalTokensStandard'',
which obeys the standard result management conventions for script
evaluation functions.

~ Rationale

The standard call interface for ''Tcl_Eval*'' functions returns a Tcl
completion code (TCL_OK, TCL_ERROR, TCL_RETURN, TCL_BREAK, or
TCL_CONTINUE), and sets a result object in the interpreter.  The
single exception is the function ''Tcl_EvalTokens'', that returns a
pointer to the result object, or a NULL when an exception occurs.
This effectively transforms all exceptions into errors.  This TIP
proposes to replace ''Tcl_EvalTokens'' with a new function
''Tcl_EvalTokensStandard'' that performs the same chores but adheres
to the standard call interface.

There are two arguments for the replacement of ''Tcl_EvalTokens'':

   * Present a consistent call interface to all ''Tcl_Eval*''
     functions.

   * Allow the return of non-error exceptional returns when evaluating
     tokens; the impossibility to do this is the cause of Bugs #455151
(https://sourceforge.net/tracker/index.php?func=detail&aid=455151&group_id=10894&atid=110894)
     and #219384
(https://sourceforge.net/tracker/index.php?func=detail&aid=219384&group_id=10894&atid=110894)

~ Proposed Change

The proposal is to deprecate the use of ''Tcl_EvalTokens'' and replace
it with a new ''Tcl_EvalTokensStandard''. The core should only use the
new function, the old one remains only for backward compatibility with
extensions.

The proposal is implemented in the patch included in [[Bug: 455151]]
https://sourceforge.net/tracker/index.php?func=detail&aid=455151&group_id=10894&atid=110894

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
>

|

|

|

|
|
|
|

|
|

|

|

|
|
|
|

|

|
|

|
|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53

# TIP 56: Standardize Call Interface to Tcl_Eval* Functions
	State:		Final
	Type:		Project
	Tcl-Version:	8.4
	Vote:		Done
	Post-History:	

	Author:		Miguel Sofer <[email protected]>
	Created:	28-Aug-2001
-----

# Abstract

This TIP replaces _Tcl\_EvalTokens_ with _Tcl\_EvalTokensStandard_,
which obeys the standard result management conventions for script
evaluation functions.

# Rationale

The standard call interface for _Tcl\_Eval\*_ functions returns a Tcl
completion code \(TCL\_OK, TCL\_ERROR, TCL\_RETURN, TCL\_BREAK, or
TCL\_CONTINUE\), and sets a result object in the interpreter.  The
single exception is the function _Tcl\_EvalTokens_, that returns a
pointer to the result object, or a NULL when an exception occurs.
This effectively transforms all exceptions into errors.  This TIP
proposes to replace _Tcl\_EvalTokens_ with a new function
_Tcl\_EvalTokensStandard_ that performs the same chores but adheres
to the standard call interface.

There are two arguments for the replacement of _Tcl\_EvalTokens_:

   * Present a consistent call interface to all _Tcl\_Eval\*_
     functions.

   * Allow the return of non-error exceptional returns when evaluating
     tokens; the impossibility to do this is the cause of Bugs \#455151
\(<https://sourceforge.net/tracker/index.php?func=detail&aid=455151&group\_id=10894&atid=110894\)>
     and \#219384
\(<https://sourceforge.net/tracker/index.php?func=detail&aid=219384&group\_id=10894&atid=110894\)>

# Proposed Change

The proposal is to deprecate the use of _Tcl\_EvalTokens_ and replace
it with a new _Tcl\_EvalTokensStandard_. The core should only use the
new function, the old one remains only for backward compatibility with
extensions.

The proposal is implemented in the patch included in [Bug: 455151]
<https://sourceforge.net/tracker/index.php?func=detail&aid=455151&group\_id=10894&atid=110894>

# Copyright

This document has been placed in the public domain.

Name change from tip/57.tip to tip/57.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88

TIP:            57
Title:          Move TclX's [lassign] into the Tcl Core
Version:        $Revision: 2.4 $
Author:         Donal K. Fellows <[email protected]>
Author:         Agnar Renolen <[email protected]>
Author:         Don Porter <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        30-Aug-2001
Post-History:   
Tcl-Version:    8.5

~ Abstract

This TIP proposes to move the ''lassign'' command from the TclX
extension into the Tcl core to make multiple assignment a much easier
process for people.

~Rationale

In many cases, a command needs to return more than one return value to
the caller.  For example, suppose that the statement:

| set coords [LocateFeature $featureID]

would set the variable "coords" to a list containing two elements "x"
and "y".  Assume that you need to set the "x" and "y" components
directly, you can do this today using the following statement:

| foreach {x y} [LocateFeature $featureID] {}

Now, this is not what the ''foreach'' command was designed for, and it
is not obvious at first glance from the source code what the statement
does.  Although it is quite useful for the purpose described in this
TIP, It would be more logical if the developer could write the
following:

| set {x y} [LocateFeature $featureID]

or

| mset {x y} [LocateFeature $featureID]

However, there is already a command in TclX for doing this kind of
operation: [lassign].  Given that many people already know TclX,
importing the command from there makes a great deal of sense.  It also
has the nice feature of returning those list items that were not
assigned, making it easy to strip a few words off the front of a list.
That sort of operation is useful when performing tasks like
command-line option parsing.

~Proposal

Define a new command in Tcl called [lassign] with the following syntax
(''$val'' indicates an argument that the caller would supply):

| lassign $listValue $varName ?$varName ...?

The command interprets its first argument as a list value and all
subsequent arguments as variable names.  The first item in the list
value (i.e. at index 0) will be assigned to the first variable named,
the second item in the list value will be assigned to the second
variable named (if present), etc.  When there are more variables than
list items, the remaining variables will be assigned the empty string.
The result of the command is a sublist of the input list-value that
contains only items that were not assigned to a variable; if all
values were assigned, the result is an empty list.

This is exactly the specification of the behaviour of the
correspondingly-named command in TclX.

~Notes

It should be possible to efficiently compile [lassign] in many cases,
which would make a tremendous difference in execution speed over not
only the TclX version of [lassign], but also over the [foreach]
"idiom", especially when assigning to variables that are not simple
local variables (the case which [foreach] compilation is optimized
for.)  For this reason, I'm not committing to implementing [lassign]
using the TclX code.

This TIP was substantially different in the past.  Please view the CVS
history for details.

~ Copyright

This document is placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88

# TIP 57: Move TclX's [lassign] into the Tcl Core

	Author:         Donal K. Fellows <[email protected]>
	Author:         Agnar Renolen <[email protected]>
	Author:         Don Porter <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        30-Aug-2001
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

This TIP proposes to move the _lassign_ command from the TclX
extension into the Tcl core to make multiple assignment a much easier
process for people.

# Rationale

In many cases, a command needs to return more than one return value to
the caller.  For example, suppose that the statement:

	 set coords [LocateFeature $featureID]

would set the variable "coords" to a list containing two elements "x"
and "y".  Assume that you need to set the "x" and "y" components
directly, you can do this today using the following statement:

	 foreach {x y} [LocateFeature $featureID] {}

Now, this is not what the _foreach_ command was designed for, and it
is not obvious at first glance from the source code what the statement
does.  Although it is quite useful for the purpose described in this
TIP, It would be more logical if the developer could write the
following:

	 set {x y} [LocateFeature $featureID]

or

	 mset {x y} [LocateFeature $featureID]

However, there is already a command in TclX for doing this kind of
operation: [lassign].  Given that many people already know TclX,
importing the command from there makes a great deal of sense.  It also
has the nice feature of returning those list items that were not
assigned, making it easy to strip a few words off the front of a list.
That sort of operation is useful when performing tasks like
command-line option parsing.

# Proposal

Define a new command in Tcl called [lassign] with the following syntax
\(_$val_ indicates an argument that the caller would supply\):

	 lassign $listValue $varName ?$varName ...?

The command interprets its first argument as a list value and all
subsequent arguments as variable names.  The first item in the list
value \(i.e. at index 0\) will be assigned to the first variable named,
the second item in the list value will be assigned to the second
variable named \(if present\), etc.  When there are more variables than
list items, the remaining variables will be assigned the empty string.
The result of the command is a sublist of the input list-value that
contains only items that were not assigned to a variable; if all
values were assigned, the result is an empty list.

This is exactly the specification of the behaviour of the
correspondingly-named command in TclX.

# Notes

It should be possible to efficiently compile [lassign] in many cases,
which would make a tremendous difference in execution speed over not
only the TclX version of [lassign], but also over the [foreach]
"idiom", especially when assigning to variables that are not simple
local variables \(the case which [foreach] compilation is optimized
for.\)  For this reason, I'm not committing to implementing [lassign]
using the TclX code.

This TIP was substantially different in the past.  Please view the CVS
history for details.

# Copyright

This document is placed in the public domain.

Name change from tip/58.tip to tip/58.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90

TIP:            58
Title:          Extend [set] to Assign Multiple Values to Multiple Variables
Version:        $Revision: 1.6 $
Author:         Anselm Lingnau <[email protected]>
State:          Rejected
Type:           Project
Vote:           Done
Created:        02-Sep-2001
Post-History:   
Tcl-Version:    8.5

~ Abstract

This TIP proposes a multiple assignment command as a
backwards-compatible extension to the Tcl ''set'' command.

~ Introduction

Often one needs to assign values to several variables in close
proximity.  Right now several ''set'' commands are necessary:

|  set a 123
|  set b 456

or

|  set a 123; set b 456

Or one abuses the ''foreach'' command:

|  foreach {a b} {123 456} break

However, by analogy to the ''variable'' and ''array set'' commands,
the following would be useful:

|  set a 123 b 456

This would assign 123 to the variable ''a'' and 456 to the variable
''b''.

Note that this extension is backwards-compatible to existing uses of
the ''set'' command since until now only one or two arguments to
''set'' were allowed.

~ Specification

The ''set'' command is extended to allow either one or an even number
of arguments.  The behaviour in the case of one argument remains the
one documented in the ''set'' manual page; when an even number of
arguments is specified, the behaviour of ''set v0 e0 ... vn en'' is
identical to that of the sequence of commands ''set v0 e0; ...; set vn
en'' according to the traditional semantics, except that the way
Tcl processes commands means that ''e0'' ... ''en'' are all
evaluated before any assignments are performed. I.e., the commands

|  set a 1
|  set a 2 b $a
|  puts $b

print ''1'', not ''2''. If this is an issue you must use separate
''set'' statements.

The command ''set v0 e0 ... vn en'' returns the value of ''en''.

~ Rationale

This extension is an obvious analogy to the ''variable'' and ''array
set'' commands of Tcl, both of which allow an alternating list of
names and expressions to be given as arguments.  It is completely
backwards-compatible (''set'' invocations with more than two arguments
used to be syntax errors) and very easily implemented.

This extension in no way prejudices against the adoption and use of
other multiple-assignment commands, such as ''lassign'' (see [57]).
In particular, the ''set'' extension is unsuitable for assigning a
list result to a number of variables element by element.  However, its
simplicity and consistency to other similar Tcl commands is appealing.

~ Reference Implementation

A patch to Tcl 8.4a3 which implements the ''set'' extension may be
found at http://anselm.our-isp.org/set-patch.diff - a patched Tcl
8.4a3 passes the Tcl 8.4a3 regression test suite with no test
failures.  No test cases nor documentation for the ''set'' extensions
have been devised yet but this is easy to do once there is a consensus
that this feature is actually desirable.

~ Copyright

This document is placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|

|

|

|

|

|
|

|
|

|

|

|
|
|
|
|

|
|
|

|
|

|

|

|
|

|
|

|
|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90

# TIP 58: Extend [set] to Assign Multiple Values to Multiple Variables

	Author:         Anselm Lingnau <[email protected]>
	State:          Rejected
	Type:           Project
	Vote:           Done
	Created:        02-Sep-2001
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

This TIP proposes a multiple assignment command as a
backwards-compatible extension to the Tcl _set_ command.

# Introduction

Often one needs to assign values to several variables in close
proximity.  Right now several _set_ commands are necessary:

	  set a 123
	  set b 456

or

	  set a 123; set b 456

Or one abuses the _foreach_ command:

	  foreach {a b} {123 456} break

However, by analogy to the _variable_ and _array set_ commands,
the following would be useful:

	  set a 123 b 456

This would assign 123 to the variable _a_ and 456 to the variable
_b_.

Note that this extension is backwards-compatible to existing uses of
the _set_ command since until now only one or two arguments to
_set_ were allowed.

# Specification

The _set_ command is extended to allow either one or an even number
of arguments.  The behaviour in the case of one argument remains the
one documented in the _set_ manual page; when an even number of
arguments is specified, the behaviour of _set v0 e0 ... vn en_ is
identical to that of the sequence of commands _set v0 e0; ...; set vn
en_ according to the traditional semantics, except that the way
Tcl processes commands means that _e0_ ... _en_ are all
evaluated before any assignments are performed. I.e., the commands

	  set a 1
	  set a 2 b $a
	  puts $b

print _1_, not _2_. If this is an issue you must use separate
_set_ statements.

The command _set v0 e0 ... vn en_ returns the value of _en_.

# Rationale

This extension is an obvious analogy to the _variable_ and _array
set_ commands of Tcl, both of which allow an alternating list of
names and expressions to be given as arguments.  It is completely
backwards-compatible \(_set_ invocations with more than two arguments
used to be syntax errors\) and very easily implemented.

This extension in no way prejudices against the adoption and use of
other multiple-assignment commands, such as _lassign_ \(see [[57]](57.md)\).
In particular, the _set_ extension is unsuitable for assigning a
list result to a number of variables element by element.  However, its
simplicity and consistency to other similar Tcl commands is appealing.

# Reference Implementation

A patch to Tcl 8.4a3 which implements the _set_ extension may be
found at <http://anselm.our-isp.org/set-patch.diff> - a patched Tcl
8.4a3 passes the Tcl 8.4a3 regression test suite with no test
failures.  No test cases nor documentation for the _set_ extensions
have been devised yet but this is easy to do once there is a consensus
that this feature is actually desirable.

# Copyright

This document is placed in the public domain.

Name change from tip/59.tip to tip/59.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103

104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352

TIP:            59
Title:          Embed Build Information in Tcl Binary Library
Version:        $Revision: 1.16 $
Author:         Andreas Kupries <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        04-Sep-2001
Post-History:   
Tcl-Version:    8.5

~ Abstract

This TIP provides an interface through which Tcl may be queried for
information on its own configuration, in order to
extract the information directly instead of reading it from a Bourne
shell file.  An important reason to do this is to have the information
not only available but also tightly bound to the binary configured by
it, so that the information doesn't get lost.

~ Foreword

This TIP proposes a rather small change to Tcl and tries very hard to
follow the KISS principle. Given that the casual observer might find
it rather long, be assured, the actual specification in here is not
very long, nor complicated. Most of the following explanations were
added to preserve the KISS principle and head off attempts to extend
the TIP beyond its small goal and scope.

Note: All instances of "Tcl library" in the following text refer to
the generated installable library and not the script library coming
with the core.

~ Background and Rationale

The main reason for writing this TIP are the disadvantages inherent in
the current way of storing the configuration of Tcl, namely in the file
''tclConfig.sh''.

   * It is a separate file, easily lost or not installed at all,
     making it difficult for extension developers to access this
     information.

   > Note: The non-installation of development files like ''tclConfig.sh''
     might even be required by through vendor policies and such and
     thus not under the control of the package author or builder.

   * The name does not convey that ''tclConfig.sh'' contains platform
     and build specific information. When installing different builds
     this usually leads to clashes. This makes it again difficult for
     extension developers to find the right file for their current build.

   * Not every extension generates such a file for use by other
     extensions.

Thus, this TIP proposes:

   * an extension of the public API so that extensions are able to
     define configuration introspection commands and to declare the
     returned information during initialization with the information
     embedded into their installable libraries during compilation.

   * to embed the information about the configuration of the
     Tcl library as strings into the generated installable library and
     make them accessible at the script level through Tcl variables,
     thus allowing developers on any platform Tcl compiles on to
     access this information.

The file ''tclConfig.sh'' is ''not'' replaced by this system, both
sets of information exist in parallel.

Neither is the variable ''tcl_platform'' replaced. This means that some information, like ''threaded'', is held redundantly. Other information in ''tcl_platform'', like ''user'' is runtime and not configuration information. The operating system information is important to a build system, but this is out of the scope of this TIP.

~ Interface Specification

Any embedded information is made accessible at the Tcl level through a
new command. The name of the command used by Tcl itself is
''::tcl::pkgconfig''. Extensions have to use their own commands. These
commands will be named ''pkgconfig'' too and have to be placed within in
the namespaces owned by the extensions initializing them.

At the C-level the public API of the Tcl core is extended with a
single function to register the embedded configuration information.
This function is added to the public stub table of the Tcl core so
that it can be used by Tcl and extensions to register their own
configuration information in the system during initialization.

The function takes three (3) arguments; first, the name of the package
registering its configuration information, second, a pointer to an
array of structures, and third a string declaring the encoding used by
the configuration values.  Each element of the array refers to two
strings containing the key and the value associated with that key. The
end of the array is signaled by an empty key.

Formalized, name and signature of this new function are

| Tcl_RegisterConfig (CONST char* pkgName, Config* configuration, CONST char* valEncoding)
|
| typedef struct Config {
|    char* key;
|    char* value;
| }

The string ''valEncoding'' contains the name of an encoding known to
Tcl. All these names are use only characters in the ASCII subset of
UTF-8 and are thus implicity in the UTF-8 encoding. It is expected
that keys are legible english text and therefore using the ASCII
subset of UTF-8. In other words, they are expected to be in UTF-8
too. The values associated with the keys can be any string
however. For these the contents of ''valEncoding'' define which
encoding was used to represent the characters of the strings.

During compile time the value of ''valEncoding'' is specified as a
makefile variable for non-''configure'' based build systems and
through the new option ''--with-encoding=FOO'' of configure
otherwise. The default value is ''iso8859-1''.

This approach gives us all what we desire with not too many drawbacks.

   * The default case (no special characters) requires no action on
     part of the builder at all.

   * The non-default case (path containing special characters like
     Kanji) is supported.

   * Cross-compilation is unimpeded and no more complex than normal
     compilation.

   * The requirement for conversion of strings is a drawback, but
     should not have a big impact on performance. It has no impact on
     the performance of scripts which do not use the embedded
     information. The impact is even more negligigble if the result of
     the conversion is cached.

The function will

   * create a namespace having the provided ''pkgName'', if not yet existing.

   * create the command ''pkgconfig'' in that namespace and link it
     to the provided information so that the keys from
     ''configuration'' and their associated values can be retrieved through
     calls to ''pkgconfig''.

The command ''pkgconfig'' will provide two subcommands, ''list'' and
''get''. The first subcommand, ''list'' takes no arguments and returns
a list containing the names of the defined keys. The second subcommand
takes one argument, the name of a key and returns the string
associated with that key.

~ How to Gather the Embedded Information

The information to be embedded is gathered in a platform-specific way
and written into the file ''generic/tclPkgConfig.c''. The different
platforms may employ platform specific intermediate files to hold the
information, but in the compilation phase only ''tclPkgConfig.c'' will
be used.

   * Under unix it is determined primarily by the existing
     ''configure'' script. The configuration information coming
     from the Makefile, or from other compile time means, is
     embedded into the ''tclConfig.c'' file by means of
    preprocessor statements (#ifdef ... #endif).

   * For the Windows and Mac platforms volunteers may have to create
     files ''tclWinConfig.c.some-ext'' containing this information for
     each supported build environment, like VC++, Borland, Cygwin,
     etc.

   > ''tclWinConfig.c.vc'' = VC++.

   > ''tclWinConfig.c.bc'' = Borland.

   > ''tclWinConfig.c.in'' = Cygwin. ''.in'' is used because Cygwin
     can use configure to determine the values and embed them into a
     template.

   * As for other platforms, these are handled either like Unix or
     like Mac, depending on the availability and usability of
     ''configure''.

   > Volunteers are required to write the appropriate files for their
     build environment.

~ Specification of Tcl Configuration Information

The configuration information registered by Tcl itself is specified
here. A discussion of the choices made here follows in the next
section. Please read this discussion before commenting on the
specification.

The values associated with the keys below are all of one of the
following types:

   * Boolean flag. Allowed values are all the values which are
     accepted by Tcl itself. Examples are:

|       true, false, on, off, 1, 0

   * String. General container for all other information.

   * Templated string. With respect to placeholders in the same format
     as 'Script' below, but does not have to be a valid Tcl script.

   * Script. A string containing a full Tcl script. The user should
     handle this string like a procedure body. The script is allowed
     to contain placeholders to be filled by the user of the string.
     Placeholders follow the syntax of full-braced Tcl variables,
     i.e. ''${some_name}'. The actual values can be filled in by the
     user of the configuration information. Possible ways to do so are
     [regsub], [string map] or [subst -nocommand]. The best way
     however would be to use the script as a procedure body, with the
     placeholders as the arguments of the procedure. This will avoid
     many problems regarding bracing and the protection of special
     characters.

   > Which placeholders are possible for a particular script or
     template is described together with the meaning of the key.

   > Beyond the placeholders a script or templated string is allowed
     to contain references to other keys returned by the ''config''
     command. These references use the same variable syntax as the
     placeholders.

The registered keys follow below. They will be always present with
some value. Non-boolean keys not applicable to a particular platform
will contain the empty string as their value.

   * Configuration of Tcl itself:

   >   * ''debug''. Boolean flag. Set to false if Tcl was not compiled
         to contain debugging information.

   >   * ''threaded''. Boolean flag. Set to false if Tcl was not
         compiled as thread-enabled.

   >   * ''profiled''. Boolean flag. Set to false if Tcl was not
         compiled to contain profiling statements.

   >   * ''64bit''. Boolean flag. Set to false if Tcl was not compiled
         in 64bit mode.

   >   * ''optimized''. Boolean flag. Set to false if Tcl was compiled
         without compiler optimizations.

   >   * ''mem_debug''. Boolean flag. Set to false if Tcl has no
         memory debugging compiled into it.

   >   * ''compile_debug''. Boolean flag. Set to false if Tcl has no
         bytecode compiler debugging compiled in.

   >   * ''compile_stats''. Boolean flag. Set to false if Tcl has no
         bytecode compiler statistics compiled in.

   * Installation configuration of Tcl. In other words, various
     important locations.

   >   * ''prefix,runtime''.  String. The directory for platform
         independent files as seen by the interpreter during runtime.

   >   * ''exec_prefix,runtime'' String. The directory for platform
         dependent files as seen by the interpreter during runtime.

   >   * ''prefix,install''. String. The directory for platform
         independent files as seen by the installer at install-time.

   >   * ''exec_prefix,install''. String. The directory for platform
         dependent files as seen by the installer at install-time.

~ Discussion

The placement of this information into a separate package was proposed
but rejected because of the trouble of finding the right information
for the right library in the case of multiple configurations installed
into the same directory space.  Embedding into the library does not
cost much space and binds the information tightly to the right spot.

Another reason to do it this way is that this enables us to embed
information coming from the Makefile itself (like ''MEM_DEBUG'') or
from other compile time means. This would not be possible for a file
generated solely by the Tcl configure. It would also restrict the
embedding to the platforms which allow the use of ''configure''
script.

The usage of a separate package to just access the information placed
into the Tcl library was also proposed. This was rejected too, due to
the overhead for the management of the package in comparison to the
small size of the code actually involved.

Another proposal rejected in the early discussions was to have this
TIP define an entire build system based upon Tcl. This TIP is
certainly a step in this direction and facilitates the building of
such a build system (sic!). Still, specifying such here was seen as
too large a step right now, with too many issues to be solved and thus
delaying the implementation of this TIP.

Only the configuration of the particular variant of the Tcl library or
extension which was generated is recorded in the library. No attempt
is made to record the information required to allow the compilation of
any possible variant of an extension. Doing so would reach again into
the bigger topic of specifying a full build system. We've already
established that as being out of the intended scope of this TIP.

Note further that the scheme as specified above does not prevent us
from adding the full information in a later stage. In other words, it
does not restrict the development of a more powerful system in the
future.

This should be enough reasoning to allow the acceptance of even this
admittedly simple system.

The configuration information registered by Tcl is currently a very
small subset of the information in ''tclConfig.sh''. A future TIP is
planned to provide the missing information in a regular and generalized
manner.

If an extension requires more information than provided by the Tcl
configuration it will have to obtain this information itself. For
instance, TclBlend requires a CLASSPATH, the name of a Java compiler,
etc. whereas the TclPython and TclPerl extensions require paths to
those environments, etc. It is not reasonable that the configure
script for Tcl itself have to accommodate all requirements of all
extensions of Tcl.  Instead, the configure scripts or whatever other
means is used to obtain the configuration information for the
extensions should reflect their needs, and register the requirements
gathered into their own configuration command. Note that an extension
is only expected to create variables for information unique to
it. Everything else can be had from the configuration command of Tcl
and the extensions it depends on.

This TIP is not in opposition to [34] but rather fleshes out one of
the many details in the specification which were left open by that
TIP.

This TIP also does not propose to change the process for building Tcl
itself. The goal is rather to make the building of extensions easier
in the future.

A naming convention for keys returned by the ''config'' command would
have been possible but would also require quite a lot more text, both
in careful definition of the general categories and in explanations of
the choices made.

~ Implementation

Work on implementing this feature is tracked at Tcl Patch 507083 at
the Tcl project at SourceForge.  Implementation effort also takes
place on the tip-59-implementation branch in the Tcl CVS repository
(see [31]).

~ Copyright

This document is in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|
|

|

|
|
|
|
|
<
|
>
|

|

|
|
|
|

|

|
|

|

|

|
|

|
|

|

|

|

|

|
|

|
|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352

# TIP 59: Embed Build Information in Tcl Binary Library

	Author:         Andreas Kupries <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        04-Sep-2001
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

This TIP provides an interface through which Tcl may be queried for
information on its own configuration, in order to
extract the information directly instead of reading it from a Bourne
shell file.  An important reason to do this is to have the information
not only available but also tightly bound to the binary configured by
it, so that the information doesn't get lost.

# Foreword

This TIP proposes a rather small change to Tcl and tries very hard to
follow the KISS principle. Given that the casual observer might find
it rather long, be assured, the actual specification in here is not
very long, nor complicated. Most of the following explanations were
added to preserve the KISS principle and head off attempts to extend
the TIP beyond its small goal and scope.

Note: All instances of "Tcl library" in the following text refer to
the generated installable library and not the script library coming
with the core.

# Background and Rationale

The main reason for writing this TIP are the disadvantages inherent in
the current way of storing the configuration of Tcl, namely in the file
_tclConfig.sh_.

   * It is a separate file, easily lost or not installed at all,
     making it difficult for extension developers to access this
     information.

	   > Note: The non-installation of development files like _tclConfig.sh_
     might even be required by through vendor policies and such and
     thus not under the control of the package author or builder.

   * The name does not convey that _tclConfig.sh_ contains platform
     and build specific information. When installing different builds
     this usually leads to clashes. This makes it again difficult for
     extension developers to find the right file for their current build.

   * Not every extension generates such a file for use by other
     extensions.

Thus, this TIP proposes:

   * an extension of the public API so that extensions are able to
     define configuration introspection commands and to declare the
     returned information during initialization with the information
     embedded into their installable libraries during compilation.

   * to embed the information about the configuration of the
     Tcl library as strings into the generated installable library and
     make them accessible at the script level through Tcl variables,
     thus allowing developers on any platform Tcl compiles on to
     access this information.

The file _tclConfig.sh_ is _not_ replaced by this system, both
sets of information exist in parallel.

Neither is the variable _tcl\_platform_ replaced. This means that some information, like _threaded_, is held redundantly. Other information in _tcl\_platform_, like _user_ is runtime and not configuration information. The operating system information is important to a build system, but this is out of the scope of this TIP.

# Interface Specification

Any embedded information is made accessible at the Tcl level through a
new command. The name of the command used by Tcl itself is
_::tcl::pkgconfig_. Extensions have to use their own commands. These
commands will be named _pkgconfig_ too and have to be placed within in
the namespaces owned by the extensions initializing them.

At the C-level the public API of the Tcl core is extended with a
single function to register the embedded configuration information.
This function is added to the public stub table of the Tcl core so
that it can be used by Tcl and extensions to register their own
configuration information in the system during initialization.

The function takes three \(3\) arguments; first, the name of the package
registering its configuration information, second, a pointer to an
array of structures, and third a string declaring the encoding used by
the configuration values.  Each element of the array refers to two
strings containing the key and the value associated with that key. The
end of the array is signaled by an empty key.

Formalized, name and signature of this new function are

	 Tcl_RegisterConfig (CONST char* pkgName, Config* configuration, CONST char* valEncoding)

	 typedef struct Config {
	    char* key;
	    char* value;

	 }

The string _valEncoding_ contains the name of an encoding known to
Tcl. All these names are use only characters in the ASCII subset of
UTF-8 and are thus implicity in the UTF-8 encoding. It is expected
that keys are legible english text and therefore using the ASCII
subset of UTF-8. In other words, they are expected to be in UTF-8
too. The values associated with the keys can be any string
however. For these the contents of _valEncoding_ define which
encoding was used to represent the characters of the strings.

During compile time the value of _valEncoding_ is specified as a
makefile variable for non-_configure_ based build systems and
through the new option _--with-encoding=FOO_ of configure
otherwise. The default value is _iso8859-1_.

This approach gives us all what we desire with not too many drawbacks.

   * The default case \(no special characters\) requires no action on
     part of the builder at all.

   * The non-default case \(path containing special characters like
     Kanji\) is supported.

   * Cross-compilation is unimpeded and no more complex than normal
     compilation.

   * The requirement for conversion of strings is a drawback, but
     should not have a big impact on performance. It has no impact on
     the performance of scripts which do not use the embedded
     information. The impact is even more negligigble if the result of
     the conversion is cached.

The function will

   * create a namespace having the provided _pkgName_, if not yet existing.

   * create the command _pkgconfig_ in that namespace and link it
     to the provided information so that the keys from
     _configuration_ and their associated values can be retrieved through
     calls to _pkgconfig_.

The command _pkgconfig_ will provide two subcommands, _list_ and
_get_. The first subcommand, _list_ takes no arguments and returns
a list containing the names of the defined keys. The second subcommand
takes one argument, the name of a key and returns the string
associated with that key.

# How to Gather the Embedded Information

The information to be embedded is gathered in a platform-specific way
and written into the file _generic/tclPkgConfig.c_. The different
platforms may employ platform specific intermediate files to hold the
information, but in the compilation phase only _tclPkgConfig.c_ will
be used.

   * Under unix it is determined primarily by the existing
     _configure_ script. The configuration information coming
     from the Makefile, or from other compile time means, is
     embedded into the _tclConfig.c_ file by means of
    preprocessor statements \(\#ifdef ... \#endif\).

   * For the Windows and Mac platforms volunteers may have to create
     files _tclWinConfig.c.some-ext_ containing this information for
     each supported build environment, like VC\+\+, Borland, Cygwin,
     etc.

	   > _tclWinConfig.c.vc_ = VC\+\+.

	   > _tclWinConfig.c.bc_ = Borland.

	   > _tclWinConfig.c.in_ = Cygwin. _.in_ is used because Cygwin
     can use configure to determine the values and embed them into a
     template.

   * As for other platforms, these are handled either like Unix or
     like Mac, depending on the availability and usability of
     _configure_.

	   > Volunteers are required to write the appropriate files for their
     build environment.

# Specification of Tcl Configuration Information

The configuration information registered by Tcl itself is specified
here. A discussion of the choices made here follows in the next
section. Please read this discussion before commenting on the
specification.

The values associated with the keys below are all of one of the
following types:

   * Boolean flag. Allowed values are all the values which are
     accepted by Tcl itself. Examples are:

		       true, false, on, off, 1, 0

   * String. General container for all other information.

   * Templated string. With respect to placeholders in the same format
     as 'Script' below, but does not have to be a valid Tcl script.

   * Script. A string containing a full Tcl script. The user should
     handle this string like a procedure body. The script is allowed
     to contain placeholders to be filled by the user of the string.
     Placeholders follow the syntax of full-braced Tcl variables,
     i.e. _$\{some\_name\}'. The actual values can be filled in by the
     user of the configuration information. Possible ways to do so are
     [regsub], [string map] or [subst -nocommand]. The best way
     however would be to use the script as a procedure body, with the
     placeholders as the arguments of the procedure. This will avoid
     many problems regarding bracing and the protection of special
     characters.

	   > Which placeholders are possible for a particular script or
     template is described together with the meaning of the key.

	   > Beyond the placeholders a script or templated string is allowed
     to contain references to other keys returned by the _config_
     command. These references use the same variable syntax as the
     placeholders.

The registered keys follow below. They will be always present with
some value. Non-boolean keys not applicable to a particular platform
will contain the empty string as their value.

   * Configuration of Tcl itself:

	   >   \* _debug_. Boolean flag. Set to false if Tcl was not compiled
         to contain debugging information.

	   >   \* _threaded_. Boolean flag. Set to false if Tcl was not
         compiled as thread-enabled.

	   >   \* _profiled_. Boolean flag. Set to false if Tcl was not
         compiled to contain profiling statements.

	   >   \* _64bit_. Boolean flag. Set to false if Tcl was not compiled
         in 64bit mode.

	   >   \* _optimized_. Boolean flag. Set to false if Tcl was compiled
         without compiler optimizations.

	   >   \* _mem\_debug_. Boolean flag. Set to false if Tcl has no
         memory debugging compiled into it.

	   >   \* _compile\_debug_. Boolean flag. Set to false if Tcl has no
         bytecode compiler debugging compiled in.

	   >   \* _compile\_stats_. Boolean flag. Set to false if Tcl has no
         bytecode compiler statistics compiled in.

   * Installation configuration of Tcl. In other words, various
     important locations.

	   >   \* _prefix,runtime_.  String. The directory for platform
         independent files as seen by the interpreter during runtime.

	   >   \* _exec\_prefix,runtime_ String. The directory for platform
         dependent files as seen by the interpreter during runtime.

	   >   \* _prefix,install_. String. The directory for platform
         independent files as seen by the installer at install-time.

	   >   \* _exec\_prefix,install_. String. The directory for platform
         dependent files as seen by the installer at install-time.

# Discussion

The placement of this information into a separate package was proposed
but rejected because of the trouble of finding the right information
for the right library in the case of multiple configurations installed
into the same directory space.  Embedding into the library does not
cost much space and binds the information tightly to the right spot.

Another reason to do it this way is that this enables us to embed
information coming from the Makefile itself \(like _MEM\_DEBUG_\) or
from other compile time means. This would not be possible for a file
generated solely by the Tcl configure. It would also restrict the
embedding to the platforms which allow the use of _configure_
script.

The usage of a separate package to just access the information placed
into the Tcl library was also proposed. This was rejected too, due to
the overhead for the management of the package in comparison to the
small size of the code actually involved.

Another proposal rejected in the early discussions was to have this
TIP define an entire build system based upon Tcl. This TIP is
certainly a step in this direction and facilitates the building of
such a build system \(sic!\). Still, specifying such here was seen as
too large a step right now, with too many issues to be solved and thus
delaying the implementation of this TIP.

Only the configuration of the particular variant of the Tcl library or
extension which was generated is recorded in the library. No attempt
is made to record the information required to allow the compilation of
any possible variant of an extension. Doing so would reach again into
the bigger topic of specifying a full build system. We've already
established that as being out of the intended scope of this TIP.

Note further that the scheme as specified above does not prevent us
from adding the full information in a later stage. In other words, it
does not restrict the development of a more powerful system in the
future.

This should be enough reasoning to allow the acceptance of even this
admittedly simple system.

The configuration information registered by Tcl is currently a very
small subset of the information in _tclConfig.sh_. A future TIP is
planned to provide the missing information in a regular and generalized
manner.

If an extension requires more information than provided by the Tcl
configuration it will have to obtain this information itself. For
instance, TclBlend requires a CLASSPATH, the name of a Java compiler,
etc. whereas the TclPython and TclPerl extensions require paths to
those environments, etc. It is not reasonable that the configure
script for Tcl itself have to accommodate all requirements of all
extensions of Tcl.  Instead, the configure scripts or whatever other
means is used to obtain the configuration information for the
extensions should reflect their needs, and register the requirements
gathered into their own configuration command. Note that an extension
is only expected to create variables for information unique to
it. Everything else can be had from the configuration command of Tcl
and the extensions it depends on.

This TIP is not in opposition to [[34]](34.md) but rather fleshes out one of
the many details in the specification which were left open by that
TIP.

This TIP also does not propose to change the process for building Tcl
itself. The goal is rather to make the building of extensions easier
in the future.

A naming convention for keys returned by the _config_ command would
have been possible but would also require quite a lot more text, both
in careful definition of the general categories and in explanations of
the choices made.

# Implementation

Work on implementing this feature is tracked at Tcl Patch 507083 at
the Tcl project at SourceForge.  Implementation effort also takes
place on the tip-59-implementation branch in the Tcl CVS repository
\(see [[31]](31.md)\).

# Copyright

This document is in the public domain.

Name change from tip/6.tip to tip/6.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99

TIP:            6
Title:          Include [Incr Tcl] in the Core Tcl distribution
Version:        $Revision: 1.6 $
Author:         Mark Harrison <[email protected]>
State:          Rejected
Type:           Project
Vote:           Done
Created:        16-Oct-2000
Post-History:   
Tcl-Version:    8.4.0

~ Abstract

Include [[Incr Tcl]] in the Core Tcl distribution.

~ Proposal

[[incr Tcl]] [http://tcltk.com/itcl/] shall be included in the core
Tcl distribution.  It shall be included in the Tcl source tree, and
built as part of the standard Tcl distribution.

Specific items:

 *  "itclsh" will not be included

 *  "itcl_*" commands will not be included

 *  everything will move from ::itcl to ::

 *  the "find" subcommands will be reintegrated into "info"

~ Rationale

The lack of a standard object and data abstraction system continues to
hinder Tcl development.

  > "Lets face it, not including any sort of OO system is one of
    the major failings of Tcl. Indexing into global arrays is
    a sad hack when compared to a real OO system."
           ''- Mo DeJong <[email protected]>''

Earlier this year, it seemed that it would finally be included in Tcl
8.4, but that plan was rescinded.

Note that this is distinct from the "batteries included" (BI)
distribution, and is not intended to be a model for building the BI
distribution.  It is a special case for inclusion in the core tcl
command set, since a "class" command is a fundamental language
construct.

~ Alternatives

Include [[incr Tcl]] in a "batteries included" (BI) distribution.

Many people will not opt for the BI distribution ([4]) due to its
larger size.  It is quite likely that (for example) a Linux
distribution my include Tcl as a standard component, but place the BI
on a supplemental disk.

~ Objections

''I don't want any object system included!''

You can delete the [[incr Tcl]] library with no harm to your code.

''John Ousterhout hates objects!''

This misconception is primarily due to a misreading of one of his
papers.  A better summary of his position is that "scripting is a
better solution than objects in many cases."  John Ousterhout has told
me that he will not stand in the way of adding object-oriented
programming to Tcl.

''[[incr Tcl]]'s object model is not good!''

[[incr Tcl]] supports the same object model as C++ and Java.  Many
programmers are familiar with this model and accept it as a good
model.

''The CLOS object model is better!''

Quoting John Ousterhout, "People vote with their feet".  For whatever
reason, slot-based systems failed to gain as much popularity as
C++/Java-like systems.

''There are many Tcl object systems to choose from!''

None are even a fraction as long-lived, popular, or well-supported as
[[incr Tcl]].

~ Special Provisions

Since [[incr Tcl]] still exists as a separately named entity, this TIP
shall not be construed as relieving any individual from the
responsibility of providing appropriate [[incr Apparel]].

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99

# TIP 6: Include [Incr Tcl] in the Core Tcl distribution

	Author:         Mark Harrison <[email protected]>
	State:          Rejected
	Type:           Project
	Vote:           Done
	Created:        16-Oct-2000
	Post-History:   
	Tcl-Version:    8.4.0
-----

# Abstract

Include [Incr Tcl] in the Core Tcl distribution.

# Proposal

[incr Tcl] <http://tcltk.com/itcl/>  shall be included in the core
Tcl distribution.  It shall be included in the Tcl source tree, and
built as part of the standard Tcl distribution.

Specific items:

 *  "itclsh" will not be included

 *  "itcl\_\*" commands will not be included

 *  everything will move from ::itcl to ::

 *  the "find" subcommands will be reintegrated into "info"

# Rationale

The lack of a standard object and data abstraction system continues to
hinder Tcl development.

  > "Lets face it, not including any sort of OO system is one of
    the major failings of Tcl. Indexing into global arrays is
    a sad hack when compared to a real OO system."
           _- Mo DeJong <[email protected]>_

Earlier this year, it seemed that it would finally be included in Tcl
8.4, but that plan was rescinded.

Note that this is distinct from the "batteries included" \(BI\)
distribution, and is not intended to be a model for building the BI
distribution.  It is a special case for inclusion in the core tcl
command set, since a "class" command is a fundamental language
construct.

# Alternatives

Include [incr Tcl] in a "batteries included" \(BI\) distribution.

Many people will not opt for the BI distribution \([[4]](4.md)\) due to its
larger size.  It is quite likely that \(for example\) a Linux
distribution my include Tcl as a standard component, but place the BI
on a supplemental disk.

# Objections

_I don't want any object system included!_

You can delete the [incr Tcl] library with no harm to your code.

_John Ousterhout hates objects!_

This misconception is primarily due to a misreading of one of his
papers.  A better summary of his position is that "scripting is a
better solution than objects in many cases."  John Ousterhout has told
me that he will not stand in the way of adding object-oriented
programming to Tcl.

_[incr Tcl]'s object model is not good!_

[incr Tcl] supports the same object model as C\+\+ and Java.  Many
programmers are familiar with this model and accept it as a good
model.

_The CLOS object model is better!_

Quoting John Ousterhout, "People vote with their feet".  For whatever
reason, slot-based systems failed to gain as much popularity as
C\+\+/Java-like systems.

_There are many Tcl object systems to choose from!_

None are even a fraction as long-lived, popular, or well-supported as
[incr Tcl].

# Special Provisions

Since [incr Tcl] still exists as a separately named entity, this TIP
shall not be construed as relieving any individual from the
responsibility of providing appropriate [incr Apparel].

# Copyright

This document has been placed in the public domain.

Name change from tip/60.tip to tip/60.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102

103
104
105
106

107
108
109
110
111
112
113
114
115
116

117
118
119
120

121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146

147
148
149
150

151
152
153
154
155
156
157
158
159
160

161
162
163
164

165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187

TIP:            60
Title:          EXTERN Macro Change to Support a Wider Set of Attributes
Version:        $Revision: 1.21 $
Author:         David Gravereaux <[email protected]>
Author:         Donal K. Fellows <[email protected]>
State:          Rejected
Type:           Project
Vote:           Done
Created:        06-Sep-2001
Post-History:   
Tcl-Version:    8.6

~ Abstract

This TIP proposes a change to how the EXTERN macro in ''tcl.h'' works
to support a wider range of compiler specific attributes.

~ Rationale

With working on Borland support recently, I found that luckily the
newest "free commandline tools"
[http://www.borland.com/bcppbuilder/freecompiler/] does support
Microsoft's ''__declspec(dllexport)'' attribute.  But at the same
time, the older way with ''__export'' is still valid, but can't be
used due to the order within the prototype declaration of the EXTERN
macro.

What's this with the MS compiler:

|	__declspec(dllexport) __cdecl int func (int a, int b);

will have to be this with Borland:

|	int __export __cdecl func (int a, int b);

The order of the attribute needs to be after the return type.

Even though ''__declspec'' is supported in the Microsoft style with
version 5.5+ of the Borland compiler, if EXTERN could swap around the
order a hair, old Turbo C v5.0 has a better chance to make a DOS
library.  Should someone feel the need.

Let's leave the existing EXTERN macro as-is and just make a new one called TCL_EXTERN to support the new behavior.

Karl Lembuaer (sp?) did a presentation @ OSCON regarding his recent
tinytcl project ''%TODO: add link here%'' about his DOS port of Tcl
6.7 for use in a hand-held device.

Stepping backward for DOS support, may actually be a leap forward in
an off-beat manner...

~ Rejected Alternatives

I saw something like this in a very old DDE extension that someone at
Sun wrote.  It was used as an example windows extension for years.

ftp://tcl.activestate.com/pub/tcl/misc/example.zip

In example.h is this:

|#if defined(__WIN32__)
|#   if defined(_MSC_VER)
|#	define EXPORT(a,b) __declspec(dllexport) a b
|#   else
|#	if defined(__BORLANDC__)
|#	    define EXPORT(a,b) a _export b
|#	else
|#	    define EXPORT(a,b) a b
|#	endif
|#   endif
|#else
|#   define EXPORT(a,b) a b
|#endif
|
|EXTERN EXPORT(int,Example_Init) _ANSI_ARGS_((Tcl_Interp *interp));

That work is doing the same job, but I prefer the method that I'm
proposing.

It is also mentioned on http://tcl.activestate.com/doc/howto/winext.html
and feel it is rather out-of-date and the order issue with ''__export''
should be brought into the core with this patch and be fix for good.

Is>

|	EXTERN int Foobar_Init (Tcl_Interp *interp);

Proposed>

|	TCL_EXTERN(int) Foobar_Init (Tcl_Interp *interp);

~ Reference Implementation

https://sourceforge.net/tracker/download.php?group_id=10894&atid=310894&file_id=70480&aid=436116

~ Examples

Is:

|EXTERN int
|Foobar_Init (Tcl_Interp *interp)
|{

|#ifdef USE_TCL_STUBS
|    if (Tcl_InitStubs(interp, "8.1", 0) == NULL) {
|        return TCL_ERROR;
|    }

|#endif
|    Tcl_CreateObjCommand(interp, "foobar", FooBar, NULL, NULL);
|    return TCL_OK;
|};

Proposed:

|TCL_EXTERN(int)
|Foobar_Init (Tcl_Interp *interp)
|{

|#ifdef USE_TCL_STUBS
|    if (Tcl_InitStubs(interp, "8.1", 0) == NULL) {
|        return TCL_ERROR;
|    }

|#endif
|    Tcl_CreateObjCommand(interp, "foobar", FooBar, NULL, NULL);
|    return TCL_OK;
|};

Preprocessor output is the following:

 >	Borland:

|/* foobar.c 14: */extern  int __export
|/* foobar.c 15: */Foobar_Init (Tcl_Interp *interp)
|/* foobar.c 16: */{
|/* foobar.c 17: */
|/* foobar.c 18: */if (Tcl_InitStubs(interp, "8.1", 0) == 0) {
|/* foobar.c 19: */return 1;
|/* foobar.c 20: */}
|/* foobar.c 21: */
|/* foobar.c 22: */(tclStubsPtr->tcl_CreateObjCommand)(interp, "foobar", FooBar, 0, 0);
|/* foobar.c 23: */return 0;
|/* foobar.c 24: */};

 >	VC++:

|extern  __declspec(dllexport) int
|Foobar_Init (Tcl_Interp *interp)
|{

|
|    if (Tcl_InitStubs(interp, "8.1", 0) == ((void *)0)) {
|        return 1;
|    }

|#line 22 "foobar.c"
|    (tclStubsPtr->tcl_CreateObjCommand)(interp, "foobar", FooBar, ((void *)0), ((void *)0));
|    return 0;
|};

 >	MinGW (native gcc on win):

|extern       int
|Foobar_Init (Tcl_Interp *interp)
|{

|
|    if (Tcl_InitStubs(interp, "8.1", 0) == ((void *)0) ) {
|        return 1 ;
|    }

|
|    (tclStubsPtr->tcl_CreateObjCommand) (interp, "foobar", FooBar, ((void *)0) , ((void *)0) );
|    return 0 ;
|};

~ Random Notes

In ''tclInt.h'' starting around line 1916, are prototypes for the
internal cmdprocs.  I can't think of any reason why they should be
exported.  Also note the comment about line:1673, as it states:

|/*
| *----------------------------------------------------------------
| * Procedures shared among Tcl modules but not used by the outside
| * world:
| *----------------------------------------------------------------
| */

As the current EXTERN macro places ""everything"" exportable, the use of EXTERN following this comment in ''tclInt.h'' is contradictory.  In place of EXTERN for this purpose I used the new TCL_EXTRNC in the reference implementation.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|
|
|

|

|

|
|

|

|
|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|
|

|

|

|

|

|

|
|
<
>
|
|
|
<
>
|
|
|
|

|
|
<
>
|
|
|
<
>
|
|
|
|

|
|
|
|
|
|
|
|
|
|
|

|

|
|
<
>
|
|
|
<
>
|
|
|
|

|

|
|
<
>
|
|
|
<
>
|
|
|
|

|

|

|
|
|
|
|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

101
102
103
104

105
106
107
108
109
110
111
112
113
114

115
116
117
118

119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144

145
146
147
148

149
150
151
152
153
154
155
156
157
158

159
160
161
162

163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187

# TIP 60: EXTERN Macro Change to Support a Wider Set of Attributes

	Author:         David Gravereaux <[email protected]>
	Author:         Donal K. Fellows <[email protected]>
	State:          Rejected
	Type:           Project
	Vote:           Done
	Created:        06-Sep-2001
	Post-History:   
	Tcl-Version:    8.6
-----

# Abstract

This TIP proposes a change to how the EXTERN macro in _tcl.h_ works
to support a wider range of compiler specific attributes.

# Rationale

With working on Borland support recently, I found that luckily the
newest "free commandline tools"
<http://www.borland.com/bcppbuilder/freecompiler/>  does support
Microsoft's _\_\_declspec\(dllexport\)_ attribute.  But at the same
time, the older way with _\_\_export_ is still valid, but can't be
used due to the order within the prototype declaration of the EXTERN
macro.

What's this with the MS compiler:

		__declspec(dllexport) __cdecl int func (int a, int b);

will have to be this with Borland:

		int __export __cdecl func (int a, int b);

The order of the attribute needs to be after the return type.

Even though _\_\_declspec_ is supported in the Microsoft style with
version 5.5\+ of the Borland compiler, if EXTERN could swap around the
order a hair, old Turbo C v5.0 has a better chance to make a DOS
library.  Should someone feel the need.

Let's leave the existing EXTERN macro as-is and just make a new one called TCL\_EXTERN to support the new behavior.

Karl Lembuaer \(sp?\) did a presentation @ OSCON regarding his recent
tinytcl project _%TODO: add link here%_ about his DOS port of Tcl
6.7 for use in a hand-held device.

Stepping backward for DOS support, may actually be a leap forward in
an off-beat manner...

# Rejected Alternatives

I saw something like this in a very old DDE extension that someone at
Sun wrote.  It was used as an example windows extension for years.

ftp://tcl.activestate.com/pub/tcl/misc/example.zip

In example.h is this:

	#if defined(__WIN32__)
	#   if defined(_MSC_VER)
	#	define EXPORT(a,b) __declspec(dllexport) a b
	#   else
	#	if defined(__BORLANDC__)
	#	    define EXPORT(a,b) a _export b
	#	else
	#	    define EXPORT(a,b) a b
	#	endif
	#   endif
	#else
	#   define EXPORT(a,b) a b
	#endif

	EXTERN EXPORT(int,Example_Init) _ANSI_ARGS_((Tcl_Interp *interp));

That work is doing the same job, but I prefer the method that I'm
proposing.

It is also mentioned on <http://tcl.activestate.com/doc/howto/winext.html>
and feel it is rather out-of-date and the order issue with _\_\_export_
should be brought into the core with this patch and be fix for good.

Is>

		EXTERN int Foobar_Init (Tcl_Interp *interp);

Proposed>

		TCL_EXTERN(int) Foobar_Init (Tcl_Interp *interp);

# Reference Implementation

<https://sourceforge.net/tracker/download.php?group\_id=10894&atid=310894&file\_id=70480&aid=436116>

# Examples

Is:

	EXTERN int
	Foobar_Init (Tcl_Interp *interp)

	{
	#ifdef USE_TCL_STUBS
	    if (Tcl_InitStubs(interp, "8.1", 0) == NULL) {
	        return TCL_ERROR;

	    }
	#endif
	    Tcl_CreateObjCommand(interp, "foobar", FooBar, NULL, NULL);
	    return TCL_OK;
	};

Proposed:

	TCL_EXTERN(int)
	Foobar_Init (Tcl_Interp *interp)

	{
	#ifdef USE_TCL_STUBS
	    if (Tcl_InitStubs(interp, "8.1", 0) == NULL) {
	        return TCL_ERROR;

	    }
	#endif
	    Tcl_CreateObjCommand(interp, "foobar", FooBar, NULL, NULL);
	    return TCL_OK;
	};

Preprocessor output is the following:

 >	Borland:

	/* foobar.c 14: */extern  int __export
	/* foobar.c 15: */Foobar_Init (Tcl_Interp *interp)
	/* foobar.c 16: */{
	/* foobar.c 17: */
	/* foobar.c 18: */if (Tcl_InitStubs(interp, "8.1", 0) == 0) {
	/* foobar.c 19: */return 1;
	/* foobar.c 20: */}
	/* foobar.c 21: */
	/* foobar.c 22: */(tclStubsPtr->tcl_CreateObjCommand)(interp, "foobar", FooBar, 0, 0);
	/* foobar.c 23: */return 0;
	/* foobar.c 24: */};

 >	VC\+\+:

	extern  __declspec(dllexport) int
	Foobar_Init (Tcl_Interp *interp)

	{

	    if (Tcl_InitStubs(interp, "8.1", 0) == ((void *)0)) {
	        return 1;

	    }
	#line 22 "foobar.c"
	    (tclStubsPtr->tcl_CreateObjCommand)(interp, "foobar", FooBar, ((void *)0), ((void *)0));
	    return 0;
	};

 >	MinGW \(native gcc on win\):

	extern       int
	Foobar_Init (Tcl_Interp *interp)

	{

	    if (Tcl_InitStubs(interp, "8.1", 0) == ((void *)0) ) {
	        return 1 ;

	    }

	    (tclStubsPtr->tcl_CreateObjCommand) (interp, "foobar", FooBar, ((void *)0) , ((void *)0) );
	    return 0 ;
	};

# Random Notes

In _tclInt.h_ starting around line 1916, are prototypes for the
internal cmdprocs.  I can't think of any reason why they should be
exported.  Also note the comment about line:1673, as it states:

	/*
	 *----------------------------------------------------------------
	 * Procedures shared among Tcl modules but not used by the outside
	 * world:
	 *----------------------------------------------------------------
	 */

As the current EXTERN macro places ""everything"" exportable, the use of EXTERN following this comment in _tclInt.h_ is contradictory.  In place of EXTERN for this purpose I used the new TCL\_EXTRNC in the reference implementation.

# Copyright

This document has been placed in the public domain.

Name change from tip/61.tip to tip/61.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77

TIP:            61
Title:          Make TK_NO_SECURITY Run-Time Switchable
Version:        $Revision: 1.4 $
Author:         Jeff Hobbs <[email protected]>
Author:         Donal K. Fellows <[email protected]>
State:          Deferred
Type:           Project
Vote:           Pending
Created:        12-Sep-2001
Post-History:   
Tcl-Version:    8.5

~ Abstract

This TIP changes the compile time Tk define TK_NO_SECURITY to be
switchable at run-time.

~ Rationale

The TK_NO_SECURITY compile time #define is available to disable some
security checking when send is used.  The direct comments in the
Makefile are:

| # To turn off the security checks that disallow incoming sends when
| # the X server appears to be insecure, reverse the comments on the
| # following lines:
| SECURITY_FLAGS		=
| #SECURITY_FLAGS		= -DTK_NO_SECURITY

I propose to make this switch configurable at runtime through a ''tk
securesend'' option.

~ Benefits

Users would be able to debug between Tk applications on Unix using
''send'' without having to compile a special version of Tk or
manipulating the security settings of their X server to Tk's liking
(which can then conflict with other work).  It is common for users in
internal ("safe") networks to open up access to an X server with
''xhost +machine''.

~ Drawbacks

By allowing security to be disabled, users do possibly open up their
system to attack.  However, secure is the default setting, and any
paranoid users can ''rename send {}'' to ensure that it is not used at
all.

~ Reference Implementation

A full patch for this feature is available at:

http://sf.net/tracker/?func=detail&aid=456732&group_id=12997&atid=312997

The proposal adds one element to the private ''TkDisplay'' structure
(configuration for secure send is done per display), and creates the
Tcl level command:

|	tk securesend ?-displayof window? ?boolean?

It leaves the TK_NO_SECURITY flag alone.  If specified, send is
insecure by default, otherwise it is secure.

~ Comments

''DKF'' - It should be possible to control the setting of the
compile-time TK_NO_SECURITY flag from the ''configure'' script; having
to edit the Makefile by hand to adjust it makes it too easy to
inadvertently break something by introducing an unfortunate typo. Being
able to pass a ''--disable-security'' flag would make thing much easier
from a user's point of view, and will make it less likely that the Tk
maintainers will have to deal with bug reports that ultimately stem from
a dumb mistake made in a sensitive spot...

~ Copyright  

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|
|
|
|

|
|

|

|

|
|
|

|

|

|

|

|
|

|

|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77

# TIP 61: Make TK_NO_SECURITY Run-Time Switchable

	Author:         Jeff Hobbs <[email protected]>
	Author:         Donal K. Fellows <[email protected]>
	State:          Deferred
	Type:           Project
	Vote:           Pending
	Created:        12-Sep-2001
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

This TIP changes the compile time Tk define TK\_NO\_SECURITY to be
switchable at run-time.

# Rationale

The TK\_NO\_SECURITY compile time \#define is available to disable some
security checking when send is used.  The direct comments in the
Makefile are:

	 # To turn off the security checks that disallow incoming sends when
	 # the X server appears to be insecure, reverse the comments on the
	 # following lines:
	 SECURITY_FLAGS		=
	 #SECURITY_FLAGS		= -DTK_NO_SECURITY

I propose to make this switch configurable at runtime through a _tk
securesend_ option.

# Benefits

Users would be able to debug between Tk applications on Unix using
_send_ without having to compile a special version of Tk or
manipulating the security settings of their X server to Tk's liking
\(which can then conflict with other work\).  It is common for users in
internal \("safe"\) networks to open up access to an X server with
_xhost \+machine_.

# Drawbacks

By allowing security to be disabled, users do possibly open up their
system to attack.  However, secure is the default setting, and any
paranoid users can _rename send \{\}_ to ensure that it is not used at
all.

# Reference Implementation

A full patch for this feature is available at:

<http://sf.net/tracker/?func=detail&aid=456732&group\_id=12997&atid=312997>

The proposal adds one element to the private _TkDisplay_ structure
\(configuration for secure send is done per display\), and creates the
Tcl level command:

		tk securesend ?-displayof window? ?boolean?

It leaves the TK\_NO\_SECURITY flag alone.  If specified, send is
insecure by default, otherwise it is secure.

# Comments

_DKF_ - It should be possible to control the setting of the
compile-time TK\_NO\_SECURITY flag from the _configure_ script; having
to edit the Makefile by hand to adjust it makes it too easy to
inadvertently break something by introducing an unfortunate typo. Being
able to pass a _--disable-security_ flag would make thing much easier
from a user's point of view, and will make it less likely that the Tk
maintainers will have to deal with bug reports that ultimately stem from
a dumb mistake made in a sensitive spot...

# Copyright  

This document has been placed in the public domain.

Name change from tip/62.tip to tip/62.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170

171
172
173
174
175

176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197

198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347

348
349
350
351
352
353
354

355
356
357

TIP:            62
Title:          Add Support for Command Tracing
Version:        $Revision: 1.11 $
Author:         Hemang Lavana <[email protected]>
Author:         Vince Darley <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        18-Sep-2001
Post-History:   
Tcl-Version:    8.4

~ Abstract

This TIP proposes that the Tcl's trace command be extended to include
the following features:
  1. tracing of command execution for the specified tcl command, and
  2. step-wise tracing of any command execution within a specified procedure.

~ Rationale

One of the main strengths of Tcl is the ability to trace ''read'',
''write'' or ''delete'' operations on variables.  Moreover, Tcl8.4 has
already added support for tracing ''rename'' or ''delete'' operations
on Tcl commands.  Addition of the proposed subcommand for tracing
executions will further improve the capabilities of Tcl without any loss
of performance (see ''Benchmark Results'' section below).

I can see several applications of this feature, including:

  *  overloading/wrapping of Tcl commands, please see
     http://mini.net/tcl/1494.html

  *  aid developer in debugging Tcl scripts

  *  profiler module in ''tcllib'' can benefit from this feature

~ Specification

This TIP proposes an enhancement to the trace command with the
following syntax:

|        trace add execution name ops command

The type ''execution'' is used to arrange for ''command'' to be executed
whenever the command ''name'' is invoked for execution.
''Name'' may refer to any of the tcl commands or procedures that have been
previously defined. It is an error to create an ''execution'' trace on a 
non-existant command or a procedure.

The ''ops'' argument can accept ''enter'', ''leave'', ''enterstep'', and
''leavestep'' as valid operations:

   1. ''enter'' - Invoke ''command'' whenever the command ''name'' is
        executed, just before the actual execution takes place.

   2. ''leave'' - Invoke ''command'' whenever the command ''name'' is
        executed, just after the actual execution takes place.

   3. ''enterstep'' - Invoke ''command'' for every tcl command which is
        executed inside the procedure ''name'', just before the actual
	execution takes place. Setting a ''enterstep'' trace on a ''command''
	will not result in an error and is simply ignored.

   4. ''leavestep'' - Invoke ''command'' for every tcl command which is
        executed inside the procedure ''name'', just after the actual
	execution takes place. Setting a ''leavestep'' trace on a ''command''
	will not result in an error and is simply ignored.

When the trace triggers, depending on the operations being traced, a
number of arguments are appended to command so that the actual command
is as follows:

For ''enter'' and ''enterstep'' operations:

|    command command-string op

''command-string'' gives the complete current command being executed,
including all arguments in their fully expanded form.
''Op'' indicates what operation is being performed on the
variable, and is one of ''enter'' or ''enterstep'' here.
The trace operation can be used to stop the command from executing,
by deleting the command in question.  Of course when the command is
subsequently executed, an 'invalid command' error will occur.

For ''leave'' and ''leavestep'' operations:

|   command command-string code result op

''command-string'' gives the complete current command being executed,
including all arguments in their fully expanded form.  ''code'' gives
the result code of that execution, and ''result'' gives its result
string.  ''Op'' indicates what operation is being performed on the
variable, and is one of ''leave'' or ''leavestep'' here.

''Command'' executes in the same context as the code that invoked
the traced operation: thus the ''command'', if invoked from a procedure,
will have access to the same local variables as code in the procedure.
This context may be different than the context in which the trace was
created. If ''command'' invokes a procedure (which it normally does)
then the procedure will have to use upvar or uplevel commands if it
wishes to access the local variables of the code which invoked the
trace operation. Note that if the value of a local variable is passed
as an argument to the traced command ''name'' and is modified by
the ''command'' procedure, the traced command ''name'' will still
be invoked with the old value of the local variable. This is because
the argument list to ''name'' is formed before the traced ''command''
is invoked. Please see the section on ''Future Scope'' below on how
to modify the arguments passed to ''name''.

While ''command'' is executing during an ''execution'' trace, traces
on ''name'' are temporarily disabled. This allows the ''command'' to
execute ''name'' in its body without invoking any other traces
again. If an error occurs while executing the ''command'' body,
then the command ''name'' as a whole will return that same error.
Therefore, if ''catch'' command is used for invocation of the
''name'' command, it will also ignore errors resulting from such traces.

When multiple traces are set on ''name'', then the sequence of trace
command invocation is as follows:

    1.  For ''enter'' and ''enterstep'' operations, the traced
        commands are invoked in the reverse order of how these
	traces were created.

    2.  For ''leave'' and ''leavestep'' operations, the traced
        commands are invoked in the same order as how these
	traces were created.

For example, if we have two traces on proc foo:

|    trace add execution foo {enter leave} {barA}
|    trace add execution foo {enter leave} {barB}

then the trace commands ''barA'' and ''barB'' will be invoked in
the following sequence:

|    barB {foo x} {enter}
|    barA {foo x} {enter}
|      foo x
|    barA {foo x} 0 {} {leave}
|    barB {foo x} 0 {} {leave}

The creation of many ''enterstep'' or ''leavestep'' traces can lead
to unintuitive results, since the invoked commands from one trace
can themselves lead to further command invocations for other traces.
However, these unintuitive results are completely predictable and safe
(and tested in the test suite).  Nevertheless the user will probably
only want to have one such trace active at a time.

Once created, the trace remains in effect either until the trace is
removed with the ''trace remove execution'' command, until the ''name''
is deleted or until the interpreter is deleted. Note that renaming the
command ''name'' will not remove the execution traces.

To implement ''enterstep'' and ''leavestep'' traces, it is necessary
to invoke traces regardless of at what level the ''command'' is being
traced. This means that the value for ''level'' argument to 
Tcl_CreateTrace and Tcl_CreateObjTrace APIs should also accept ''0''.
A value of ''0'' implies that commands at all levels will be traced. 

~ Examples

The following script defines a procedure ''foo'' and illustrates 
several cases as to how its execution can be traced.

|    # Define the proc foo
|    proc foo {var} {
|         return [string index $var [expr {$var*2}]]
|    }

|    
|    # Command to invoke on trace activation
|    proc print {args} {
|        puts stdout "PRINT: $args"
|    }

|    
|    proc main {} {
|        puts stdout "================CASE 1========================="
|        puts stdout "Trace proc foo only"
|        trace add execution foo {enter leave} {print exec}
|        foo 4
|    
|        puts stdout "================CASE 2========================="
|        puts stdout "Trace proc foo as well as all commands within it"
|        trace add execution foo {enterstep leavestep} {print step}
|        foo 4
|    
|        # Remove all traces
|        trace remove execution foo {enter leave} {print exec}
|        trace remove execution foo {enterstep leavestep} {print step}
|    
|        puts stdout "================CASE 3========================="
|        puts stdout "Add a trace on string command"
|        trace add execution string {enter leave} {print exec}
|        foo 4
|        trace remove execution string {enter leave} {print exec}
|    }

|    main

The expected output of running the above script would be:

|    ===================CASE 1========================
|    Trace proc foo only
|    PRINT: exec {foo 4} enter
|    PRINT: exec {foo 4} 0 {} leave
|    ===================CASE 2=======================
|    Trace proc foo as well as all commands within it
|    PRINT: exec {foo 4} enter
|    PRINT: step {expr {$var*2}} enterstep
|    PRINT: step {expr {$var*2}} 0 8 leavestep
|    PRINT: step {string index 4 8} enterstep
|    PRINT: step {string index 4 8} 0 {} leavestep
|    PRINT: step {return {}} enterstep
|    PRINT: step {return {}} 2 {} leavestep
|    PRINT: exec {foo 4} 0 {} leave
|    ===================CASE 3=======================
|    Add a trace on string command
|    PRINT: exec {string index 4 8} enter
|    PRINT: exec {string index 4 8} 0 {} leave

Case 1 specifies a enter and leave trace on proc foo.
Here the proc foo is fully byte-code-compiled.

Case 2 additionally invokes a enterstep and leavestep trace on
proc foo.  This means that it will trace each command 
that is inovked within the proc foo.
Here the proc foo is *not* byte-code-compiled. This is
implemented by setting the DONT_COMPILE_CMDS_INLINE flag.

Case 3 specifies a trace on string command only.
Here all commands within proc foo, except string command,
is byte-code-compiled. This is implemented by modifying
compilation engine to check for CMD_HAS_EXEC_TRACES flag
before generating any byte-code.

~ Reference Implementation

This proposal was originally implemented by Vince Darley.  Please see
Feature Request #462580:
http://sf.net/tracker/?func=detail&aid=462580&group_id=10894&atid=360894

The original patch from Vince Darley has been modified in the
following respects:

  1.  For ''enter'' and ''enterstep'' operations, the original patch
      passed arguments to the ''command'' in its unexpanded form.
      This behavior has been changed to pass the arguments in its
      fully expanded form since it should be more useful for debugging
      scripts.

  2.  The original patch could not trace Tcl commands that were
      invoked inside a procedure because tracing is currently not
      possible for compiled commands.  Therefore, the patch was
      modified such that Tcl commands are no longer internally
      compiled if a trace has been set on a command.

  3.  For multiple traces on same command, the original patch 
      invoked the traces in the same order as they were created.
      This behavior was changed so that for ''enter'' and ''enterstep''
      operations, the traces are invoked in the reverse order of
      its creation. For ''leave'' and ''leavestep'', the traces are
      still invoked in the original order.

  4.  The original patch was created on 2000-Sept-14.  It was updated
      to work with the current CVS head.

The latest patch for this tip 62 is available at:

  http://www.employees.org/~hlavana/tcl/

The main changes for the patch are described in brief next.

Two new flags have been defined in tcl.h:

|  #define TCL_TRACE_ENTER_EXEC          1
|  #define TCL_TRACE_LEAVE_EXEC          2

These flag values are passed to Tcl_CreateObjTrace and used by command
execution traces. More internal flags for slots 4, 8, 15, 16, 32 are
defined in tclCmdMZ.c file: TCL_TRACE_ENTER_DURING_EXEC, 
TCL_TRACE_LEAVE_DURING_EXEC,  TCL_TRACE_ANY_EXEC, TCL_TRACE_EXEC_IN_PROGRESS
and TCL_TRACE_EXEC_DIRECT.

A new function TclTraceExecutionObjCmd function implements the
''trace {add|remove|list} execution ...'' subcommands.
A new function TclCheckExecutionTraces is defined to check for traces added
by the execution subcommand. A new function TclCheckInterpTraces is defined
to check for global traces added by the Tcl_CreateObjTrace command.
The TclEvalObjvInternal has been modified to call the above
two functions before as well as after the original command is executed.
A new function TraceExecutionProc is invoked, when necessary, to execute
the actual trace command in the interpreter.

A new structure ActiveInterpTrace has been defined for internal use so
that it behaves reasonably when traces are deleted while active.
In tclVar.c file, the function CallTraces has been renamed to CallVarTraces
and iPtr->activeTracePtr has been renamed to iPtr->activeTraceVarPtr.

An additional check for (tracePtr->level == 0) has been added in
Tcl_EvalObjv and TclExecuteByteCode functions, so as to enable
command tracing at all levels.

~ Benchmark Results

The benchmark results corresponds to ''Version 1.1'' of the reference
implementation.

One potential objection to this TIP could be that it may affect the
performance of the Tcl-core.  Therefore, I have run the
''runbench.tcl'' script from the tclbench module for comparison on a
Sun Ultra5, Solaris2.6 machine.  The results have been posted at
http://sf.net/tracker/?func=detail&aid=462580&group_id=10894&atid=360894

These results show that there is hardly any performance hit, if any,
by addition of this feature.  Of course when you activate a trace on a
command, then you will see a performance hit, but since primary uses
of traces will be in profiling and debugging, that isn't an issue.

~ Future Scope

This proposal does not allow for the trace invocation ''command''
to do the following:

  1. modify the number of arguments passed to ''name''

  2. modify the value of arguments passed to ''name''

  3. modify the result value and result code returned by ''name''

  4. skip invocation of ''name'' altogether if desired.

Consider the example of adding a sub-command "string reverse ..."
as shown on http://mini.net/tcl/1570.html.
Instead of using the rename command, it should be possible to use
the trace command to do the same, as follows:

|    trace add execution string {enter} {::mylib::stringx}
|    proc ::mylib::stringx {subcmd args} {
|        switch -exact -- $subcmd {
|            "reverse" {
|                # Hmm ... this is my subcommand, process it here
|                set returnValue [code_to_reverse_string_value]
|
|                # We need a mechansim to return immediately here
|                # with the processed results and an appropriate
|                # code value and not invoke the original string command.
|            }

|            default {
|                # This is probably a vaild subcommand, so do nothing
|                # and let the original string command handle it
|            }
|        }
|    }

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|
|
|

|

|

|

|

|

|
|
|
|

|
|

|

|

|
|
|

|
|
|

|

|

|

|
|

|

|

|
|
|
|
|

|
|

|

|
|

|
|
|

|
|
|
|
|
|
|

|

|

|

|
|

|

|
|
|
|
|

|

|

|

|

|
|
|
|
|

|

|

|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|
|

|

|

|
|

|
|

|

|

|

|
|

|

|
|
|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
<
<
<
|
>
>
>
|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168

169
170
171
172
173

174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195

196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345

346
347
348
349

350
351
352
353
354
355
356
357

# TIP 62: Add Support for Command Tracing

	Author:         Hemang Lavana <[email protected]>
	Author:         Vince Darley <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        18-Sep-2001
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

This TIP proposes that the Tcl's trace command be extended to include
the following features:
  1. tracing of command execution for the specified tcl command, and
  2. step-wise tracing of any command execution within a specified procedure.

# Rationale

One of the main strengths of Tcl is the ability to trace _read_,
_write_ or _delete_ operations on variables.  Moreover, Tcl8.4 has
already added support for tracing _rename_ or _delete_ operations
on Tcl commands.  Addition of the proposed subcommand for tracing
executions will further improve the capabilities of Tcl without any loss
of performance \(see _Benchmark Results_ section below\).

I can see several applications of this feature, including:

  *  overloading/wrapping of Tcl commands, please see
     <http://mini.net/tcl/1494.html>

  *  aid developer in debugging Tcl scripts

  *  profiler module in _tcllib_ can benefit from this feature

# Specification

This TIP proposes an enhancement to the trace command with the
following syntax:

	        trace add execution name ops command

The type _execution_ is used to arrange for _command_ to be executed
whenever the command _name_ is invoked for execution.
_Name_ may refer to any of the tcl commands or procedures that have been
previously defined. It is an error to create an _execution_ trace on a 
non-existant command or a procedure.

The _ops_ argument can accept _enter_, _leave_, _enterstep_, and
_leavestep_ as valid operations:

   1. _enter_ - Invoke _command_ whenever the command _name_ is
        executed, just before the actual execution takes place.

   2. _leave_ - Invoke _command_ whenever the command _name_ is
        executed, just after the actual execution takes place.

   3. _enterstep_ - Invoke _command_ for every tcl command which is
        executed inside the procedure _name_, just before the actual
	execution takes place. Setting a _enterstep_ trace on a _command_
	will not result in an error and is simply ignored.

   4. _leavestep_ - Invoke _command_ for every tcl command which is
        executed inside the procedure _name_, just after the actual
	execution takes place. Setting a _leavestep_ trace on a _command_
	will not result in an error and is simply ignored.

When the trace triggers, depending on the operations being traced, a
number of arguments are appended to command so that the actual command
is as follows:

For _enter_ and _enterstep_ operations:

	    command command-string op

_command-string_ gives the complete current command being executed,
including all arguments in their fully expanded form.
_Op_ indicates what operation is being performed on the
variable, and is one of _enter_ or _enterstep_ here.
The trace operation can be used to stop the command from executing,
by deleting the command in question.  Of course when the command is
subsequently executed, an 'invalid command' error will occur.

For _leave_ and _leavestep_ operations:

	   command command-string code result op

_command-string_ gives the complete current command being executed,
including all arguments in their fully expanded form.  _code_ gives
the result code of that execution, and _result_ gives its result
string.  _Op_ indicates what operation is being performed on the
variable, and is one of _leave_ or _leavestep_ here.

_Command_ executes in the same context as the code that invoked
the traced operation: thus the _command_, if invoked from a procedure,
will have access to the same local variables as code in the procedure.
This context may be different than the context in which the trace was
created. If _command_ invokes a procedure \(which it normally does\)
then the procedure will have to use upvar or uplevel commands if it
wishes to access the local variables of the code which invoked the
trace operation. Note that if the value of a local variable is passed
as an argument to the traced command _name_ and is modified by
the _command_ procedure, the traced command _name_ will still
be invoked with the old value of the local variable. This is because
the argument list to _name_ is formed before the traced _command_
is invoked. Please see the section on _Future Scope_ below on how
to modify the arguments passed to _name_.

While _command_ is executing during an _execution_ trace, traces
on _name_ are temporarily disabled. This allows the _command_ to
execute _name_ in its body without invoking any other traces
again. If an error occurs while executing the _command_ body,
then the command _name_ as a whole will return that same error.
Therefore, if _catch_ command is used for invocation of the
_name_ command, it will also ignore errors resulting from such traces.

When multiple traces are set on _name_, then the sequence of trace
command invocation is as follows:

    1.  For _enter_ and _enterstep_ operations, the traced
        commands are invoked in the reverse order of how these
	traces were created.

    2.  For _leave_ and _leavestep_ operations, the traced
        commands are invoked in the same order as how these
	traces were created.

For example, if we have two traces on proc foo:

	    trace add execution foo {enter leave} {barA}
	    trace add execution foo {enter leave} {barB}

then the trace commands _barA_ and _barB_ will be invoked in
the following sequence:

	    barB {foo x} {enter}
	    barA {foo x} {enter}
	      foo x
	    barA {foo x} 0 {} {leave}
	    barB {foo x} 0 {} {leave}

The creation of many _enterstep_ or _leavestep_ traces can lead
to unintuitive results, since the invoked commands from one trace
can themselves lead to further command invocations for other traces.
However, these unintuitive results are completely predictable and safe
\(and tested in the test suite\).  Nevertheless the user will probably
only want to have one such trace active at a time.

Once created, the trace remains in effect either until the trace is
removed with the _trace remove execution_ command, until the _name_
is deleted or until the interpreter is deleted. Note that renaming the
command _name_ will not remove the execution traces.

To implement _enterstep_ and _leavestep_ traces, it is necessary
to invoke traces regardless of at what level the _command_ is being
traced. This means that the value for _level_ argument to 
Tcl\_CreateTrace and Tcl\_CreateObjTrace APIs should also accept _0_.
A value of _0_ implies that commands at all levels will be traced. 

# Examples

The following script defines a procedure _foo_ and illustrates 
several cases as to how its execution can be traced.

	    # Define the proc foo
	    proc foo {var} {
	         return [string index $var [expr {$var*2}]]

	    }

	    # Command to invoke on trace activation
	    proc print {args} {
	        puts stdout "PRINT: $args"

	    }

	    proc main {} {
	        puts stdout "================CASE 1========================="
	        puts stdout "Trace proc foo only"
	        trace add execution foo {enter leave} {print exec}
	        foo 4

	        puts stdout "================CASE 2========================="
	        puts stdout "Trace proc foo as well as all commands within it"
	        trace add execution foo {enterstep leavestep} {print step}
	        foo 4

	        # Remove all traces
	        trace remove execution foo {enter leave} {print exec}
	        trace remove execution foo {enterstep leavestep} {print step}

	        puts stdout "================CASE 3========================="
	        puts stdout "Add a trace on string command"
	        trace add execution string {enter leave} {print exec}
	        foo 4
	        trace remove execution string {enter leave} {print exec}

	    }
	    main

The expected output of running the above script would be:

	    ===================CASE 1========================
	    Trace proc foo only
	    PRINT: exec {foo 4} enter
	    PRINT: exec {foo 4} 0 {} leave
	    ===================CASE 2=======================
	    Trace proc foo as well as all commands within it
	    PRINT: exec {foo 4} enter
	    PRINT: step {expr {$var*2}} enterstep
	    PRINT: step {expr {$var*2}} 0 8 leavestep
	    PRINT: step {string index 4 8} enterstep
	    PRINT: step {string index 4 8} 0 {} leavestep
	    PRINT: step {return {}} enterstep
	    PRINT: step {return {}} 2 {} leavestep
	    PRINT: exec {foo 4} 0 {} leave
	    ===================CASE 3=======================
	    Add a trace on string command
	    PRINT: exec {string index 4 8} enter
	    PRINT: exec {string index 4 8} 0 {} leave

Case 1 specifies a enter and leave trace on proc foo.
Here the proc foo is fully byte-code-compiled.

Case 2 additionally invokes a enterstep and leavestep trace on
proc foo.  This means that it will trace each command 
that is inovked within the proc foo.
Here the proc foo is \*not\* byte-code-compiled. This is
implemented by setting the DONT\_COMPILE\_CMDS\_INLINE flag.

Case 3 specifies a trace on string command only.
Here all commands within proc foo, except string command,
is byte-code-compiled. This is implemented by modifying
compilation engine to check for CMD\_HAS\_EXEC\_TRACES flag
before generating any byte-code.

# Reference Implementation

This proposal was originally implemented by Vince Darley.  Please see
Feature Request \#462580:
<http://sf.net/tracker/?func=detail&aid=462580&group\_id=10894&atid=360894>

The original patch from Vince Darley has been modified in the
following respects:

  1.  For _enter_ and _enterstep_ operations, the original patch
      passed arguments to the _command_ in its unexpanded form.
      This behavior has been changed to pass the arguments in its
      fully expanded form since it should be more useful for debugging
      scripts.

  2.  The original patch could not trace Tcl commands that were
      invoked inside a procedure because tracing is currently not
      possible for compiled commands.  Therefore, the patch was
      modified such that Tcl commands are no longer internally
      compiled if a trace has been set on a command.

  3.  For multiple traces on same command, the original patch 
      invoked the traces in the same order as they were created.
      This behavior was changed so that for _enter_ and _enterstep_
      operations, the traces are invoked in the reverse order of
      its creation. For _leave_ and _leavestep_, the traces are
      still invoked in the original order.

  4.  The original patch was created on 2000-Sept-14.  It was updated
      to work with the current CVS head.

The latest patch for this tip 62 is available at:

  <http://www.employees.org/~hlavana/tcl/>

The main changes for the patch are described in brief next.

Two new flags have been defined in tcl.h:

	  #define TCL_TRACE_ENTER_EXEC          1
	  #define TCL_TRACE_LEAVE_EXEC          2

These flag values are passed to Tcl\_CreateObjTrace and used by command
execution traces. More internal flags for slots 4, 8, 15, 16, 32 are
defined in tclCmdMZ.c file: TCL\_TRACE\_ENTER\_DURING\_EXEC, 
TCL\_TRACE\_LEAVE\_DURING\_EXEC,  TCL\_TRACE\_ANY\_EXEC, TCL\_TRACE\_EXEC\_IN\_PROGRESS
and TCL\_TRACE\_EXEC\_DIRECT.

A new function TclTraceExecutionObjCmd function implements the
_trace \{add\|remove\|list\} execution ..._ subcommands.
A new function TclCheckExecutionTraces is defined to check for traces added
by the execution subcommand. A new function TclCheckInterpTraces is defined
to check for global traces added by the Tcl\_CreateObjTrace command.
The TclEvalObjvInternal has been modified to call the above
two functions before as well as after the original command is executed.
A new function TraceExecutionProc is invoked, when necessary, to execute
the actual trace command in the interpreter.

A new structure ActiveInterpTrace has been defined for internal use so
that it behaves reasonably when traces are deleted while active.
In tclVar.c file, the function CallTraces has been renamed to CallVarTraces
and iPtr->activeTracePtr has been renamed to iPtr->activeTraceVarPtr.

An additional check for \(tracePtr->level == 0\) has been added in
Tcl\_EvalObjv and TclExecuteByteCode functions, so as to enable
command tracing at all levels.

# Benchmark Results

The benchmark results corresponds to _Version 1.1_ of the reference
implementation.

One potential objection to this TIP could be that it may affect the
performance of the Tcl-core.  Therefore, I have run the
_runbench.tcl_ script from the tclbench module for comparison on a
Sun Ultra5, Solaris2.6 machine.  The results have been posted at
<http://sf.net/tracker/?func=detail&aid=462580&group\_id=10894&atid=360894>

These results show that there is hardly any performance hit, if any,
by addition of this feature.  Of course when you activate a trace on a
command, then you will see a performance hit, but since primary uses
of traces will be in profiling and debugging, that isn't an issue.

# Future Scope

This proposal does not allow for the trace invocation _command_
to do the following:

  1. modify the number of arguments passed to _name_

  2. modify the value of arguments passed to _name_

  3. modify the result value and result code returned by _name_

  4. skip invocation of _name_ altogether if desired.

Consider the example of adding a sub-command "string reverse ..."
as shown on <http://mini.net/tcl/1570.html.>
Instead of using the rename command, it should be possible to use
the trace command to do the same, as follows:

	    trace add execution string {enter} {::mylib::stringx}
	    proc ::mylib::stringx {subcmd args} {
	        switch -exact -- $subcmd {
	            "reverse" {
	                # Hmm ... this is my subcommand, process it here
	                set returnValue [code_to_reverse_string_value]

	                # We need a mechansim to return immediately here
	                # with the processed results and an appropriate
	                # code value and not invoke the original string command.

	            }
	            default {
	                # This is probably a vaild subcommand, so do nothing
	                # and let the original string command handle it

	            }
	        }
	    }

# Copyright

This document has been placed in the public domain.

Name change from tip/63.tip to tip/63.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46

TIP:            63
Title:          Add -compound Option to Menu Entries
Version:        $Revision: 1.5 $
Author:         Vince Darley <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        27-Sep-2001
Post-History:   
Tcl-Version:    8.4

~ Abstract

This TIP adds to menu entries the ability to display both textual
labels and images (or bitmaps) in exactly the same way as buttons and
menubuttons currently can, by adding a ''-compound'' option.

~ Rationale

Menu entries are very similar to labels in most respects, but they
currently lack the ability to show an image and a piece of text at the
same time.  It is a useful piece of functionality currently missing
from Tk (certainly many standard applications make use of such menu
entries, e.g. Internet Explorer).  The changes involved are relatively
small.

A very similar TIP [11] was accepted without much argument.

~ Proposal

This TIP implements the change by adding an additional option
''-compound'' to menu entries which behaves identically to Tk's
existing ''-compound'' option: it accepts the values ''none'',
''center'', ''left'', ''right'', ''top'', and ''bottom'', and controls
the relative placement of an image to the text label in the entry.

A reference implementation exists, and is available from:
ftp://ftp.ucsd.edu/pub/alpha/tcl/compoundmenu.diff (note this diff has
Windows-style line endings).  This implementation is known to work on
Windows and Unix, and since the changes are very, very similar on
MacOS I expect it to work there (except perhaps for simple editing
mistakes).

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|
|

|

|
|

|

|

|
|
|

|
|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46

# TIP 63: Add -compound Option to Menu Entries

	Author:         Vince Darley <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        27-Sep-2001
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

This TIP adds to menu entries the ability to display both textual
labels and images \(or bitmaps\) in exactly the same way as buttons and
menubuttons currently can, by adding a _-compound_ option.

# Rationale

Menu entries are very similar to labels in most respects, but they
currently lack the ability to show an image and a piece of text at the
same time.  It is a useful piece of functionality currently missing
from Tk \(certainly many standard applications make use of such menu
entries, e.g. Internet Explorer\).  The changes involved are relatively
small.

A very similar TIP [[11]](11.md) was accepted without much argument.

# Proposal

This TIP implements the change by adding an additional option
_-compound_ to menu entries which behaves identically to Tk's
existing _-compound_ option: it accepts the values _none_,
_center_, _left_, _right_, _top_, and _bottom_, and controls
the relative placement of an image to the text label in the entry.

A reference implementation exists, and is available from:
ftp://ftp.ucsd.edu/pub/alpha/tcl/compoundmenu.diff \(note this diff has
Windows-style line endings\).  This implementation is known to work on
Windows and Unix, and since the changes are very, very similar on
MacOS I expect it to work there \(except perhaps for simple editing
mistakes\).

# Copyright

This document has been placed in the public domain.

Name change from tip/64.tip to tip/64.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147

TIP:            64
Title:          Improvements to Windows Font Handling
Version:        $Revision: 1.9 $
Author:         Chris Nelson <[email protected]>
Author:         Kevin Kenny <[email protected]>
State:          Deferred
Type:           Project
Vote:           Done
Created:        27-Sep-2001
Post-History:   
Tcl-Version:    8.4
Obsoleted-By:	145

~ Abstract

This TIP improves handling of native fonts in Tk under Microsoft
Windows making Tk applications more aesthetic and more consistent with
users' expectations of 'Windows applications.

~ Background

Tk 8.4 includes platform-specific system font names which relate to
configurable aspects of the native system.

  * On UNIX, this includes all X font names (e.g. as listed by
    ''xlsfonts'').

  * On Macintosh, this includes ''system'' and ''application''.

  * On Microsoft Windows, this includes ''system'', ''systemfixed'',
    ''ansi'', ''ansifixed'', ''device'', and ''oemfixed''.

Through v8.4a3, Tk used 8pt MS Sans Serif as the default font for
widgets.  While this was almost OK, it fails in two respects:

   * Users can change the font used for various 'Windows desktop
     features so MS Sans Serif may not be the correct font for, for
     example, menus.

   * Windows 2000 and Windows XP use Tahoma, not MS Sans Serif as
     their default font.

SourceForge patch #461442 (Make Tk use the default Windows font)
[http://sf.net/tracker/?func=detail&aid=461442&group_id=12997&atid=312997]
attempts to address Tk's deficiency by adding a ''windefault'' font
based on the Message font configured for the Windows desktop.  This
appears to be wrong.

This TIP attempts to fix the default Tk font the right way as well as
giving Tk programmers access to the rest of the fonts configured for
the 'Windows desktop.

NOTE: RFE 220772 on SourceForge
[http://sf.net/tracker/?func=detail&aid=220772&group_id=12997&atid=362997]
has a related patch.

~ The Default GUI Font

The Win API call ''GetStockObject()'' accesses brushes, pens, and
fonts which are pre-configured on the system.  The available fonts
are:

    1. ANSI_FIXED_FONT

    2. ANSI_VAR_FONT

    3. DEVICE_DEFAULT_FONT

    4. DEFAULT_GUI_FONT

    5. OEM_FIXED_FONT

    6. SYSTEM_FIXED_FONT

    7. SYSTEM_FONT

The ''TkStateMap systemMap'' in ''tkWinFont.c'' listed all but one of
these, DEFAULT_GUI_FONT.  As it turns out, this is the most important
as it is the one that 'Windows uses as it's default font (for example,
in Control Panel Applets).

I propose to add DEFAULT_GUI_FONT to the ''systemMap'' with a font
name of ''defaultgui'' and to change CTL_FONT in ''tkWinDefault.h''
from ''{MS Sans Serif} 8'' to ''defaultgui''.  This will require a
change in documentation to list the new system font name but is
otherwise simple and painless.  Furthermore, it makes Tk GUIs look
right on W2k.

A reference implementation for this is available in patch 461442
(referenced above).

~ Access to Desktop Fonts

The original implementation of ''windefault'' as a new font, accessed
the message font from the NONCLIENTMETRICS structure.  While this is
not, in fact, the correct default GUI font, it is an important system
font, as are the others on the NONCLIENTMETRICS structure.  The
structure lists:

   *  Caption (title bar) font

   *  Small Caption (palette title bar) font

   *  Menu font

   *  Tooltip (and status bar) font

   *  Message box font 

The 'Windows Desktop Properties also include a font for icon labels on
the desktop.  This font is accessed with ''SystemParametersInfo()''.

I propose to add 6 desktop fonts as system fonts on Windows.  The
names would be derived from their Desktop Properties entries:
''dtIcon'', ''dtTitleBar'', ''dtMenu'', ''dtMessageBox'',
''dtPaletteTitle'', ''dtToolTip''.  The "dt" prefix associates the
fonts with the desktop properties.  (Can or should font names have
internal capital letters?)

We might also add synonyms which relate to the structure field names
and/or customary use of the font.  I'd propose adding ''dtCaption'' as
equivalent to ''dtTitleBar'', ''dtSmallCaption'' as equivalent to
''dtPaletteTitle'', and ''dtStatus'' as equivalent to ''dtToolTip''.

A reference implementation for this is available in Patch #461442
(referenced above) albeit with different font names.

~ Dynamic fonts

Many 'Windows applications respond on-the-fly to changes in the desktop
fonts.  Tk responds to changes in Tk fonts via [[font configure]].  I
propose that Tk respond to the WM_SETTINGCHANGE message from Windows
to propagate changes to the desktop fonts enumerated above as it
propagates changes to Tk fonts when they are reconfigured.  I have yet
to prototype these changes.

~ Comments

''KBK'' wonders whether the dt* fonts have logical counterparts on the
other platforms (KDE, Gnome/Gtk, Macintosh, HP-VUE, ...) and if
implementors on those platforms might want to try to mirror this
functionality.  Since nobody has commented, he assumes that they
at least do not find the idea objectionable.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|
|

|

|
|

|
|
|

|

|

|

|

|

|

|

|

|

|

|
|
|
|

|
|
|

|

|

|

|

|

|

|

|
|
|
|

|
|
|

|
|

|

|
|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147

# TIP 64: Improvements to Windows Font Handling

	Author:         Chris Nelson <[email protected]>
	Author:         Kevin Kenny <[email protected]>
	State:          Deferred
	Type:           Project
	Vote:           Done
	Created:        27-Sep-2001
	Post-History:   
	Tcl-Version:    8.4
	Obsoleted-By:	145
-----

# Abstract

This TIP improves handling of native fonts in Tk under Microsoft
Windows making Tk applications more aesthetic and more consistent with
users' expectations of 'Windows applications.

# Background

Tk 8.4 includes platform-specific system font names which relate to
configurable aspects of the native system.

  * On UNIX, this includes all X font names \(e.g. as listed by
    _xlsfonts_\).

  * On Macintosh, this includes _system_ and _application_.

  * On Microsoft Windows, this includes _system_, _systemfixed_,
    _ansi_, _ansifixed_, _device_, and _oemfixed_.

Through v8.4a3, Tk used 8pt MS Sans Serif as the default font for
widgets.  While this was almost OK, it fails in two respects:

   * Users can change the font used for various 'Windows desktop
     features so MS Sans Serif may not be the correct font for, for
     example, menus.

   * Windows 2000 and Windows XP use Tahoma, not MS Sans Serif as
     their default font.

SourceForge patch \#461442 \(Make Tk use the default Windows font\)
<http://sf.net/tracker/?func=detail&aid=461442&group_id=12997&atid=312997> 
attempts to address Tk's deficiency by adding a _windefault_ font
based on the Message font configured for the Windows desktop.  This
appears to be wrong.

This TIP attempts to fix the default Tk font the right way as well as
giving Tk programmers access to the rest of the fonts configured for
the 'Windows desktop.

NOTE: RFE 220772 on SourceForge
<http://sf.net/tracker/?func=detail&aid=220772&group_id=12997&atid=362997> 
has a related patch.

# The Default GUI Font

The Win API call _GetStockObject\(\)_ accesses brushes, pens, and
fonts which are pre-configured on the system.  The available fonts
are:

    1. ANSI\_FIXED\_FONT

    2. ANSI\_VAR\_FONT

    3. DEVICE\_DEFAULT\_FONT

    4. DEFAULT\_GUI\_FONT

    5. OEM\_FIXED\_FONT

    6. SYSTEM\_FIXED\_FONT

    7. SYSTEM\_FONT

The _TkStateMap systemMap_ in _tkWinFont.c_ listed all but one of
these, DEFAULT\_GUI\_FONT.  As it turns out, this is the most important
as it is the one that 'Windows uses as it's default font \(for example,
in Control Panel Applets\).

I propose to add DEFAULT\_GUI\_FONT to the _systemMap_ with a font
name of _defaultgui_ and to change CTL\_FONT in _tkWinDefault.h_
from _\{MS Sans Serif\} 8_ to _defaultgui_.  This will require a
change in documentation to list the new system font name but is
otherwise simple and painless.  Furthermore, it makes Tk GUIs look
right on W2k.

A reference implementation for this is available in patch 461442
\(referenced above\).

# Access to Desktop Fonts

The original implementation of _windefault_ as a new font, accessed
the message font from the NONCLIENTMETRICS structure.  While this is
not, in fact, the correct default GUI font, it is an important system
font, as are the others on the NONCLIENTMETRICS structure.  The
structure lists:

   *  Caption \(title bar\) font

   *  Small Caption \(palette title bar\) font

   *  Menu font

   *  Tooltip \(and status bar\) font

   *  Message box font 

The 'Windows Desktop Properties also include a font for icon labels on
the desktop.  This font is accessed with _SystemParametersInfo\(\)_.

I propose to add 6 desktop fonts as system fonts on Windows.  The
names would be derived from their Desktop Properties entries:
_dtIcon_, _dtTitleBar_, _dtMenu_, _dtMessageBox_,
_dtPaletteTitle_, _dtToolTip_.  The "dt" prefix associates the
fonts with the desktop properties.  \(Can or should font names have
internal capital letters?\)

We might also add synonyms which relate to the structure field names
and/or customary use of the font.  I'd propose adding _dtCaption_ as
equivalent to _dtTitleBar_, _dtSmallCaption_ as equivalent to
_dtPaletteTitle_, and _dtStatus_ as equivalent to _dtToolTip_.

A reference implementation for this is available in Patch \#461442
\(referenced above\) albeit with different font names.

# Dynamic fonts

Many 'Windows applications respond on-the-fly to changes in the desktop
fonts.  Tk responds to changes in Tk fonts via [font configure].  I
propose that Tk respond to the WM\_SETTINGCHANGE message from Windows
to propagate changes to the desktop fonts enumerated above as it
propagates changes to Tk fonts when they are reconfigured.  I have yet
to prototype these changes.

# Comments

_KBK_ wonders whether the dt\* fonts have logical counterparts on the
other platforms \(KDE, Gnome/Gtk, Macintosh, HP-VUE, ...\) and if
implementors on those platforms might want to try to mirror this
functionality.  Since nobody has commented, he assumes that they
at least do not find the idea objectionable.

# Copyright

This document has been placed in the public domain.

Name change from tip/65.tip to tip/65.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59

60
61

62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151

TIP:            65
Title:          Enhanced [info args]
Version:        $Revision: 1.6 $
Author:         Glenn Jackman <[email protected]>
Author:         Don Porter <[email protected]>
Author:         Glenn Jackman <[email protected]>
State:          Rejected
Type:           Project
Vote:           Done
Created:        18-Sep-2001
Post-History:   
Tcl-Version:    8.5

~ Abstract

This TIP proposes a new subcommand to the [[info]] command be added
that would return the list of arguments, together with any default
values in the same format as the ''args'' parameter to the [[proc]]
command.

~ Introduction

The [[proc]] man page defines ''args'' as:

  > ... the formal arguments to the procedure.  It consists of a list,
    possibly empty, each of whose elements specifies one argument.
    Each argument specifier is also a list with either one or two
    fields.  If there is only a single field in the specifier then it
    is the name of the argument; if there are two fields, then the
    first is the argument name and the second is its default value.

Suppose we define a procedure like this:

|       proc test {one {two 2} {three {3 4 5}} args} {return}

We want to determine the formal arguments for this procedure.  We want
some method to return the list:

|       one {two 2} {three {3 4 5}} args

[[info args]] fails us because it does not return default values, only
the list of argument names {one two three args}.

The [[info default]] command exists, and does partially what we want.
However [[info default]] only operates on a single argument.  To
determine the complete list of arguments with default values, we must
iterate over the arguments returned by [[info args]].  We would define
a procedure like:

|       proc info_args_with_defaults {procname} {
|           set argspec [list]
|           # [info args] throws an error if $procname is not a procedure.
|           foreach arg [info args $procname] {
|               if {[info default $procname $arg value]} {
|                   lappend argspec [list $arg $value]
|               } else {
|                   lappend argspec $arg
|               }
|           }

|           return $argspec
|       }

|       info_args_with_defaults test  ;# ==> returns {one {two 2} {three {3 4 5}} args}

A more sophisticated scripted solution is to overload the [[info]]
command itself, as described in the Wiki at
http://wiki.tcl.tk/wrappingCommands

It would be much more convenient to be able to rely on the [[info]]
command itself to return the desired information, particularly since
it ''almost'' does what we want already.

''This topic was originally raised in the news:comp.lang.tcl newsgroup
in the thread http://groups.google.com/groups?th=4b0d5dba85d2c160''

~ Specification

Add [[info formalargs]] to the set of subcommands for
Tcl's built-in [[info]] command, with syntax:

| info formalargs $procName

This command will raise an error if ''$procName'' is not the
name of a proc.  Otherwise, it will return a list of formal
arguments of the named proc, along with their default values,
if any, in a format suitable for passing to the [[proc]] command
as a second argument.

~ Rationale

With the goal of maintaining backwards compatibility in mind, two
possibilities arise: adding a new switch to the existing [[info args]]
command, and adding a completely new subcommand to [[info]].

Adding a switch to [[info args]] may break backwards compatibility.
If we use the syntax [[info args ''?-withdefaults? procname'']], there
may be trouble with existing scripts containing a procedure named
"-withdefaults".  The syntax [[info args ''procname
?-withdefaults?'']] is completely backwards compatible.  However,
among Tcl commands that take subcommands, there is currently some
inconsistency as to where switches should appear.  [[clock]]
subcommands place these options after required parameters.
[[namespace]] and [[package]] subcommands place these options before
required parameters.  Some [[file]] subcommands put them before, some
after.  Currently, no [[info]] subcommands take switches.

Rather than compound to this inconsistency, creating a new [[info]]
subcommand feels cleaner.  Possible names include:

 argspec, arglist, args_with_defaults:	These all collide with the
    "arg", "ar", "a" shorthands for [[info args ''procname'']].  And
    ''args_with_defaults'' is just *way* too ugly.

 formalargs, fullargs:	Either of these could be used.  This collides
    with the "f" shorthand for [[info functions]]

 parameters:	This collides with the "pa" shorthand for [[info
    patchlevel]]

 prototype:	This collides with the "pro" and "pr" shorthands for
    [[info procs ''?pattern?'']]

 signature:	This could be used, as it does not collide with any
    shorthand for either [[info script]] or [[info sharedlibextension]].

The term "signature" has meaning in the Java and C++ worlds: the
function name and its arguments together comprise the signature.  The
purpose of this TIP is to return only the arguments with any defaults,
so to avoid any potential confusion I will rule out "signature".

Of the remaining possibilities, my choice would be "formalargs".  The
term "formal arguments" is used in the [[proc]] man page.
"formalargs" also incorporates the word "args", indicating a
relationship to [[info args]].

~ Reference Implementation

Refer to the submitted patch, which implements an subcommand named
[[info fullargs]], at:
http://sourceforge.net/tracker/index.php?func=detail&aid=461635&group_id=10894&atid=310894

~ Reasons for Rejection

Those voting against this proposal believed that since the desired
functionality is already possible with a short script of just a few
Tcl commands, it would be unnecessary bloat to add another subcommand.
Some also pointed to [112] as another approach to letting people extend
Tcl built-in commands with their own custom subcommands.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|
|

|
|

|

|
|
|
|
|
|
|
|
<
<
>
>
|
<
>
|

|

|

|

|

|
|

|

|
|

|

|

|

|

|
|

|
|

|
|

|

|
|
|

|

|
|
|

|

|
|

|

|

|

|

|

|

|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56

57
58
59

60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151

# TIP 65: Enhanced [info args]

	Author:         Glenn Jackman <[email protected]>
	Author:         Don Porter <[email protected]>
	Author:         Glenn Jackman <[email protected]>
	State:          Rejected
	Type:           Project
	Vote:           Done
	Created:        18-Sep-2001
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

This TIP proposes a new subcommand to the [info] command be added
that would return the list of arguments, together with any default
values in the same format as the _args_ parameter to the [proc]
command.

# Introduction

The [proc] man page defines _args_ as:

  > ... the formal arguments to the procedure.  It consists of a list,
    possibly empty, each of whose elements specifies one argument.
    Each argument specifier is also a list with either one or two
    fields.  If there is only a single field in the specifier then it
    is the name of the argument; if there are two fields, then the
    first is the argument name and the second is its default value.

Suppose we define a procedure like this:

	       proc test {one {two 2} {three {3 4 5}} args} {return}

We want to determine the formal arguments for this procedure.  We want
some method to return the list:

	       one {two 2} {three {3 4 5}} args

[info args] fails us because it does not return default values, only
the list of argument names \{one two three args\}.

The [info default] command exists, and does partially what we want.
However [info default] only operates on a single argument.  To
determine the complete list of arguments with default values, we must
iterate over the arguments returned by [info args].  We would define
a procedure like:

	       proc info_args_with_defaults {procname} {
	           set argspec [list]
	           # [info args] throws an error if $procname is not a procedure.
	           foreach arg [info args $procname] {
	               if {[info default $procname $arg value]} {
	                   lappend argspec [list $arg $value]
	               } else {
	                   lappend argspec $arg

	               }
	           }
	           return $argspec

	       }
	       info_args_with_defaults test  ;# ==> returns {one {two 2} {three {3 4 5}} args}

A more sophisticated scripted solution is to overload the [info]
command itself, as described in the Wiki at
<http://wiki.tcl.tk/wrappingCommands>

It would be much more convenient to be able to rely on the [info]
command itself to return the desired information, particularly since
it _almost_ does what we want already.

_This topic was originally raised in the news:comp.lang.tcl newsgroup
in the thread <http://groups.google.com/groups?th=4b0d5dba85d2c160_>

# Specification

Add [info formalargs] to the set of subcommands for
Tcl's built-in [info] command, with syntax:

	 info formalargs $procName

This command will raise an error if _$procName_ is not the
name of a proc.  Otherwise, it will return a list of formal
arguments of the named proc, along with their default values,
if any, in a format suitable for passing to the [proc] command
as a second argument.

# Rationale

With the goal of maintaining backwards compatibility in mind, two
possibilities arise: adding a new switch to the existing [info args]
command, and adding a completely new subcommand to [info].

Adding a switch to [info args] may break backwards compatibility.
If we use the syntax [info args _?-withdefaults? procname_], there
may be trouble with existing scripts containing a procedure named
"-withdefaults".  The syntax [info args _procname
?-withdefaults?_] is completely backwards compatible.  However,
among Tcl commands that take subcommands, there is currently some
inconsistency as to where switches should appear.  [clock]
subcommands place these options after required parameters.
[namespace] and [package] subcommands place these options before
required parameters.  Some [file] subcommands put them before, some
after.  Currently, no [info] subcommands take switches.

Rather than compound to this inconsistency, creating a new [info]
subcommand feels cleaner.  Possible names include:

 argspec, arglist, args\_with\_defaults:	These all collide with the
    "arg", "ar", "a" shorthands for [info args _procname_].  And
    _args\_with\_defaults_ is just \*way\* too ugly.

 formalargs, fullargs:	Either of these could be used.  This collides
    with the "f" shorthand for [info functions]

 parameters:	This collides with the "pa" shorthand for [info
    patchlevel]

 prototype:	This collides with the "pro" and "pr" shorthands for
    [info procs _?pattern?_]

 signature:	This could be used, as it does not collide with any
    shorthand for either [info script] or [info sharedlibextension].

The term "signature" has meaning in the Java and C\+\+ worlds: the
function name and its arguments together comprise the signature.  The
purpose of this TIP is to return only the arguments with any defaults,
so to avoid any potential confusion I will rule out "signature".

Of the remaining possibilities, my choice would be "formalargs".  The
term "formal arguments" is used in the [proc] man page.
"formalargs" also incorporates the word "args", indicating a
relationship to [info args].

# Reference Implementation

Refer to the submitted patch, which implements an subcommand named
[info fullargs], at:
<http://sourceforge.net/tracker/index.php?func=detail&aid=461635&group\_id=10894&atid=310894>

# Reasons for Rejection

Those voting against this proposal believed that since the desired
functionality is already possible with a short script of just a few
Tcl commands, it would be unnecessary bloat to add another subcommand.
Some also pointed to [[112]](112.md) as another approach to letting people extend
Tcl built-in commands with their own custom subcommands.

# Copyright

This document has been placed in the public domain.

Name change from tip/66.tip to tip/66.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
TIP:            66
Title:          Stand-alone and Embedded Tcl/Tk Applications
Version:        $Revision: 1.15 $
Author:         Arjen Markus <[email protected]>
State:          Draft
Type:           Informative
Vote:           Pending
Created:        02-Oct-2001
Post-History:   
Keywords:       installation,initialisation,embedded,resources

~ Abstract

This TIP describes the development and deployment of Tcl/Tk
applications, with particular attention on how to ''embed'' the
interpreter into executables written in C or C++.

~ Introduction and Background

Usually, an application that uses Tcl/Tk in some way uses an
independent installation and the application itself is started via a
standard shell, like ''tclsh'' or ''wish''.  There are numerous
occasions when such a set-up is not convenient:

 * Installation of external software is not allowed unless the IT
   department at the client's site consents - a very reasonable
   approach to the uncountable problems that occur due to conflicting
   software in modern computing environments.

<
|
<
|
|
|
|
|
|
|
>

|

|
|

|

|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28

# TIP 66: Stand-alone and Embedded Tcl/Tk Applications

	Author:         Arjen Markus <[email protected]>
	State:          Draft
	Type:           Informative
	Vote:           Pending
	Created:        02-Oct-2001
	Post-History:   
	Keywords:       installation,initialisation,embedded,resources
-----

# Abstract

This TIP describes the development and deployment of Tcl/Tk
applications, with particular attention on how to _embed_ the
interpreter into executables written in C or C\+\+.

# Introduction and Background

Usually, an application that uses Tcl/Tk in some way uses an
independent installation and the application itself is started via a
standard shell, like _tclsh_ or _wish_.  There are numerous
occasions when such a set-up is not convenient:

 * Installation of external software is not allowed unless the IT
   department at the client's site consents - a very reasonable
   approach to the uncountable problems that occur due to conflicting
   software in modern computing environments.

︙ ︙ 
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
    application using one of the commercial tools that are available
    for this arcane job, we ran into a bizarre limitation: text
    replacement was possible for the so-called Windows INI-files only,
    but not for other types of files.  The text to be replaced was the
    name of the installation directory.  After several trials with the
    programming constructs the tool allowed, we chose a much better
    solution: a small Tcl script wrapped into a stand-alone program
    using Freewrap.  (The application itself now actually uses another
    stand-alone Tcl script to take care of the file management that
    was too complicated for ordinary DOS batch files.)

 2. The second example involves a small program that proves the
    usefulness of Tcl/Tk in on-line visualisation.  The idea there is
    that large computational programs can send their data at regular
    steps during the computation to a separate program that plots
    these results in some meaningful way.  To achieve this the program
    exports the results to the Tcl interpreter which uses the socket
    command to send them to a (primitive) viewer.  For demonstration
    purposes you must be able to copy the program along with some
    files it needs on an arbitrary computer and, later, remove it with
    just a little effort.

Applications that use Tcl/Tk as an embedded library to achieve their
goals, rather than exist as extensions or applications written in Tcl,
can be quite useful.  Examples include on-line visualisation in large

|

|

|

47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
    application using one of the commercial tools that are available
    for this arcane job, we ran into a bizarre limitation: text
    replacement was possible for the so-called Windows INI-files only,
    but not for other types of files.  The text to be replaced was the
    name of the installation directory.  After several trials with the
    programming constructs the tool allowed, we chose a much better
    solution: a small Tcl script wrapped into a stand-alone program
    using Freewrap.  \(The application itself now actually uses another
    stand-alone Tcl script to take care of the file management that
    was too complicated for ordinary DOS batch files.\)

 2. The second example involves a small program that proves the
    usefulness of Tcl/Tk in on-line visualisation.  The idea there is
    that large computational programs can send their data at regular
    steps during the computation to a separate program that plots
    these results in some meaningful way.  To achieve this the program
    exports the results to the Tcl interpreter which uses the socket
    command to send them to a \(primitive\) viewer.  For demonstration
    purposes you must be able to copy the program along with some
    files it needs on an arbitrary computer and, later, remove it with
    just a little effort.

Applications that use Tcl/Tk as an embedded library to achieve their
goals, rather than exist as extensions or applications written in Tcl,
can be quite useful.  Examples include on-line visualisation in large

︙ ︙ 
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
   environment?

 * What you can and can not do with a bare interpreter?

 * How to enhance its capabilities, such that it works as in an
   ordinary Tcl shell?

 * What (binary and script) libraries are required?

 * How to deal with other programming languages than C/C++?

 * How to create applications that can be installed without an
   independent Tcl installation?

~ Related TIPs and Discussions

The are several TIPs at the moment of this writing that are in some way
related to the subject:

 * [4] proposes to outline the release and distribution philosophy, so
   that it becomes easy to include generally useful extensions - the
   so-called "batteries included".

 * [12] focuses completely on the "batteries included" aspect of the
   source distribution.

 * [34] is intended to solve some of the more awkward issues of TEA,
   as the current build system actually requires separate versions for
   UNIX and Windows.

 * [55] defines the set-up of packages that can be automatically
   installed into an existing installation.

 * Postings on the news:comp.lang.tcl newsgroup frequently involve how
   to embed Tcl into a C application, with an emphasis on loading
   packages and the use of the ''Tcl_Init()'' function.

 * Recently discussions have been held about supporting programming
   languages other than C.  Notably: Pascal, FORTRAN, Visual Basic.

~ Contents of the Planned Document

The document that should help programmers with the issues discussed here
will have the following (tentative) table of contents:

 * Introduction, outlining its purpose.

 * Tcl's bootstrap procedure, describing how the usual shells work.

 * Creating interpreters, what a bare interpreter can and can not do,
   how to enrich it via start-up scripts like ''init.tcl''.

 * Compiling and linking, the usual issues surrounding the making of a
   binary executable.

 * Interfacing to other programming languages, though possibly a huge
   subject, it will present some guidelines, both practical
   implementation and design issues.

 * Installation and deployment, should inform about the external
   resources (environment variables, libraries, etc) for the
   application.

 * Overview, provide a checklist of the various possibilities and how
   to achieve them, with pointers for further information.

 * Literature, all the good books and other references.

----

~ Discussion

Issues that arise are:

 * what is the simplest way to embed Tcl,

 * what resources are needed (in terms of script and binary libraries)
   by such an application,

 * how can the application find everything it needs?

This TIP is meant to be a document that enables programmers who do not
have intimate knowledge of the Tcl core to build such application and
deploy them in the way they want.

Should it turn out that some automated tool would be nice to help the
programmers, then this TIP will also cover such a tool.

----

~ Using the Tcl library

There are numerous ways an application written mainly in a language like
C can use the Tcl and Tk libraries (in short: Tcl):

   * The application can simply use Tcl as a convenient library of
     C routines. In that case, Tcl would provide such facilities as
     regular expressions or channels.

   * The application can use Tcl as a scripting tool, that is, it
     will call Tcl to evaluate scripts and import the results.

   * The application can use Tcl in a more complicated mixture:
     Tcl scripts get evaluated that require binary extensions
     (both defined outside the application and as an integral
     part of the application).

   * An application that uses Tcl need not be written in C, but
     could be written in any programming language that allows
     calls to and from C routines directly or indirectly.

''Note:'' due to the fact that the author is mostly familiar with the
UNIX/LINUX and Windows platforms, no comments will be made about the
Macintosh. This is completely due to ignorance, not to arrogance.

In principle, using the Tcl/Tk libraries is very simple: just create
a Tcl interpreter, fill it with variables, commands and so on and
feed it scripts, either as a file or as a string. It gets
more complicated in the following situations:

   * The interpreter must be able to handle packages and interact
     with the environment in much the same way as tclsh or wish.

   * The application needs to intermix its own processing with Tcl
     event loops (such as continuing a calculation while a Tk window
     shows the progress).

   * It must be possible to use the application independently from
     a full Tcl installation.

The key to a successful implementation is: understanding how to
properly initialise Tcl.

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|
|

82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
   environment?

 * What you can and can not do with a bare interpreter?

 * How to enhance its capabilities, such that it works as in an
   ordinary Tcl shell?

 * What \(binary and script\) libraries are required?

 * How to deal with other programming languages than C/C\+\+?

 * How to create applications that can be installed without an
   independent Tcl installation?

# Related TIPs and Discussions

The are several TIPs at the moment of this writing that are in some way
related to the subject:

 * [[4]](4.md) proposes to outline the release and distribution philosophy, so
   that it becomes easy to include generally useful extensions - the
   so-called "batteries included".

 * [[12]](12.md) focuses completely on the "batteries included" aspect of the
   source distribution.

 * [[34]](34.md) is intended to solve some of the more awkward issues of TEA,
   as the current build system actually requires separate versions for
   UNIX and Windows.

 * [[55]](55.md) defines the set-up of packages that can be automatically
   installed into an existing installation.

 * Postings on the news:comp.lang.tcl newsgroup frequently involve how
   to embed Tcl into a C application, with an emphasis on loading
   packages and the use of the _Tcl\_Init\(\)_ function.

 * Recently discussions have been held about supporting programming
   languages other than C.  Notably: Pascal, FORTRAN, Visual Basic.

# Contents of the Planned Document

The document that should help programmers with the issues discussed here
will have the following \(tentative\) table of contents:

 * Introduction, outlining its purpose.

 * Tcl's bootstrap procedure, describing how the usual shells work.

 * Creating interpreters, what a bare interpreter can and can not do,
   how to enrich it via start-up scripts like _init.tcl_.

 * Compiling and linking, the usual issues surrounding the making of a
   binary executable.

 * Interfacing to other programming languages, though possibly a huge
   subject, it will present some guidelines, both practical
   implementation and design issues.

 * Installation and deployment, should inform about the external
   resources \(environment variables, libraries, etc\) for the
   application.

 * Overview, provide a checklist of the various possibilities and how
   to achieve them, with pointers for further information.

 * Literature, all the good books and other references.

----

# Discussion

Issues that arise are:

 * what is the simplest way to embed Tcl,

 * what resources are needed \(in terms of script and binary libraries\)
   by such an application,

 * how can the application find everything it needs?

This TIP is meant to be a document that enables programmers who do not
have intimate knowledge of the Tcl core to build such application and
deploy them in the way they want.

Should it turn out that some automated tool would be nice to help the
programmers, then this TIP will also cover such a tool.

----

# Using the Tcl library

There are numerous ways an application written mainly in a language like
C can use the Tcl and Tk libraries \(in short: Tcl\):

   * The application can simply use Tcl as a convenient library of
     C routines. In that case, Tcl would provide such facilities as
     regular expressions or channels.

   * The application can use Tcl as a scripting tool, that is, it
     will call Tcl to evaluate scripts and import the results.

   * The application can use Tcl in a more complicated mixture:
     Tcl scripts get evaluated that require binary extensions
     \(both defined outside the application and as an integral
     part of the application\).

   * An application that uses Tcl need not be written in C, but
     could be written in any programming language that allows
     calls to and from C routines directly or indirectly.

_Note:_ due to the fact that the author is mostly familiar with the
UNIX/LINUX and Windows platforms, no comments will be made about the
Macintosh. This is completely due to ignorance, not to arrogance.

In principle, using the Tcl/Tk libraries is very simple: just create
a Tcl interpreter, fill it with variables, commands and so on and
feed it scripts, either as a file or as a string. It gets
more complicated in the following situations:

   * The interpreter must be able to handle packages and interact
     with the environment in much the same way as tclsh or wish.

   * The application needs to intermix its own processing with Tcl
     event loops \(such as continuing a calculation while a Tk window
     shows the progress\).

   * It must be possible to use the application independently from
     a full Tcl installation.

The key to a successful implementation is: understanding how to
properly initialise Tcl.

︙ ︙ 
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306

307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381

382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590

591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652

and the output.

Let us assume that the application is written in some convenient
programming language like C. The reasons for using Tcl are:

   * Flexible input routines

   > By using the scripting capabilities of Tcl one can easily
     adapt the program to the input file or files that it should
     read.

   * Flexible output routines

   > Again, the scripting capabilities allow adapting the output
     to the customer's wishes, without having to recompile and
     link it. This can be done for simple files on disk, but
     also graphical output or storage in a database is possible,
     ''without changing the program itself''.

~ The simplest way: create a bare interpreter

With the Tcl routine Tcl_CreateInterp() you can create an interpreter
that is capable of all the basic commands:

|  Tcl_Interp * interp         ;
|  char       * input_filename ;
|  char       * buffer         ;
|  double       x, y, z        ;
|
|  /* Create the interp, use it to read the given input file,
|     Note:
|     Using the string API for simplicity, no error checking
|  */
|  interp = Tcl_CreateInterp() ;
|  Tcl_SetVar( interp, "input_file", input_filename, TCL_GLOBAL_ONLY ) ;
|  Tcl_EvalFile( interp, startup_script ) ;
|
|  /* Extract the input data
|  */
|  buffer = Tcl_GetVar( interp, "x", TCL_GLOBAL_ONLY ) ;
|  Tcl_GetDouble( interp, buffer, &x ) ;
|  buffer = Tcl_GetVar( interp, "y", TCL_GLOBAL_ONLY ) ;
|  Tcl_GetDouble( interp, buffer, &y ) ;
|  buffer = Tcl_GetVar( interp, "z", TCL_GLOBAL_ONLY ) ;
|  Tcl_GetDouble( interp, buffer, &z ) ;
|
|  /* Destroy the interp - if you do not need it any longer
|  */
|  Tcl_DestroyInterp( interp ) ;

The output routine contains a similar fragment (note, we assume
the Tcl interpreter was stored somewhere):

|  Tcl_Interp * interp                   ;
|  char       * output_filename          ;
|  char         buffer[TCL_DOUBLE_SPACE] ;
|  double       a, b                     ;
|
|  /* Export the results to the interpreter
|  */
|  Tcl_PrintDouble( interp, a, buffer ) ;
|  Tcl_SetVar( interp, "a", buffer, TCL_GLOBAL_ONLY ) ;
|  Tcl_PrintDouble( interp, b, buffer ) ;
|  Tcl_SetVar( interp, "b", buffer, TCL_GLOBAL_ONLY ) ;
|
|  Tcl_SetVar( interp, "output_file", input_filename, TCL_GLOBAL_ONLY ) ;
|  Tcl_EvalFile( interp, report_script ) ;
|

To add error checking (always do!), use code like this:

|
|  Tcl_Channel errChannel ;
|
|  if ( Tcl_EvalFile( ... ) != TCL_OK ) {
|     errChannel = Tcl_GetStdChannel( TCL_STDERR ) ;
|     if ( errChannel != NULL ) {
|        TclWriteObj( errChannel, Tcl_GetObjResult(interp) ) ;
|        TclWriteChars( errChannel, "\n", -1 ) ;
|        ... /* Quit the program or other error handling? */
|     }
|  }

|

With this approach you need only to worry about the Tcl binary
libraries: if the dynamic versions are linked to your application,
then distribution of your application should include these binaries.
If, on the other hand the static versions are used, your application
already contains all of Tcl it needs all by itself.

The limitations of this approach are:

   * The utilities ordinarily defined via the Tcl initialisation
     script ''init.tcl'' are not available. (Note that these
     include such procedures as ''tclPkgSetup'' and ''unknown'')

   * The Tcl variables ''argc'', ''argv'' and ''argv0'' are not set.
     This may be problematic if you want to use these variables to
     communicate with the user, e.g. provide an initial script file
     on the command-line.

   * Character encodings are not available. This will limit your
     application to ASCII characters.

~ Complete initialisation: the role of init.tcl

The next section outlines the full initialisation procedure that is used
in the standard ''tclsh'' shell. This section concentrates instead on
some practical observations:

   * The routine Tcl_FindExecutable() does a lot more than its name
     suggests: it is responsible for initialising the various subsystems
     in a controlled way, it will find all the character encodings.

   > It has to be called very early, before creating an interpreter.
     The results are stored in private variables that are used for
     all threads.

   > If it can not find the executable, no harm is done: it will
     have initialised the subsystems anyway.

   * The routine Tcl_Init() is responsible for setting up the
     script library by evaluating the script ''init.tcl''. It should be
     called after the creation of an interpreter, to add the various
     commands to it.

   > If it can not find this script, it will return with an error.

   > (The routine actually has two additional hooks to allow
     customisation, but these will probably be used in unusual
     circumstances only.)

The script ''init.tcl'' and any it sources (directly or indirectly
via auto_load) must be found via the ''tcl_library'' variable.
On UNIX this variable is initialised via the ''TCL_LIBRARY'' environment
variable is used, whereas on MS Windows the pathname of the Tcl DLLs
is used as well.

As long as these scripts can be found, they can actually reside in a
large number of directories with names related to the Tcl library path.

This leads to the following code to create a full-fledged interpreter:

|  Tcl_Interp * interp         ;
|
|  /* Initialise the Tcl library thouroughly
|  */
|  Tcl_FindExecutable( argv[0] ) ;
|
|  /* Create the interp, evaluate "init.tcl" for the script
|     level initialisation.
|  */
|  interp = Tcl_CreateInterp() ;
|
|  if ( Tcl_Init( interp ) != TCL_OK ) {
|     ... Report the error
|  }

|

With ''init.tcl'' loaded, we have a number of additional commands and
global variables:

   * tclLog, unknown, auto_load, auto_execok are the most important
     ones.

   * auto_path, errorInfo, errorCode

To create an interpreter that can handle Tk as well, you should be
aware of the following:

   * Tk-able interpreters always need to be initialised via Tk_Init()
     and therefore require the start-up scripts: these scripts contain
     the default bindings and resource definitions and are therefore
     indispensable for Tk.

   * An application written using Tk needs to process events in a
     well-defined event loop.

TODO: how to write the event loop, what choices are available?

~ Initialisation via the standard shell

The details of the initialisation done in the standard tclsh shell
are quite intricate. They involve, in addition to the initialisation via
Tcl_FindExecutable() and Tcl_Init() also:

   * processing the command-line arguments

   * customisation via various hooks

   * preparing the Tcl parser by setting the locale to "C", as only
     this guarantees everything works as expected.

A summary of the steps found in the initialisation code is given below:

   * main() is a system-dependent routine which:

   > * sets the locale (Windows version)

   > * parses the command-line according to the UNIX rules (Windows
       version)

   > * calls Tcl_Main(), which is not supposed to return

   * Tcl_Main() takes as arguments the well-known ''argc/argv''
     command-line arguments and a pointer to the initialisation routine,
     which in the case of ''tclsh'' is Tcl_AppInit():

   > * After calling Tcl_FindExecutable(), processing the
       command-line arguments and calling the initialisation routine,
       it can do either of two things:

   > > * Evaluate the script file, if the first argument does not
         start with a minus sign

   > > * Or go into an interactive loop to read the commands from
         the prompt. The preparation in that case is to evaluate
         the start-up script (such as ~/.tclshrc or ~/tclshrc.tcl)

   > * It exits by evaluating the Tcl "exit" command, not by calling
       the C routine ''exit()'' directly

   * The standard initialisation routine Tcl_AppInit() is meant to
     initialise the various application-specific commands and static
     packages via routines like Tcl_CreateCommand(). It also sets the
     Tcl variable "tcl_rcFile" to the user's start-up script.

   > (Curiously, the standard routine is found in a platform-dependent
     source file, tclXXXInit.c)

   * Tcl_Init() by the way provides two hooks for customisation:

   > * A pre-initialisation script that gets evaluated when the
       static variable "tclPreInitScript" has been set.

   > * The initScript variable that defines a Tcl procedure that
       looks up the ''init.tcl'' script.

Thus, before the shell is ready for processing, a lot of initialisation
is done. Much of this process can be customised without the need to
change the standard source files.

~ Overview

This section provides an overview of the resources that an application
requires, given the type of usage:

''Bare Tcl only interpreter:''

   * Just the Tcl dynamic libraries

''Complete initialisation for Tcl only:''

   * The Tcl dynamic libraries

   * The environment variable TCL_LIBRARY

   * The initialisation script file ''init.tcl''

   * The character encoding tables (optional)

''Customised Tcl shell (adapted Tcl_AppInit()):''

   * The Tcl dynamic libraries

   * The environment variable TCL_LIBRARY

   * The initialisation script file ''init.tcl''

   * The character encoding tables (optional)

   * Possibly a so-called RC file to define the initialisation for
     interactive use

''Customised Tk shell (wish; adapted Tk_AppInit()):''

   * The Tcl and Tk dynamic libraries

   * The environment variables TCL_LIBRARY and TK_LIBRARY

   * The initialisation script file ''init.tcl'', and the Tk specific
     bindings (in ''tk.tcl'' and others)

   * The character encoding tables (optional)

   * Possibly a so-called RC file to define the initialisation for
     interactive use

Equally important are the limitations:

''Bare Tcl only interpreter:''

   * No customisable initialisation (not automatically)

   * No access to the command-line arguments or the directory
     that contains the executable

   * No alternative character encodings

   * Possibly problems loading packages, as the auxiliary procedures
     for this are defined in ''init.tcl'' and others.

   * Possibly problems with the locale (best to explicitly set it to
     "C")

   * No interactive use

''Complete initialisation for Tcl only:''

   * Possibly problems with the locale (best to explicitly set it to
     "C")

   * No interactive use

''Customised Tcl shell (adapted Tcl_AppInit()):''

   * None

''Customised Tk shell (wish; adapted Tk_AppInit()):''

   * None

----

~ Compiling and linking

Nowadays, it seems the default to use dynamic or shared libraries. So,
with many installations, there will exist dynamic versions of the
libraries and sometimes there will be no static versions. This has a
number of advantages:

   * The executables are much smaller, the memory usage can be smaller
     as well, as the code will be shared.

   * The libraries can be replaced without the need to rebuild the
     application. This is especially true if you enable the use of
     ''stubs'' for your binary packages (see below).

However, as the Tcl libraries now reside outside your application, they
will have to be shipped with the application and the dynamic loader must
somehow be able to find the libraries. The latter certainly has
consequences: each system tends to have its own method.

When you have the Tcl/Tk sources, you can decide to create your own
libraries. Of special interest are the following two situations:

   * You want to be able to use the ''stubs'' facility, as this makes
     it possible to run with different versions of Tcl/Tk with the same
     binary.

   * You want to get rid of as much extra stuff outside your application
     as possible, so you want to use the static version of the Tcl
     libraries.

''Stubs'' were introduced to make binary extensions and applications
independent of the specific Tcl version. They are enabled by defining
the macro ''TCL_USE_STUBS'' during the compilation and linking
of the Tcl/Tk library and especially your own extension.

In the initialisation procedure for your pacakge or application you need
to initialise the stubs jump table via ''Tcl_InitStubs()'':

| #ifdef USE_TCL_STUBS
|    if (Tcl_InitStubs(interp, "8.1", 0) == NULL) {
|       return TCL_ERROR;
|    }

| #endif

(details: [http://mini.net/tcl/1687.html])

The technique, as Brent Welch explains, is simple in principle:

   > By enabling stubs, all calls to Tcl routines are turned into
     function pointers. These pointers are kept in a large table that
     is filled with the correct pointer values via the Tcl_InitStubs()
     routine.

Linking your application or extension should then be done against the
"stub version" of the Tcl/Tk libraries.

If you do not want dynamic libraries, then perhaps a build with the
option ''STATIC_BUILD'' is a solution. With this option, static
libraries are built. The libraries are then incorporated into the
executable itself.

''Note:'' On some platforms, notably Windows, the specific
calling convention is then turned to standard C (with dynamic libraries,
the calling convention exports the various routines explicitly).

When you do not care about the dynamic libraries having to be present,
at least be aware of the way the various systems want to define their
position.

The information above is summarised as follows:

''Using dynamic libraries:''

   * Most UNIX versions and LINUX use the environment
     variable LD_LIBRARY_PATH, colon-separated just like ''PATH''
     to indicate the position of dynamic libraries.

   * Some use the variable SHLIB_PATH instead (notably: HPUX).

   * Under Windows (all flavours) the PATH variable is used and a
     predefined sequence of directories to find the DLL's. One important
     case is that the libraries are found in the same directory as
     the executable.

''Building for general Tcl versions:''

   * Compile your sources with the macro ''TCL_USE_STUBS''

   * Use the proper call to Tcl_InitStubs() to initialise the
     jump table.

   * Link against the stub versions of the Tcl/Tk libraries.

''Building statically:''

   * Use the flag STATIC_BUILD to build the static Tcl/Tk libraries.

   * Use this flag for your own sources as well

   * Link against the static versions.

~ Copyright

This document is placed in the public domain.

|

|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|
|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|

|
|
|
|
|
|
|
|
|
<
<
>
>
|

|
|

|

|

|

|

|

|

|
|

|

|

|

|
|
|

|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|
|

|

|
|

|
|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|
|

|

|
|

|

|

|

|

|

|

|

|

|
|
|
<
>
|

|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

>
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303

304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379

380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588

589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
and the output.

Let us assume that the application is written in some convenient
programming language like C. The reasons for using Tcl are:

   * Flexible input routines

	   > By using the scripting capabilities of Tcl one can easily
     adapt the program to the input file or files that it should
     read.

   * Flexible output routines

	   > Again, the scripting capabilities allow adapting the output
     to the customer's wishes, without having to recompile and
     link it. This can be done for simple files on disk, but
     also graphical output or storage in a database is possible,
     _without changing the program itself_.

# The simplest way: create a bare interpreter

With the Tcl routine Tcl\_CreateInterp\(\) you can create an interpreter
that is capable of all the basic commands:

	  Tcl_Interp * interp         ;
	  char       * input_filename ;
	  char       * buffer         ;
	  double       x, y, z        ;

	  /* Create the interp, use it to read the given input file,
	     Note:
	     Using the string API for simplicity, no error checking
	  */
	  interp = Tcl_CreateInterp() ;
	  Tcl_SetVar( interp, "input_file", input_filename, TCL_GLOBAL_ONLY ) ;
	  Tcl_EvalFile( interp, startup_script ) ;

	  /* Extract the input data
	  */
	  buffer = Tcl_GetVar( interp, "x", TCL_GLOBAL_ONLY ) ;
	  Tcl_GetDouble( interp, buffer, &x ) ;
	  buffer = Tcl_GetVar( interp, "y", TCL_GLOBAL_ONLY ) ;
	  Tcl_GetDouble( interp, buffer, &y ) ;
	  buffer = Tcl_GetVar( interp, "z", TCL_GLOBAL_ONLY ) ;
	  Tcl_GetDouble( interp, buffer, &z ) ;

	  /* Destroy the interp - if you do not need it any longer
	  */
	  Tcl_DestroyInterp( interp ) ;

The output routine contains a similar fragment \(note, we assume
the Tcl interpreter was stored somewhere\):

	  Tcl_Interp * interp                   ;
	  char       * output_filename          ;
	  char         buffer[TCL_DOUBLE_SPACE] ;
	  double       a, b                     ;

	  /* Export the results to the interpreter
	  */
	  Tcl_PrintDouble( interp, a, buffer ) ;
	  Tcl_SetVar( interp, "a", buffer, TCL_GLOBAL_ONLY ) ;
	  Tcl_PrintDouble( interp, b, buffer ) ;
	  Tcl_SetVar( interp, "b", buffer, TCL_GLOBAL_ONLY ) ;

	  Tcl_SetVar( interp, "output_file", input_filename, TCL_GLOBAL_ONLY ) ;
	  Tcl_EvalFile( interp, report_script ) ;

To add error checking \(always do!\), use code like this:

	  Tcl_Channel errChannel ;

	  if ( Tcl_EvalFile( ... ) != TCL_OK ) {
	     errChannel = Tcl_GetStdChannel( TCL_STDERR ) ;
	     if ( errChannel != NULL ) {
	        TclWriteObj( errChannel, Tcl_GetObjResult(interp) ) ;
	        TclWriteChars( errChannel, "\n", -1 ) ;
	        ... /* Quit the program or other error handling? */

	     }
	  }

With this approach you need only to worry about the Tcl binary
libraries: if the dynamic versions are linked to your application,
then distribution of your application should include these binaries.
If, on the other hand the static versions are used, your application
already contains all of Tcl it needs all by itself.

The limitations of this approach are:

   * The utilities ordinarily defined via the Tcl initialisation
     script _init.tcl_ are not available. \(Note that these
     include such procedures as _tclPkgSetup_ and _unknown_\)

   * The Tcl variables _argc_, _argv_ and _argv0_ are not set.
     This may be problematic if you want to use these variables to
     communicate with the user, e.g. provide an initial script file
     on the command-line.

   * Character encodings are not available. This will limit your
     application to ASCII characters.

# Complete initialisation: the role of init.tcl

The next section outlines the full initialisation procedure that is used
in the standard _tclsh_ shell. This section concentrates instead on
some practical observations:

   * The routine Tcl\_FindExecutable\(\) does a lot more than its name
     suggests: it is responsible for initialising the various subsystems
     in a controlled way, it will find all the character encodings.

	   > It has to be called very early, before creating an interpreter.
     The results are stored in private variables that are used for
     all threads.

	   > If it can not find the executable, no harm is done: it will
     have initialised the subsystems anyway.

   * The routine Tcl\_Init\(\) is responsible for setting up the
     script library by evaluating the script _init.tcl_. It should be
     called after the creation of an interpreter, to add the various
     commands to it.

	   > If it can not find this script, it will return with an error.

	   > \(The routine actually has two additional hooks to allow
     customisation, but these will probably be used in unusual
     circumstances only.\)

The script _init.tcl_ and any it sources \(directly or indirectly
via auto\_load\) must be found via the _tcl\_library_ variable.
On UNIX this variable is initialised via the _TCL\_LIBRARY_ environment
variable is used, whereas on MS Windows the pathname of the Tcl DLLs
is used as well.

As long as these scripts can be found, they can actually reside in a
large number of directories with names related to the Tcl library path.

This leads to the following code to create a full-fledged interpreter:

	  Tcl_Interp * interp         ;

	  /* Initialise the Tcl library thouroughly
	  */
	  Tcl_FindExecutable( argv[0] ) ;

	  /* Create the interp, evaluate "init.tcl" for the script
	     level initialisation.
	  */
	  interp = Tcl_CreateInterp() ;

	  if ( Tcl_Init( interp ) != TCL_OK ) {
	     ... Report the error

	  }

With _init.tcl_ loaded, we have a number of additional commands and
global variables:

   * tclLog, unknown, auto\_load, auto\_execok are the most important
     ones.

   * auto\_path, errorInfo, errorCode

To create an interpreter that can handle Tk as well, you should be
aware of the following:

   * Tk-able interpreters always need to be initialised via Tk\_Init\(\)
     and therefore require the start-up scripts: these scripts contain
     the default bindings and resource definitions and are therefore
     indispensable for Tk.

   * An application written using Tk needs to process events in a
     well-defined event loop.

TODO: how to write the event loop, what choices are available?

# Initialisation via the standard shell

The details of the initialisation done in the standard tclsh shell
are quite intricate. They involve, in addition to the initialisation via
Tcl\_FindExecutable\(\) and Tcl\_Init\(\) also:

   * processing the command-line arguments

   * customisation via various hooks

   * preparing the Tcl parser by setting the locale to "C", as only
     this guarantees everything works as expected.

A summary of the steps found in the initialisation code is given below:

   * main\(\) is a system-dependent routine which:

	   > \* sets the locale \(Windows version\)

	   > \* parses the command-line according to the UNIX rules \(Windows
       version\)

	   > \* calls Tcl\_Main\(\), which is not supposed to return

   * Tcl\_Main\(\) takes as arguments the well-known _argc/argv_
     command-line arguments and a pointer to the initialisation routine,
     which in the case of _tclsh_ is Tcl\_AppInit\(\):

	   > \* After calling Tcl\_FindExecutable\(\), processing the
       command-line arguments and calling the initialisation routine,
       it can do either of two things:

	   > > \* Evaluate the script file, if the first argument does not
         start with a minus sign

	   > > \* Or go into an interactive loop to read the commands from
         the prompt. The preparation in that case is to evaluate
         the start-up script \(such as ~/.tclshrc or ~/tclshrc.tcl\)

	   > \* It exits by evaluating the Tcl "exit" command, not by calling
       the C routine _exit\(\)_ directly

   * The standard initialisation routine Tcl\_AppInit\(\) is meant to
     initialise the various application-specific commands and static
     packages via routines like Tcl\_CreateCommand\(\). It also sets the
     Tcl variable "tcl\_rcFile" to the user's start-up script.

	   > \(Curiously, the standard routine is found in a platform-dependent
     source file, tclXXXInit.c\)

   * Tcl\_Init\(\) by the way provides two hooks for customisation:

	   > \* A pre-initialisation script that gets evaluated when the
       static variable "tclPreInitScript" has been set.

	   > \* The initScript variable that defines a Tcl procedure that
       looks up the _init.tcl_ script.

Thus, before the shell is ready for processing, a lot of initialisation
is done. Much of this process can be customised without the need to
change the standard source files.

# Overview

This section provides an overview of the resources that an application
requires, given the type of usage:

_Bare Tcl only interpreter:_

   * Just the Tcl dynamic libraries

_Complete initialisation for Tcl only:_

   * The Tcl dynamic libraries

   * The environment variable TCL\_LIBRARY

   * The initialisation script file _init.tcl_

   * The character encoding tables \(optional\)

_Customised Tcl shell \(adapted Tcl\_AppInit\(\)\):_

   * The Tcl dynamic libraries

   * The environment variable TCL\_LIBRARY

   * The initialisation script file _init.tcl_

   * The character encoding tables \(optional\)

   * Possibly a so-called RC file to define the initialisation for
     interactive use

_Customised Tk shell \(wish; adapted Tk\_AppInit\(\)\):_

   * The Tcl and Tk dynamic libraries

   * The environment variables TCL\_LIBRARY and TK\_LIBRARY

   * The initialisation script file _init.tcl_, and the Tk specific
     bindings \(in _tk.tcl_ and others\)

   * The character encoding tables \(optional\)

   * Possibly a so-called RC file to define the initialisation for
     interactive use

Equally important are the limitations:

_Bare Tcl only interpreter:_

   * No customisable initialisation \(not automatically\)

   * No access to the command-line arguments or the directory
     that contains the executable

   * No alternative character encodings

   * Possibly problems loading packages, as the auxiliary procedures
     for this are defined in _init.tcl_ and others.

   * Possibly problems with the locale \(best to explicitly set it to
     "C"\)

   * No interactive use

_Complete initialisation for Tcl only:_

   * Possibly problems with the locale \(best to explicitly set it to
     "C"\)

   * No interactive use

_Customised Tcl shell \(adapted Tcl\_AppInit\(\)\):_

   * None

_Customised Tk shell \(wish; adapted Tk\_AppInit\(\)\):_

   * None

----

# Compiling and linking

Nowadays, it seems the default to use dynamic or shared libraries. So,
with many installations, there will exist dynamic versions of the
libraries and sometimes there will be no static versions. This has a
number of advantages:

   * The executables are much smaller, the memory usage can be smaller
     as well, as the code will be shared.

   * The libraries can be replaced without the need to rebuild the
     application. This is especially true if you enable the use of
     _stubs_ for your binary packages \(see below\).

However, as the Tcl libraries now reside outside your application, they
will have to be shipped with the application and the dynamic loader must
somehow be able to find the libraries. The latter certainly has
consequences: each system tends to have its own method.

When you have the Tcl/Tk sources, you can decide to create your own
libraries. Of special interest are the following two situations:

   * You want to be able to use the _stubs_ facility, as this makes
     it possible to run with different versions of Tcl/Tk with the same
     binary.

   * You want to get rid of as much extra stuff outside your application
     as possible, so you want to use the static version of the Tcl
     libraries.

_Stubs_ were introduced to make binary extensions and applications
independent of the specific Tcl version. They are enabled by defining
the macro _TCL\_USE\_STUBS_ during the compilation and linking
of the Tcl/Tk library and especially your own extension.

In the initialisation procedure for your pacakge or application you need
to initialise the stubs jump table via _Tcl\_InitStubs\(\)_:

	 #ifdef USE_TCL_STUBS
	    if (Tcl_InitStubs(interp, "8.1", 0) == NULL) {
	       return TCL_ERROR;

	    }
	 #endif

\(details: <http://mini.net/tcl/1687.html> \)

The technique, as Brent Welch explains, is simple in principle:

   > By enabling stubs, all calls to Tcl routines are turned into
     function pointers. These pointers are kept in a large table that
     is filled with the correct pointer values via the Tcl\_InitStubs\(\)
     routine.

Linking your application or extension should then be done against the
"stub version" of the Tcl/Tk libraries.

If you do not want dynamic libraries, then perhaps a build with the
option _STATIC\_BUILD_ is a solution. With this option, static
libraries are built. The libraries are then incorporated into the
executable itself.

_Note:_ On some platforms, notably Windows, the specific
calling convention is then turned to standard C \(with dynamic libraries,
the calling convention exports the various routines explicitly\).

When you do not care about the dynamic libraries having to be present,
at least be aware of the way the various systems want to define their
position.

The information above is summarised as follows:

_Using dynamic libraries:_

   * Most UNIX versions and LINUX use the environment
     variable LD\_LIBRARY\_PATH, colon-separated just like _PATH_
     to indicate the position of dynamic libraries.

   * Some use the variable SHLIB\_PATH instead \(notably: HPUX\).

   * Under Windows \(all flavours\) the PATH variable is used and a
     predefined sequence of directories to find the DLL's. One important
     case is that the libraries are found in the same directory as
     the executable.

_Building for general Tcl versions:_

   * Compile your sources with the macro _TCL\_USE\_STUBS_

   * Use the proper call to Tcl\_InitStubs\(\) to initialise the
     jump table.

   * Link against the stub versions of the Tcl/Tk libraries.

_Building statically:_

   * Use the flag STATIC\_BUILD to build the static Tcl/Tk libraries.

   * Use this flag for your own sources as well

   * Link against the static versions.

# Copyright

This document is placed in the public domain.

Name change from tip/67.tip to tip/67.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77

78
79
80
81
82

83
84

85
86
87
88
89
90
91
92
93
94
95

96
97
98
99

100
101
102

103
104
105
106
107
108
109
110
111
112
113
114
115

116
117
118
119
120
121
122
123

124
125
126
127
128
129
130
131

132
133
134
135
136
137
138
139
140

141
142
143
144

145
146
147
148
149
150
151
152
153
154
155
156
157
158

TIP:           67
Title:         Allow Subclassing of tk_getOpenFile, tk_getSaveFile on UNIX
Version:       $Revision: 1.5 $
Author:        Chris Nelson <[email protected]>
Author:        Al Zielaskowski <[email protected]>
State:         Withdrawn
Type:          Project
Tcl-Version:   8.5
Vote:          Pending
Created:       09-Oct-2001
Post-History:

~ Abstract

On Microsoft Windows it is possible to "subclass" a standard dialog
and add controls to it.  This TIP proposes adding that feature to the
''tk_getOpenFile'' and ''tk_getSaveFile'' dialogs for non-Windows
systems (wherever ''tkfbox.tcl'' and ''xmfbox.tcl'' are used for these
dialogs).

~ Rationale

In our work with Tk, we have need to save files in various formats and
give the user control over more than just the file name when saving.
While it is possible to have two separate dialogs - one for specifying
the file name and location and another for other attributes - this is
unwieldy and not very user friendly: all the related information
should be in one dialog

On Microsoft Windows, it is possible to add controls to standard
dialogs (indeed any window) via "subclassing" (cf
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/winui/commdlg_4qlv.asp).
(This requires C programming but it is, at least, possible.)

On UNIX, no generic technique like subclassing exists.  Even if we
wished to invade the "standard dialog," - learning about the window's
organization, adding widgets here and there - calling
''tk_getSaveFile'' blocks the caller and then returns a value after
the dialog is destroyed so we have no opportunity to manipulate the
dialog.  To work around this, we need to have ''tk_getSaveFile'' call
back into user code to add controls when the dialog is built.

~ Specification

We add a ''-subclass'' option to ''tk_getSaveFile'' and
''tk_getOpenFile'' (on UNIX only).  The value of the ''-subclass''
option is a Tcl command to evaluate to fill an extra frame near the
bottom of the dialog.  When the dialog is constructed, the subclass
command, if any, is evaluated with the path to the frame appended as
an additional argument.  The subclass command can then fill the frame
as needed.

No additional semantic changes are needed for these additional
controls to communicate with the program as such communication can be
done through side effects.  For example, user interaction with a
checkbox created by the subclass command can be detected after the
''tk_getSaveFile'' dialog is closed by examining the value of the
checkbox's global variable.

~ Reference Implementation

This proposal has been implemented by Al Zielaskowski.  A patch
relative to Tk 8.4a3 follows:

|Index: tkfbox.tcl
|===================================================================
|RCS file: /pti/prod/mrd/CvsRepository/tcl/tk/library/tkfbox.tcl,v
|retrieving revision 1.1.1.1
|diff -u -w -r1.1.1.1 tkfbox.tcl
|--- tkfbox.tcl  2001/09/04 23:51:12     1.1.1.1
|+++ tkfbox.tcl  2001/10/09 19:47:50
|@@ -898,6 +898,7 @@
|        {-initialfile "" "" ""}
|        {-parent "" "" "."}
|        {-title "" "" ""}
|+       {-subclass "" "" ""}
|     }

| 
|     # The "-multiple" option is only available for the "open" file dialog.
|@@ -1087,9 +1088,22 @@
|     # Pack all the frames together. We are done with widget construction.
|     #

|     pack $f1 -side top -fill x -pady 4
|+

|+    #
|+    # Add the user's subclass frame if one was specified
|+    #
|+    if {[string length $data(-subclass)]} {
|+       frame $w.subclass -bd 0
|+       pack $w.subclass -side bottom -fill x \
|+                -padx [list [expr [winfo reqwidth $data(typeMenuLab)] + 8] \
|+               [expr [winfo reqwidth $data(okBtn)] + 8]]
|+       eval $data(-subclass) $w.subclass
|+    }
|+

|     pack $f3 -side bottom -fill x
|     pack $f2 -side bottom -fill x
|     pack $data(icons) -expand yes -fill both -padx 4 -pady 1
|+

| 
|     # Set up the event handlers that are common to Directory and File Dialogs
|     #

|Index: xmfbox.tcl
|===================================================================
|RCS file: /pti/prod/mrd/CvsRepository/tcl/tk/library/xmfbox.tcl,v
|retrieving revision 1.1.1.1
|diff -u -w -r1.1.1.1 xmfbox.tcl
|--- xmfbox.tcl  2001/09/04 23:51:12     1.1.1.1
|+++ xmfbox.tcl  2001/10/09 19:05:57
|@@ -216,6 +216,7 @@
|        {-initialfile "" "" ""}
|        {-parent "" "" "."}
|        {-title "" "" ""}
|+       {-subclass "" "" ""}
|     }

|     if { [string equal $type "open"] } {
|        lappend specs {-multiple "" "" "0"}
|@@ -277,6 +278,7 @@
|     if {![winfo exists $data(-parent)]} {
|        error "bad window path name \"$data(-parent)\""
|     }
|+
| }

| 
| # ::tk::MotifFDialog_BuildUI --
|@@ -360,6 +362,17 @@
| 
|     pack $bot.ok $bot.filter $bot.cancel -padx 10 -pady 10 -expand yes \
|        -side left
|+
|+

|+    #
|+    # Add the user's subclass frame if one was specified
|+    #
|+    if {[string length $data(-subclass)]} {
|+       frame $f3.subclass -bd 0
|+       pack $f3.subclass -side bottom -fill x -padx 4 -pady 4
|+       eval $data(-subclass) $f3.subclass
|+    }
|+

| 
|     # Create the bindings:
|     #

~ Notice of Withdrawal

This TIP was Withdrawn by the TIP Editor following discussion on the
tcl-core mailing list.  The following is a summary of reasons for
withdrawal:

 > This would make porting code between platforms obscenely difficult
   as there is no way for the subclassing to work the same way on all
   platforms.  Better for people to roll their own, perhaps starting
   from the foundations of the UNIX file browsing code if they wish.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|
|

|

|
|
|

|

|

|

|
|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
>
|
<
>
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
<
<
<
>
>
>
|
|
|
|
|
|
<
<
>
>
|
|
|
|
|
|
|
|
<
>
|
|
<
|
>
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75

76
77
78
79
80

81
82

83
84
85
86
87
88
89
90
91
92
93

94
95
96
97

98
99
100

101
102
103
104
105
106
107
108
109
110
111
112
113

114
115
116
117
118
119

120
121
122
123
124
125
126
127
128

129
130
131
132
133
134
135
136
137
138

139
140
141

142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158

# TIP 67: Allow Subclassing of tk_getOpenFile, tk_getSaveFile on UNIX

	Author:        Chris Nelson <[email protected]>
	Author:        Al Zielaskowski <[email protected]>
	State:         Withdrawn
	Type:          Project
	Tcl-Version:   8.5
	Vote:          Pending
	Created:       09-Oct-2001
	Post-History:
-----

# Abstract

On Microsoft Windows it is possible to "subclass" a standard dialog
and add controls to it.  This TIP proposes adding that feature to the
_tk\_getOpenFile_ and _tk\_getSaveFile_ dialogs for non-Windows
systems \(wherever _tkfbox.tcl_ and _xmfbox.tcl_ are used for these
dialogs\).

# Rationale

In our work with Tk, we have need to save files in various formats and
give the user control over more than just the file name when saving.
While it is possible to have two separate dialogs - one for specifying
the file name and location and another for other attributes - this is
unwieldy and not very user friendly: all the related information
should be in one dialog

On Microsoft Windows, it is possible to add controls to standard
dialogs \(indeed any window\) via "subclassing" \(cf
<http://msdn.microsoft.com/library/default.asp?url=/library/en-us/winui/commdlg\_4qlv.asp\).>
\(This requires C programming but it is, at least, possible.\)

On UNIX, no generic technique like subclassing exists.  Even if we
wished to invade the "standard dialog," - learning about the window's
organization, adding widgets here and there - calling
_tk\_getSaveFile_ blocks the caller and then returns a value after
the dialog is destroyed so we have no opportunity to manipulate the
dialog.  To work around this, we need to have _tk\_getSaveFile_ call
back into user code to add controls when the dialog is built.

# Specification

We add a _-subclass_ option to _tk\_getSaveFile_ and
_tk\_getOpenFile_ \(on UNIX only\).  The value of the _-subclass_
option is a Tcl command to evaluate to fill an extra frame near the
bottom of the dialog.  When the dialog is constructed, the subclass
command, if any, is evaluated with the path to the frame appended as
an additional argument.  The subclass command can then fill the frame
as needed.

No additional semantic changes are needed for these additional
controls to communicate with the program as such communication can be
done through side effects.  For example, user interaction with a
checkbox created by the subclass command can be detected after the
_tk\_getSaveFile_ dialog is closed by examining the value of the
checkbox's global variable.

# Reference Implementation

This proposal has been implemented by Al Zielaskowski.  A patch
relative to Tk 8.4a3 follows:

	Index: tkfbox.tcl
	===================================================================
	RCS file: /pti/prod/mrd/CvsRepository/tcl/tk/library/tkfbox.tcl,v
	retrieving revision 1.1.1.1
	diff -u -w -r1.1.1.1 tkfbox.tcl
	--- tkfbox.tcl  2001/09/04 23:51:12     1.1.1.1
	+++ tkfbox.tcl  2001/10/09 19:47:50
	@@ -898,6 +898,7 @@
	        {-initialfile "" "" ""}
	        {-parent "" "" "."}
	        {-title "" "" ""}
	+       {-subclass "" "" ""}

	     }

	     # The "-multiple" option is only available for the "open" file dialog.
	@@ -1087,9 +1088,22 @@
	     # Pack all the frames together. We are done with widget construction.

	     #
	     pack $f1 -side top -fill x -pady 4

	+
	+    #
	+    # Add the user's subclass frame if one was specified
	+    #
	+    if {[string length $data(-subclass)]} {
	+       frame $w.subclass -bd 0
	+       pack $w.subclass -side bottom -fill x \
	+                -padx [list [expr [winfo reqwidth $data(typeMenuLab)] + 8] \
	+               [expr [winfo reqwidth $data(okBtn)] + 8]]
	+       eval $data(-subclass) $w.subclass
	+    }

	+
	     pack $f3 -side bottom -fill x
	     pack $f2 -side bottom -fill x
	     pack $data(icons) -expand yes -fill both -padx 4 -pady 1

	+

	     # Set up the event handlers that are common to Directory and File Dialogs

	     #
	Index: xmfbox.tcl
	===================================================================
	RCS file: /pti/prod/mrd/CvsRepository/tcl/tk/library/xmfbox.tcl,v
	retrieving revision 1.1.1.1
	diff -u -w -r1.1.1.1 xmfbox.tcl
	--- xmfbox.tcl  2001/09/04 23:51:12     1.1.1.1
	+++ xmfbox.tcl  2001/10/09 19:05:57
	@@ -216,6 +216,7 @@
	        {-initialfile "" "" ""}
	        {-parent "" "" "."}
	        {-title "" "" ""}
	+       {-subclass "" "" ""}

	     }
	     if { [string equal $type "open"] } {
	        lappend specs {-multiple "" "" "0"}
	@@ -277,6 +278,7 @@
	     if {![winfo exists $data(-parent)]} {
	        error "bad window path name \"$data(-parent)\""

	     }
	+
	 }

	 # ::tk::MotifFDialog_BuildUI --
	@@ -360,6 +362,17 @@

	     pack $bot.ok $bot.filter $bot.cancel -padx 10 -pady 10 -expand yes \
	        -side left

	+
	+
	+    #
	+    # Add the user's subclass frame if one was specified
	+    #
	+    if {[string length $data(-subclass)]} {
	+       frame $f3.subclass -bd 0
	+       pack $f3.subclass -side bottom -fill x -padx 4 -pady 4
	+       eval $data(-subclass) $f3.subclass
	+    }

	+

	     # Create the bindings:

	     #

# Notice of Withdrawal

This TIP was Withdrawn by the TIP Editor following discussion on the
tcl-core mailing list.  The following is a summary of reasons for
withdrawal:

 > This would make porting code between platforms obscenely difficult
   as there is no way for the subclassing to work the same way on all
   platforms.  Better for people to roll their own, perhaps starting
   from the foundations of the UNIX file browsing code if they wish.

# Copyright

This document has been placed in the public domain.

Name change from tip/68.tip to tip/68.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117

TIP:		68
Title:		Dynamic Trace Result Handling
Version:	$Revision: 1.5 $
Author:		Donal K. Fellows <[email protected]>
State:		Final
Type:		Project
Tcl-Version:	8.4
Vote:		Done
Created:	16-Oct-2001
Post-History:	

~ Abstract

This TIP proposes an extension to the ''Tcl_TraceVar'' API to cope
with dynamically allocated results.

~ Rationale

The current API for handling errors during variable accesses is based
on static strings, which is perfectly adequate for the errors that Tcl
generates of its own accord, but which is substantially at odds with
the setting of traces on variables which may produce errors.  The
problem is that as those errors come from a Tcl script, they are
allocated dynamically and fail to satisfy the static allocation rule
mentioned previously.  Normally this does not cause a problem, but
under some circumstances (as set out in Bug #219393
http://sf.net/tracker/?func=detail&aid=219393&group_id=10894&atid=110894)
it is possible for this to cause a memory fault or even memory
corruption.  This is because it can sometimes happen that the pointer
to the supposedly static string winds up dangling as the string it was
pointing to gets deleted out from underneath it (the storage area used
to static-ify the string is part of the trace structure, but the trace
is permitted to delete that structure...)  Obviously this is not
desirable!

There are several possible fixes, but the two main ones are to:

 1. use the ''Tcl_Preserve'' mechanism to postpone deletion of the
    allocated memory block until it has been copied into something
    more permanent.

 2. add special handling so as to mark the error result coming back
    from the trace mechanism as something other than a static string.
    The main alternatives here are:

 > * A dynamically-allocated C string, to be disposed of with
     ''ckfree''.

 > * A dynamically-allocated ''Tcl_Obj'' reference, to be disposed of
     with a single call to ''Tcl_DecrRefCount''.

Although option 1 is the easiest to implement, it has the disadvantage
of putting a new ''non-obvious'' requirement on all variable traces,
and that is that their results are all ''Tcl_Preserve''d before the
end of the trace.  This is feasible for the Tcl core, but unreasonable
to ask of extension writers.

Instead I prefer option 2, and it is possible to introduce this change
in such a way that existing software does not see an API change (i.e.
there are no serious backward-compatibility issues) and both styles of
result listed above are supported.  The advantage of supporting both
of these is that dynamically allocated strings are a very easy
interface for extension writers to use though not particularly
efficient, and objects are a very efficient interface well-suited to
the core itself but are not as easy to use.  (It is far easier to
adapt existing code to use dynamic strings as no understanding of
lifespan management is required.)

~ Changes

To achieve this, the following new flags will be defined:

|#define TCL_TRACE_RESULT_DYNAMIC  0x2000
|#define TCL_TRACE_RESULT_OBJECT   0x4000

These flags, when passed to the ''flags'' argument of ''Tcl_TraceVar''
(and related functions) alter the interpretation of the value returned
by the call to the ''proc'' parameter from the default behaviour (a
static string) to be either a string to be deallocated by Tcl as and
when it sees fit using ''ckfree'' (when ''TCL_TRACE_RESULT_DYNAMIC''
is specified) or to be a ''Tcl_Obj'' (which must be cast to a ''char
*'' for type compatibility) to be disposed of when the error message
is no longer required (when ''TCL_TRACE_RESULT_OBJECT'' is specified.)
It is an error to specify both flags on the same call.

The core will then be modified to use this mechanism for variable
traces as set up by the ''trace'' command.

~ Copyright

This TIP is placed in the public domain.

~ Reference

For reference, the pre-TIP definition of the ''Tcl_TraceVar'' function
is as follows:

|     int
|     Tcl_TraceVar(Tcl_Interp *interp, char *varName, int flags,
|                  Tcl_VarTraceProc *proc, ClientData clientData)

(There is a corresponding function that takes the variable name as a
pair of strings.)  All parameters have the usual obvious
interpretations, with the ''flags'' being an OR-ed combination of the
following flags:

  TCL_TRACE_READS: Invoke the callback when the variable is read.

  TCL_TRACE_WRITES: Invoke the callback when the variable is written.

  TCL_TRACE_UNSETS: Invoke the callback when the variable is unset.

  TCL_TRACE_ARRAY: Invoke the callback when the variable is accessed
     as an array.

  TCL_GLOBAL_ONLY: Force the lookup of the variable in the global
     scope, and not the current one.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|
|

|

|

|

|
|

|
|

|
|

|
|

|

|

|

|
|

|
|
|
|
|
|
|
|

|

|

|

|

|
|
|

|
|
|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117

# TIP 68: Dynamic Trace Result Handling

	Author:		Donal K. Fellows <[email protected]>
	State:		Final
	Type:		Project
	Tcl-Version:	8.4
	Vote:		Done
	Created:	16-Oct-2001
	Post-History:	
-----

# Abstract

This TIP proposes an extension to the _Tcl\_TraceVar_ API to cope
with dynamically allocated results.

# Rationale

The current API for handling errors during variable accesses is based
on static strings, which is perfectly adequate for the errors that Tcl
generates of its own accord, but which is substantially at odds with
the setting of traces on variables which may produce errors.  The
problem is that as those errors come from a Tcl script, they are
allocated dynamically and fail to satisfy the static allocation rule
mentioned previously.  Normally this does not cause a problem, but
under some circumstances \(as set out in Bug \#219393
<http://sf.net/tracker/?func=detail&aid=219393&group\_id=10894&atid=110894\)>
it is possible for this to cause a memory fault or even memory
corruption.  This is because it can sometimes happen that the pointer
to the supposedly static string winds up dangling as the string it was
pointing to gets deleted out from underneath it \(the storage area used
to static-ify the string is part of the trace structure, but the trace
is permitted to delete that structure...\)  Obviously this is not
desirable!

There are several possible fixes, but the two main ones are to:

 1. use the _Tcl\_Preserve_ mechanism to postpone deletion of the
    allocated memory block until it has been copied into something
    more permanent.

 2. add special handling so as to mark the error result coming back
    from the trace mechanism as something other than a static string.
    The main alternatives here are:

	 > \* A dynamically-allocated C string, to be disposed of with
     _ckfree_.

	 > \* A dynamically-allocated _Tcl\_Obj_ reference, to be disposed of
     with a single call to _Tcl\_DecrRefCount_.

Although option 1 is the easiest to implement, it has the disadvantage
of putting a new _non-obvious_ requirement on all variable traces,
and that is that their results are all _Tcl\_Preserve_d before the
end of the trace.  This is feasible for the Tcl core, but unreasonable
to ask of extension writers.

Instead I prefer option 2, and it is possible to introduce this change
in such a way that existing software does not see an API change \(i.e.
there are no serious backward-compatibility issues\) and both styles of
result listed above are supported.  The advantage of supporting both
of these is that dynamically allocated strings are a very easy
interface for extension writers to use though not particularly
efficient, and objects are a very efficient interface well-suited to
the core itself but are not as easy to use.  \(It is far easier to
adapt existing code to use dynamic strings as no understanding of
lifespan management is required.\)

# Changes

To achieve this, the following new flags will be defined:

	#define TCL_TRACE_RESULT_DYNAMIC  0x2000
	#define TCL_TRACE_RESULT_OBJECT   0x4000

These flags, when passed to the _flags_ argument of _Tcl\_TraceVar_
\(and related functions\) alter the interpretation of the value returned
by the call to the _proc_ parameter from the default behaviour \(a
static string\) to be either a string to be deallocated by Tcl as and
when it sees fit using _ckfree_ \(when _TCL\_TRACE\_RESULT\_DYNAMIC_
is specified\) or to be a _Tcl\_Obj_ \(which must be cast to a _char
*_ for type compatibility\) to be disposed of when the error message
is no longer required \(when _TCL\_TRACE\_RESULT\_OBJECT_ is specified.\)
It is an error to specify both flags on the same call.

The core will then be modified to use this mechanism for variable
traces as set up by the _trace_ command.

# Copyright

This TIP is placed in the public domain.

# Reference

For reference, the pre-TIP definition of the _Tcl\_TraceVar_ function
is as follows:

	     int
	     Tcl_TraceVar(Tcl_Interp *interp, char *varName, int flags,
	                  Tcl_VarTraceProc *proc, ClientData clientData)

\(There is a corresponding function that takes the variable name as a
pair of strings.\)  All parameters have the usual obvious
interpretations, with the _flags_ being an OR-ed combination of the
following flags:

  TCL\_TRACE\_READS: Invoke the callback when the variable is read.

  TCL\_TRACE\_WRITES: Invoke the callback when the variable is written.

  TCL\_TRACE\_UNSETS: Invoke the callback when the variable is unset.

  TCL\_TRACE\_ARRAY: Invoke the callback when the variable is accessed
     as an array.

  TCL\_GLOBAL\_ONLY: Force the lookup of the variable in the global
     scope, and not the current one.

Name change from tip/69.tip to tip/69.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59

60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97

98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119

120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148

149
150
151
152
153

154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179

180
181
182
183
184

185
186
187
188
189
190
191
192
193
194
195
196
197
198
199

200
201
202
203
204

205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227

228
229
230
231
232
233
234

235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396

397
398
399
400
401

402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474

TIP:            69
Title:          Improvements for the Tcl Hash Table
Version:        $Revision: 1.10 $
Author:         George A. Howlett <[email protected]>
Author:         Don Porter <[email protected]>
Author:         Donal K. Fellows <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        16-Oct-2001
Post-History:   
Discussions-To: news:comp.lang.tcl
Tcl-Version:    9.0

~ Abstract

This document describes various improvements to the existing Tcl hash
table.  They include support for 64 bit platforms, better memory
performance, and improved array hashing.  The goal is a hash table
that improves Tcl/Tk, but also can be used in industrial strength
applications.

~ Introduction

A strength of Tcl that has not diminished in the advance of other
scripting languages (Perl, Python, Ruby, etc.) is the easy way its
command language can be extended with C/C++ code.  For example, the
prominence of Tcl in Electronic Design Automation (EDA) tools is
striking.  It's hard to find EDA tools that do not use Tcl to some
degree.  At the same time, there is a current trend toward 64-bit
computing platforms.  The impetus has been from industry (like EDA)
rather than office or home users, wanting to solve bigger problems,
faster.  If Tcl applications are to operate on 64-bit platforms, a big
first step towards this goal will be a 64-bit version of the Tcl hash
table.

The current Tcl hash table performs well on 32-bit platforms.  It has
been tuned and wrung out handling internal Tcl/Tk code.  But its one
word hash function

| #define RANDOM_INDEX(tablePtr, i) \
|    (((((long) (i))*1103515245) >> (tablePtr)->downShift) & (tablePtr)->mask)

can not hash the longer 64-bit addresses properly.  

Example:

|    Tcl_HashTable t;
|    unsigned long i;
|    char *base, *addr;	
|    int isNew;
|    char *mesg;
|
|    Tcl_InitHashTable(&t, TCL_ONE_WORD_KEYS);
|    base = 0xFFFFFF000000000UL;
|    for (i = 0; i < 100; i++) {
|        addr = base + i * 0x100000000;
|        hPtr = Tcl_CreateHashEntry(&t, addr, &isNew);
|    }

|    mesg = Tcl_HashStats(&t);
|    fprintf(stderr, "Stats\n%s\n", mesg);
|    free((char *)mesg;

Note that the keys all have zeros in the lower 32 bits.  All 100
entries will hash to the same value.  Driving the need for 64-bit
systems is the ability to address more memory.  So it's imperative
that Tcl hash table be able to hash large virtual addresses.

Building upon the current hash table implementation, the following
sections describe specific areas for improvement:

	* improved array/structure hashing

	* better memory performance 

	* support for 64-bit platforms

The goal is an improved Tcl hash table for internal Tcl and Tk code,
but also high performance applications.

~ Improved Array/Structure Hashing

The Tcl hash table handles three types of hash keys: string keys, one
word keys, and multi-word keys.  Each key type has its own hash
function associated with it.  The benefit of this approach is that
better hash functions can be used for the specific types, than one
general function for all types.  The string and one word hash
functions are very good for typical keys.  The multi-word or array
hash is not as good.

The array hash sums the each word of the array and then randomizes
the result.  

|    for (index = 0, count = tablePtr->keyType, iPtr1 = arrayPtr;
|	    count > 0; count--, iPtr1++) {
|	index += *iPtr1;
|    }

|    index = RANDOM_INDEX(tablePtr, index);

This works poorly for many types of hash keys.  For an contrived
example of hashing 1 million 3D coordinates,

|    typedef struct {
|       double x, y, z;
|    }  Double3;
|    double d3;
|
|    Tcl_InitHashTable(&t, sizeof(Double3) / sizeof(int));
|	
|    for (i = 0; i < 100; i++) {
|	for (j = 0; j < 100; j++) {
|	    for (k = 0; k < 100; k++) {
|		d3.x = (double)i;
|		d3.y = (double)j;
|		d3.z = (double)k;
|		hPtr = Tcl_CreateHashEntry(&t, (char *)&d3, &isNew);
|	    }
|	}
|    }

we get a hash table with an average search distance of 1082.3.  The
maximum distance is 3324!  Replacing the hash function with Bob
Jenkins' [http://burtleburtle.net/bob] 32-bit mixing function

|   #define MIX32(a,b,c) \
|	a -= b, a -= c, a ^= (c >> 13), \
|	b -= c, b -= a, b ^= (a <<  8), \
|	c -= a, c -= b, c ^= (b >> 13), \
|	a -= b, a -= c, a ^= (c >> 12), \
|	b -= c, b -= a, b ^= (a << 16), \
|	c -= a, c -= b, c ^= (b >>  5), \
|	a -= b, a -= c, a ^= (c >>  3), \
|	b -= c, b -= a, b ^= (a << 10), \
|	c -= a, c -= b, c ^= (b >> 15)
|
|   int a, b, c, len;
|
|   len = length;
|   a = b = GOLDEN_RATIO32;	/* An arbitrary value */
|   c = 0;			/* Previous hash value */
|
|   while (len >= 3) {		/* Handle most of the key */
|	a += key[0];
|	b += key[1];
|	c += key[2];
|	MIX32(a, b, c);
|	key += 3; len -= 3;
|    }

|    c += length;		
|    switch(len) {
|    case 2 : b += key[1];
|    case 1 : a += key[0];
|    }

|    MIX32(a, b, c);
|    return c;

yields a table with an average search distance of 1.48.  The maximum
distance is 8.  The Jenkins' hash function provides good results for
many different types of arrays and structures.  The disadvantage is
that the hash function is slightly more expensive to compute.  

~ Improving RebuildTable.

The cost of computing a hash function is especially felt each time
table is rebuilt as new entries are added.  The ''RebuildTable'' function
calls the hash function of each entry to recompute its new location in
the bigger table.

|    for (oldChainPtr = oldBuckets; oldSize > 0; oldSize--, oldChainPtr++) {
|	for (hPtr = *oldChainPtr; hPtr != NULL; hPtr = *oldChainPtr) {
|	    *oldChainPtr = hPtr->nextPtr;
|	    if (tablePtr->keyType == TCL_STRING_KEYS) {
|		index = HashString(hPtr->key.string) & tablePtr->mask;
|	    } else if (tablePtr->keyType == TCL_ONE_WORD_KEYS) {
|		index = RANDOM_INDEX(tablePtr, hPtr->key.oneWordValue);
|	    } else {
|		index = HashArray(hPtr->key.words, tablePtr->keyType) &
|			tablePtr->mask;
|	    }

|	    hPtr->bucketPtr = &(tablePtr->buckets[index]);
|	    hPtr->nextPtr = *hPtr->bucketPtr;
|	    *hPtr->bucketPtr = hPtr;
|	}
|    }

The new bucket location is then stored in the hash entry.

Except for one word keys, the hash value is invariant of the table
size.  If the hash value was stored with each entry, then it would not
need to be recomputed each time the table is rebuilt.

|    for (oldChainPtr = oldBuckets; oldSize > 0; oldSize--, oldChainPtr++) {
|	for (hPtr = *oldChainPtr; hPtr != NULL; hPtr = *oldChainPtr) {
|	    *oldChainPtr = hPtr->nextPtr;
|	    if (tablePtr->keyType == TCL_ONE_WORD_KEYS) {
|		index = RANDOM_INDEX(tablePtr, hPtr->key.oneWordValue);
|	    } else {
|		index = hPtr->hval & tablePtr->mask;
|	    }

|	    hPtr->bucketPtr = &(tablePtr->buckets[index]);
|	    hPtr->nextPtr = *hPtr->bucketPtr;
|	    *hPtr->bucketPtr = hPtr;
|	}
|    }

This would increase size of an hash entry, except that the pointer to
the hash bucket is now redundant, since it can cheaply be computed.

|    bucketPtr = tablePtr->buckets + (hPtr->hval & tablePtr->mask);

An added benefit is that hash table lookups become faster and easier
to perform.  If there is more than one hash entry in a bucket, you
don't need to examine the key unless the entry has the same hash
value.

|    for (hPtr = tablePtr->buckets[hindex]; hPtr != NULL;
|	    hPtr = hPtr->nextPtr) {
|       /* Don't look at entry unless the hash value is the same. */
|	if (hPtr->hval == hval) { 
|	    register int *iPtr1, *iPtr2;
|	    int count;
|
|	    for (iPtr1 = arrayPtr, iPtr2 = (int *)hPtr->key.words,
|		     count = tablePtr->keyType; ; count--, iPtr1++, iPtr2++) {
|		if (count == 0) {
|		    return hPtr;
|		}

|		if (*iPtr1 != *iPtr2) {
|		    break;
|		}
|	    }
|	}
|    }

''Don Porter <[email protected]>''

 > ''It appears that the recommendations of this section have already
   been implemented in Tcl 8.4.  In particular, when the symbol
   TCL_HASH_KEY_STORE_HASH == 1 (as it does by default), then the
   hash value is stored in each entry instead of the bucketPtr.''

 > ''If that is correct, then I recommend this section of the TIP be
   removed.  If not, more detail about how this proposal differs
   from the 8.4 implementation, and an argument why the proposal
   is superior are in order.''

~ Better Memory Performance 

One enduring complaint of the Tcl hash table on comp.lang.tcl is its
unexpected memory costs.  A table of 1 million one word key entries
uses over 36 Megabytes.

A hash entry is 20 bytes long.

| typedef struct Tcl_HashEntry {
|    struct Tcl_HashEntry *nextPtr;	/* Pointer to next entry in this
|					 * hash bucket, or NULL for end of
|					 * chain. */
|    struct Tcl_HashTable *tablePtr;	/* Pointer to table containing entry. */
|    struct Tcl_HashEntry **bucketPtr;	/* Pointer to bucket that points to
|					 * first entry in this entry's chain:
|					 * used for deleting the entry. */
|    ClientData clientData;		/* Application stores something here
|					 * with Tcl_SetHashValue. */
|    union {				/* Key has one of these forms: */
|	char *oneWordValue;		/* One-word value for key. */
|	int words[1];			/* Multiple integer words for key.
|					 * The actual size will be as large
|					 * as necessary for this table's
|					 * keys. */
|	char string[4];			/* String for key.  The actual size
|					 * will be as large as needed to hold
|					 * the key. */
|    } key;				/* MUST BE LAST FIELD IN RECORD!! */
| } Tcl_HashEntry;

Each entry stores a pointer to its hash table.  This field is used
only for deleting a hash entry.  But if the hash table is passed to
''Tcl_DeleteHashEntry'', the hash entry can be reduced to 16 bytes.
Inspecting Tcl/Tk code, I could not find a case where the hash table
was not easily available to pass as a parameter.

Each hash entry is allocated using ''malloc''.  System memory
allocators typically add 8-16 bytes overhead for each allocation.
Worse, calls to ''malloc'' and ''free'' tend to dominate the cost of
large hash tables.  ''Tcl_DeleteHashTable'' becomes very slow, freeing
hash entries scattered across pages of virtual memory.

For large hash tables, a pool allocation scheme can improve both
reduce the amount of memory used and improve memory performance.  By
allocating memory in larger chunks, the number of ''malloc'' and
''free'' calls is dramatically reduced.  Fixed size allocators (one
word keys and array keys) can also reclaim and reuse memory from
deleted entries.

The disadvantage of pool allocation is that memory is not released
until the hash table is deleted.  This is less of an issue for large
tables which tend to grow to a steady-state size.  Both Tcl and Tk use
hash tables to keep track of small amounts of information that
probably don't pool allocation.

So to retain compatibility, a new specialized initialization routine
can be used to indicate when to use pool-based allocation.

|    Tcl_InitHashTableWithPool(&table, TCL_ONE_WORD_KEYS);

The standard ''Tcl_InitHashTable'' call

|    Tcl_InitHashTable(&table, TCL_ONE_WORD_KEYS);

will still use ''malloc'' and ''free''.

~ Support for 64-bit Platforms.

While the C language makes no guarantees of a type's size or its
relation to other types, current programming practice assumes that
integers, longs, and pointers are all 32 bits long.  This, of course,
changes with 64-bit systems where pointers are 64-bits wide.
Depending upon the programming model, longs and ints may or may not be
64 bits too.

| Datatype      LP64    ILP64   LLP64   ILP32   LP32
|  char           8       8       8       8       8
|  short         16      16      16      16      16
|  _int32                        32
| int            32      64      32      32      16
| long           64      64      32      32      32
| long long                      64
| pointer        64      64      64      32      32

ILP32 is typical for 32 bit systems.  Windows 3.1 was a LP32 model.

In the LP64 model, pointers and longs are 64 bits, but ints remain 32
bits wide.  The LLP model retains the 32-bits for ints and longs, but
adds a 64-bit "long long" type.  Most 64-bit Unix systems (Solaris,
HP-UX, Tru64, AIX) are LP64.  I believe that Win64 is LLP.

The first problem is that addresses are now 64-bits, not 32.  This
means that existing code such as

|    Tcl_InitHashTable(&table, TCL_ONE_WORD_KEYS);
|
|    ptr = CreateSomeObject();
|    hPtr = Tcl_CreateHashEntry(&table, (void *)ptr, &isNew);

can possibly fail because the 32-bit one word hash function can't
properly hash the 64-bit pointer address.

''Don Porter <[email protected]>''

 > ''Pardon the interruption, but I do not understand what is meant
   by the assertion that hashing of 64-bit pointers "can possibly
   fail".  I've used Tcl on a 64-bit Alpha system for years, hashing
   64-bit pointers the whole time.  What failures should I be seeing?''

| #define RANDOM_INDEX(tablePtr, i) \
|    (((((long) (i))*1103515245) >> (tablePtr)->downShift) & (tablePtr)->mask)

The above one word hash function can be replaced with a 64-bit version
of Donald Knuth's multiplicative hash function.

|    ((key * GOLDEN_RATIO64) >> downShift) & tablePtr->mask)

where downShift is 64 - log2(tableSize) and the GOLDEN_RATION64 is a
prime approximately equal to (sqrt(64) - 1) / 2.  

The 64-bit array function is again from Bob Jenkins.  This time it's a
64-bit mixing function.

| #define MIX64(a,b,c) \
| 	a -= b, a -= c, a ^= (c >> 43), \
| 	b -= c, b -= a, b ^= (a <<  9), \
| 	c -= a, c -= b, c ^= (b >>  8), \
| 	a -= b, a -= c, a ^= (c >> 38), \
| 	b -= c, b -= a, b ^= (a << 23), \
| 	c -= a, c -= b, c ^= (b >>  5), \
| 	a -= b, a -= c, a ^= (c >> 35), \
| 	b -= c, b -= a, b ^= (a << 49), \
| 	c -= a, c -= b, c ^= (b >> 11), \
| 	a -= b, a -= c, a ^= (c >> 12), \
| 	b -= c, b -= a, b ^= (a << 18), \
| 	c -= a, c -= b, c ^= (b >> 22)
| 
|     uint64_t a, b, c, len;
| 
|     len = length;
|     a = b = GOLDEN_RATIO64;	/* An arbitrary value */
|     c = 0;			/* Previous hash value */
| 
|     while (len >= 3) {	/* Handle most of the key */
| 	a += key[0];
| 	b += key[1];
| 	c += key[2];
| 	MIX64(a,b,c);
| 	key += 3; len -= 3;
|     }

|     c += length;		
|     switch(len) {
|     case 2 : b += key[1];
|     case 1 : a += key[0];
|     }

|     MIX64(a,b,c);
|     return c;

Note that it also takes advantage of the 64-bit word size.

~ Summary

The following improvements to the current Tcl hash table have been
suggested.  

 * Replace the current array hash function.

 * Replace the bucket pointer in the hash entry with its hash value.
   This allows the table to be rebuilt without rehashing each entry.
   It also speeds bucket searches.

 * Remove the tablePtr from the hash entry, a 20% savings.  This
   requires that callers of ''Tcl_DeleteHashEntry'' pass the hash
   table as a parameter.

 * Allow the hash table to use fixed or variable size pool allocation
   since ''malloc'' and ''free'' costs dominate large tables.  Pool
   allocation substantial speeds large tables while also saving 8-16
   bytes per entry.  This can be done while still providing the normal
   ''malloc''/''free'' versions.

 * Support 64-bit platforms.  This requires 64-bit versions of one
   word and array hash functions.

The suggested changes are nothing new and can be found in most hash
table implementations.  This work builds on the already solid
foundation of the current hash table.  With the above improvements,
the Tcl hash table can be used in high performance applications.  It
also adds a useful piece to the 64-bit Tcl/Tk port.

I've created and tested a new hash table implementation under the
following systems.

|	System			32     64
|	linux-ix86-gcc		x	
|	Solaris-v9-cc		x	x
|	Solaris-v9-gcc		x	x
|	HPUX-11-cc		x	x
| 	HPUX-11-gcc		x
|	Win2k			x

It will be made publicly available on SourceForge.

~ Hashing of Malicious Strings

''Donal K. Fellows adds:''

In 2003 a possible denial-of-service attack on hash tables was published
that operated by making a majority of keys map to the same bucket.  While
this would not make the hashes function incorrectly - there would be no
extra memory consumed or incorrect accesses to memory - it still permits
an attacker to escalate the cost of hash accesses from O(1) to O(n) in
the normal case (and with obvious knock-on effects for the order of other
algorithms) and so mount an attack out-of-scale with the effort required
to set the attack up.

The way to fix this is to use a different hashing function for string
hashing that varies the exact hashing algorithm on a table-by-table
basis, and to base that algorithm on a hashing function with better
spectral properties than Tcl's current (extremely simple) one.  An
algorithm that might be suitable for such uses is described online
[http://burtleburtle.net/bob/hash/evahash.html] though the code would
need substantial adaption (including the addition of a fairly strong
random number generator) before being placed in the core.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|

|
|
|

|

|
|

|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|

|

|
|
|
<
>
|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
<
<
>
>
>

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|

|

|

|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
<
<
>
>

|
|
|
|
|
|
|
<
>
|
|
|
<
<
>
>

|

|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
<
<
<
<
|
>
>
>
>
|

|

|
|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|

|

|

|
|

|
|
|

|

|

|

|

|

|
|
|
|
|
|
|
|

|
|

|
|
|
|

|

|

|

|
|

|

|
|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|

|

|

|

|

|
|
|
|
|
|
|

|

|

|
|
|

|

|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57

58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95

96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115

116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146

147
148
149
150
151

152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177

178
179
180
181

182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197

198
199
200
201

202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225

226
227
228

229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394

395
396
397
398
399

400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474

# TIP 69: Improvements for the Tcl Hash Table

	Author:         George A. Howlett <[email protected]>
	Author:         Don Porter <[email protected]>
	Author:         Donal K. Fellows <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        16-Oct-2001
	Post-History:   
	Discussions-To: news:comp.lang.tcl
	Tcl-Version:    9.0
-----

# Abstract

This document describes various improvements to the existing Tcl hash
table.  They include support for 64 bit platforms, better memory
performance, and improved array hashing.  The goal is a hash table
that improves Tcl/Tk, but also can be used in industrial strength
applications.

# Introduction

A strength of Tcl that has not diminished in the advance of other
scripting languages \(Perl, Python, Ruby, etc.\) is the easy way its
command language can be extended with C/C\+\+ code.  For example, the
prominence of Tcl in Electronic Design Automation \(EDA\) tools is
striking.  It's hard to find EDA tools that do not use Tcl to some
degree.  At the same time, there is a current trend toward 64-bit
computing platforms.  The impetus has been from industry \(like EDA\)
rather than office or home users, wanting to solve bigger problems,
faster.  If Tcl applications are to operate on 64-bit platforms, a big
first step towards this goal will be a 64-bit version of the Tcl hash
table.

The current Tcl hash table performs well on 32-bit platforms.  It has
been tuned and wrung out handling internal Tcl/Tk code.  But its one
word hash function

	 #define RANDOM_INDEX(tablePtr, i) \
	    (((((long) (i))*1103515245) >> (tablePtr)->downShift) & (tablePtr)->mask)

can not hash the longer 64-bit addresses properly.  

Example:

	    Tcl_HashTable t;
	    unsigned long i;
	    char *base, *addr;	
	    int isNew;
	    char *mesg;

	    Tcl_InitHashTable(&t, TCL_ONE_WORD_KEYS);
	    base = 0xFFFFFF000000000UL;
	    for (i = 0; i < 100; i++) {
	        addr = base + i * 0x100000000;
	        hPtr = Tcl_CreateHashEntry(&t, addr, &isNew);

	    }
	    mesg = Tcl_HashStats(&t);
	    fprintf(stderr, "Stats\n%s\n", mesg);
	    free((char *)mesg;

Note that the keys all have zeros in the lower 32 bits.  All 100
entries will hash to the same value.  Driving the need for 64-bit
systems is the ability to address more memory.  So it's imperative
that Tcl hash table be able to hash large virtual addresses.

Building upon the current hash table implementation, the following
sections describe specific areas for improvement:

	* improved array/structure hashing

	* better memory performance 

	* support for 64-bit platforms

The goal is an improved Tcl hash table for internal Tcl and Tk code,
but also high performance applications.

# Improved Array/Structure Hashing

The Tcl hash table handles three types of hash keys: string keys, one
word keys, and multi-word keys.  Each key type has its own hash
function associated with it.  The benefit of this approach is that
better hash functions can be used for the specific types, than one
general function for all types.  The string and one word hash
functions are very good for typical keys.  The multi-word or array
hash is not as good.

The array hash sums the each word of the array and then randomizes
the result.  

	    for (index = 0, count = tablePtr->keyType, iPtr1 = arrayPtr;
		    count > 0; count--, iPtr1++) {
		index += *iPtr1;

	    }
	    index = RANDOM_INDEX(tablePtr, index);

This works poorly for many types of hash keys.  For an contrived
example of hashing 1 million 3D coordinates,

	    typedef struct {
	       double x, y, z;
	    }  Double3;
	    double d3;

	    Tcl_InitHashTable(&t, sizeof(Double3) / sizeof(int));

	    for (i = 0; i < 100; i++) {
		for (j = 0; j < 100; j++) {
		    for (k = 0; k < 100; k++) {
			d3.x = (double)i;
			d3.y = (double)j;
			d3.z = (double)k;
			hPtr = Tcl_CreateHashEntry(&t, (char *)&d3, &isNew);

		    }
		}
	    }

we get a hash table with an average search distance of 1082.3.  The
maximum distance is 3324!  Replacing the hash function with Bob
Jenkins' <http://burtleburtle.net/bob>  32-bit mixing function

	   #define MIX32(a,b,c) \
		a -= b, a -= c, a ^= (c >> 13), \
		b -= c, b -= a, b ^= (a <<  8), \
		c -= a, c -= b, c ^= (b >> 13), \
		a -= b, a -= c, a ^= (c >> 12), \
		b -= c, b -= a, b ^= (a << 16), \
		c -= a, c -= b, c ^= (b >>  5), \
		a -= b, a -= c, a ^= (c >>  3), \
		b -= c, b -= a, b ^= (a << 10), \
		c -= a, c -= b, c ^= (b >> 15)

	   int a, b, c, len;

	   len = length;
	   a = b = GOLDEN_RATIO32;	/* An arbitrary value */
	   c = 0;			/* Previous hash value */

	   while (len >= 3) {		/* Handle most of the key */
		a += key[0];
		b += key[1];
		c += key[2];
		MIX32(a, b, c);
		key += 3; len -= 3;

	    }
	    c += length;		
	    switch(len) {
	    case 2 : b += key[1];
	    case 1 : a += key[0];

	    }
	    MIX32(a, b, c);
	    return c;

yields a table with an average search distance of 1.48.  The maximum
distance is 8.  The Jenkins' hash function provides good results for
many different types of arrays and structures.  The disadvantage is
that the hash function is slightly more expensive to compute.  

# Improving RebuildTable.

The cost of computing a hash function is especially felt each time
table is rebuilt as new entries are added.  The _RebuildTable_ function
calls the hash function of each entry to recompute its new location in
the bigger table.

	    for (oldChainPtr = oldBuckets; oldSize > 0; oldSize--, oldChainPtr++) {
		for (hPtr = *oldChainPtr; hPtr != NULL; hPtr = *oldChainPtr) {
		    *oldChainPtr = hPtr->nextPtr;
		    if (tablePtr->keyType == TCL_STRING_KEYS) {
			index = HashString(hPtr->key.string) & tablePtr->mask;
		    } else if (tablePtr->keyType == TCL_ONE_WORD_KEYS) {
			index = RANDOM_INDEX(tablePtr, hPtr->key.oneWordValue);
		    } else {
			index = HashArray(hPtr->key.words, tablePtr->keyType) &
				tablePtr->mask;

		    }
		    hPtr->bucketPtr = &(tablePtr->buckets[index]);
		    hPtr->nextPtr = *hPtr->bucketPtr;
		    *hPtr->bucketPtr = hPtr;

		}
	    }

The new bucket location is then stored in the hash entry.

Except for one word keys, the hash value is invariant of the table
size.  If the hash value was stored with each entry, then it would not
need to be recomputed each time the table is rebuilt.

	    for (oldChainPtr = oldBuckets; oldSize > 0; oldSize--, oldChainPtr++) {
		for (hPtr = *oldChainPtr; hPtr != NULL; hPtr = *oldChainPtr) {
		    *oldChainPtr = hPtr->nextPtr;
		    if (tablePtr->keyType == TCL_ONE_WORD_KEYS) {
			index = RANDOM_INDEX(tablePtr, hPtr->key.oneWordValue);
		    } else {
			index = hPtr->hval & tablePtr->mask;

		    }
		    hPtr->bucketPtr = &(tablePtr->buckets[index]);
		    hPtr->nextPtr = *hPtr->bucketPtr;
		    *hPtr->bucketPtr = hPtr;

		}
	    }

This would increase size of an hash entry, except that the pointer to
the hash bucket is now redundant, since it can cheaply be computed.

	    bucketPtr = tablePtr->buckets + (hPtr->hval & tablePtr->mask);

An added benefit is that hash table lookups become faster and easier
to perform.  If there is more than one hash entry in a bucket, you
don't need to examine the key unless the entry has the same hash
value.

	    for (hPtr = tablePtr->buckets[hindex]; hPtr != NULL;
		    hPtr = hPtr->nextPtr) {
	       /* Don't look at entry unless the hash value is the same. */
		if (hPtr->hval == hval) { 
		    register int *iPtr1, *iPtr2;
		    int count;

		    for (iPtr1 = arrayPtr, iPtr2 = (int *)hPtr->key.words,
			     count = tablePtr->keyType; ; count--, iPtr1++, iPtr2++) {
			if (count == 0) {
			    return hPtr;

			}
			if (*iPtr1 != *iPtr2) {
			    break;

			}
		    }
		}
	    }

_Don Porter <[email protected]>_

 > _It appears that the recommendations of this section have already
   been implemented in Tcl 8.4.  In particular, when the symbol
   TCL\_HASH\_KEY\_STORE\_HASH == 1 \(as it does by default\), then the
   hash value is stored in each entry instead of the bucketPtr._

 > _If that is correct, then I recommend this section of the TIP be
   removed.  If not, more detail about how this proposal differs
   from the 8.4 implementation, and an argument why the proposal
   is superior are in order._

# Better Memory Performance 

One enduring complaint of the Tcl hash table on comp.lang.tcl is its
unexpected memory costs.  A table of 1 million one word key entries
uses over 36 Megabytes.

A hash entry is 20 bytes long.

	 typedef struct Tcl_HashEntry {
	    struct Tcl_HashEntry *nextPtr;	/* Pointer to next entry in this
						 * hash bucket, or NULL for end of
						 * chain. */
	    struct Tcl_HashTable *tablePtr;	/* Pointer to table containing entry. */
	    struct Tcl_HashEntry **bucketPtr;	/* Pointer to bucket that points to
						 * first entry in this entry's chain:
						 * used for deleting the entry. */
	    ClientData clientData;		/* Application stores something here
						 * with Tcl_SetHashValue. */
	    union {				/* Key has one of these forms: */
		char *oneWordValue;		/* One-word value for key. */
		int words[1];			/* Multiple integer words for key.
						 * The actual size will be as large
						 * as necessary for this table's
						 * keys. */
		char string[4];			/* String for key.  The actual size
						 * will be as large as needed to hold
						 * the key. */
	    } key;				/* MUST BE LAST FIELD IN RECORD!! */
	 } Tcl_HashEntry;

Each entry stores a pointer to its hash table.  This field is used
only for deleting a hash entry.  But if the hash table is passed to
_Tcl\_DeleteHashEntry_, the hash entry can be reduced to 16 bytes.
Inspecting Tcl/Tk code, I could not find a case where the hash table
was not easily available to pass as a parameter.

Each hash entry is allocated using _malloc_.  System memory
allocators typically add 8-16 bytes overhead for each allocation.
Worse, calls to _malloc_ and _free_ tend to dominate the cost of
large hash tables.  _Tcl\_DeleteHashTable_ becomes very slow, freeing
hash entries scattered across pages of virtual memory.

For large hash tables, a pool allocation scheme can improve both
reduce the amount of memory used and improve memory performance.  By
allocating memory in larger chunks, the number of _malloc_ and
_free_ calls is dramatically reduced.  Fixed size allocators \(one
word keys and array keys\) can also reclaim and reuse memory from
deleted entries.

The disadvantage of pool allocation is that memory is not released
until the hash table is deleted.  This is less of an issue for large
tables which tend to grow to a steady-state size.  Both Tcl and Tk use
hash tables to keep track of small amounts of information that
probably don't pool allocation.

So to retain compatibility, a new specialized initialization routine
can be used to indicate when to use pool-based allocation.

	    Tcl_InitHashTableWithPool(&table, TCL_ONE_WORD_KEYS);

The standard _Tcl\_InitHashTable_ call

	    Tcl_InitHashTable(&table, TCL_ONE_WORD_KEYS);

will still use _malloc_ and _free_.

# Support for 64-bit Platforms.

While the C language makes no guarantees of a type's size or its
relation to other types, current programming practice assumes that
integers, longs, and pointers are all 32 bits long.  This, of course,
changes with 64-bit systems where pointers are 64-bits wide.
Depending upon the programming model, longs and ints may or may not be
64 bits too.

		 Datatype      LP64    ILP64   LLP64   ILP32   LP32
		  char           8       8       8       8       8
		  short         16      16      16      16      16
		  _int32                        32
		 int            32      64      32      32      16
		 long           64      64      32      32      32
		 long long                      64
		 pointer        64      64      64      32      32

ILP32 is typical for 32 bit systems.  Windows 3.1 was a LP32 model.

In the LP64 model, pointers and longs are 64 bits, but ints remain 32
bits wide.  The LLP model retains the 32-bits for ints and longs, but
adds a 64-bit "long long" type.  Most 64-bit Unix systems \(Solaris,
HP-UX, Tru64, AIX\) are LP64.  I believe that Win64 is LLP.

The first problem is that addresses are now 64-bits, not 32.  This
means that existing code such as

	    Tcl_InitHashTable(&table, TCL_ONE_WORD_KEYS);

	    ptr = CreateSomeObject();
	    hPtr = Tcl_CreateHashEntry(&table, (void *)ptr, &isNew);

can possibly fail because the 32-bit one word hash function can't
properly hash the 64-bit pointer address.

_Don Porter <[email protected]>_

 > _Pardon the interruption, but I do not understand what is meant
   by the assertion that hashing of 64-bit pointers "can possibly
   fail".  I've used Tcl on a 64-bit Alpha system for years, hashing
   64-bit pointers the whole time.  What failures should I be seeing?_

	 #define RANDOM_INDEX(tablePtr, i) \
	    (((((long) (i))*1103515245) >> (tablePtr)->downShift) & (tablePtr)->mask)

The above one word hash function can be replaced with a 64-bit version
of Donald Knuth's multiplicative hash function.

	    ((key * GOLDEN_RATIO64) >> downShift) & tablePtr->mask)

where downShift is 64 - log2\(tableSize\) and the GOLDEN\_RATION64 is a
prime approximately equal to \(sqrt\(64\) - 1\) / 2.  

The 64-bit array function is again from Bob Jenkins.  This time it's a
64-bit mixing function.

	 #define MIX64(a,b,c) \
	 	a -= b, a -= c, a ^= (c >> 43), \
	 	b -= c, b -= a, b ^= (a <<  9), \
	 	c -= a, c -= b, c ^= (b >>  8), \
	 	a -= b, a -= c, a ^= (c >> 38), \
	 	b -= c, b -= a, b ^= (a << 23), \
	 	c -= a, c -= b, c ^= (b >>  5), \
	 	a -= b, a -= c, a ^= (c >> 35), \
	 	b -= c, b -= a, b ^= (a << 49), \
	 	c -= a, c -= b, c ^= (b >> 11), \
	 	a -= b, a -= c, a ^= (c >> 12), \
	 	b -= c, b -= a, b ^= (a << 18), \
	 	c -= a, c -= b, c ^= (b >> 22)

	     uint64_t a, b, c, len;

	     len = length;
	     a = b = GOLDEN_RATIO64;	/* An arbitrary value */
	     c = 0;			/* Previous hash value */

	     while (len >= 3) {	/* Handle most of the key */
	 	a += key[0];
	 	b += key[1];
	 	c += key[2];
	 	MIX64(a,b,c);
	 	key += 3; len -= 3;

	     }
	     c += length;		
	     switch(len) {
	     case 2 : b += key[1];
	     case 1 : a += key[0];

	     }
	     MIX64(a,b,c);
	     return c;

Note that it also takes advantage of the 64-bit word size.

# Summary

The following improvements to the current Tcl hash table have been
suggested.  

 * Replace the current array hash function.

 * Replace the bucket pointer in the hash entry with its hash value.
   This allows the table to be rebuilt without rehashing each entry.
   It also speeds bucket searches.

 * Remove the tablePtr from the hash entry, a 20% savings.  This
   requires that callers of _Tcl\_DeleteHashEntry_ pass the hash
   table as a parameter.

 * Allow the hash table to use fixed or variable size pool allocation
   since _malloc_ and _free_ costs dominate large tables.  Pool
   allocation substantial speeds large tables while also saving 8-16
   bytes per entry.  This can be done while still providing the normal
   _malloc_/_free_ versions.

 * Support 64-bit platforms.  This requires 64-bit versions of one
   word and array hash functions.

The suggested changes are nothing new and can be found in most hash
table implementations.  This work builds on the already solid
foundation of the current hash table.  With the above improvements,
the Tcl hash table can be used in high performance applications.  It
also adds a useful piece to the 64-bit Tcl/Tk port.

I've created and tested a new hash table implementation under the
following systems.

		System			32     64
		linux-ix86-gcc		x	
		Solaris-v9-cc		x	x
		Solaris-v9-gcc		x	x
		HPUX-11-cc		x	x
	 	HPUX-11-gcc		x
		Win2k			x

It will be made publicly available on SourceForge.

# Hashing of Malicious Strings

_Donal K. Fellows adds:_

In 2003 a possible denial-of-service attack on hash tables was published
that operated by making a majority of keys map to the same bucket.  While
this would not make the hashes function incorrectly - there would be no
extra memory consumed or incorrect accesses to memory - it still permits
an attacker to escalate the cost of hash accesses from O\(1\) to O\(n\) in
the normal case \(and with obvious knock-on effects for the order of other
algorithms\) and so mount an attack out-of-scale with the effort required
to set the attack up.

The way to fix this is to use a different hashing function for string
hashing that varies the exact hashing algorithm on a table-by-table
basis, and to base that algorithm on a hashing function with better
spectral properties than Tcl's current \(extremely simple\) one.  An
algorithm that might be suitable for such uses is described online
<http://burtleburtle.net/bob/hash/evahash.html>  though the code would
need substantial adaption \(including the addition of a fairly strong
random number generator\) before being placed in the core.

# Copyright

This document has been placed in the public domain.

Name change from tip/7.tip to tip/7.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236

237
238
239
240

241
242
243
244

245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288

289
290

291
292
293
294

295
296
297
298
299
300
301
302
303
304

305
306
307
308

309
310
311
312
313
314
315
316

317
318
319
320
321
322
323
324
325
326
327
328
329
330

331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355

356
357
358
359
360
361
362
363
364

365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390

391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408

409
410
411
412
413

414
415
416
417
418
419
420
421

422
423
424
425
426
427
428
429
430

431
432
433
434
435
436
437
438
439
440
441
442
443

444
445

446
447
448

449
450

451
452
453
454

455
456

457
458
459

460
461
462

463
464
465

466
467
468
469
470

471
472
473
474
475
476
477
478
479

480
481
482
483

484
485
486
487
488

489
490
491
492
493
494
495
496

497
498

499
500
501
502

503
504

505
506
507
508
509
510

511
512

513
514
515
516
517
518
519
520
521

522
523

524
525
526
527
528
529

530
531
532
533

534
535
536
537
538
539
540

541
542

543
544
545
546
547
548

549
550
551
552
553

554
555
556

557
558
559
560

561

562
563
564
565
566
567

568
569
570

571
572
573
574
575

576
577
578
579
580
581
582
583
584

585
586
587
588
589
590
591

592
593
594
595
596
597

598
599
600
601
602
603
604

605
606
607
608

609
610
611
612
613

614
615
616
617
618
619

620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636

637
638
639
640
641
642

643
644
645
646
647

648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667

668
669

670
671
672
673
674

675
676
677

678
679
680
681

682
683
684
685

686
687
688

689
690

691
692
693
694
695
696
697
698
699
700

701
702
703

704
705

706
707

708
709
710
711
712

713
714
715
716
717
718

719
720
721
722

723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749

750
751

752
753
754
755
756

757
758
759
760
761
762

763
764

765
766
767

768
769
770
771
772

773
774

775
776

777
778

779
780
781

782
783
784

785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804

805
806

807
808
809
810
811
812
813
814
815
816
817
818
819
820

821
822
823
824

825
826
827

828
829
830
831

832
833

834
835
836

837
838

839
840

841
842

843
844
845
846
847

848
849

850
851
852
853
854
855

856
857
858
859
860
861

862
863
864
865
866

867
868
869
870
871
872
873
874

875
876
877
878
879
880

881
882
883

884
885
886
887

888
889
890
891
892

893
894
895

896
897
898
899
900
901
902
903
904
905

906
907
908

909
910

911
912
913
914
915
916

917
918
919

920
921
922
923

924
925
926
927
928
929
930

931
932
933

934
935
936
937
938
939
940
941
942
943
944
945

946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967

968
969
970

TIP:            7
Title:          Increased resolution for TclpGetTime on Windows
Version:        $Revision: 1.4 $
Author:         Kevin Kenny <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        26-Oct-2000
Tcl-Version:    8.4
Discussions-To: news:comp.lang.tcl
Post-History: 

~ Abstract

Tcl users on the Windows platform have long been at a disadvantage in
attempting to do code timing studies, owing to the poor resolution of
the Windows system clock.  The ''time'' command, the ''clock clicks''
command, and all related functions are limited to a resolution of
(typically) 10 milliseconds.  This proposal offers a solution based on
the Windows performance counter.  It presents a means of disciplining
this counter to the system clock so that ''TclpGetTime'' (the
underlying call that the above commands use) can return times to
microsecond precision with accuracy in the tens of microseconds.

~ Change history

''2 November 2000:'' Modified the TIP to discuss the issues surrounding
the fact that some multiprocessor kernels on Windows NT use the CPU timestamp
counter as a performance counter.  Modified the proposed patch to test for
the two frequencies in common use on 8254-compatible real-time clocks, and
enable using the performance counter only if its frequency matches one of
them.  Included the proposed patch inline for review rather than as a
pointer off to dejanews.

~ Rationale

The Windows implementation of ''TclpGetTime'', as of Tcl 8.3.2, uses
the ''ftime'' call in the C library to extract the current system
clock in seconds and milliseconds.  While this time value has
millisecond precision, its actual resolution is limited by the tick
rate of the Windows system clock, normally 100 Hz.  Similarly,
''TclpGetClicks'' uses the ''GetTickCount'' function of
''kernel32.dll'' to get the number of milliseconds since bootload;
once again, the actual resolution of this call is limited to the tick
rate of the system clock.

The Windows Platform APIs offer several timers of different accuracy.
The most precise of these is ''QueryPerformanceCounter'', which
operates at an unspecified frequency (returned by
''QueryPerformanceFrequency'') that is typically about 1.19 MHz.
[http://support.microsoft.com/support/kb/articles/Q172/3/38.asp] has
details of the call, with sample code.

The documentation for Windows suggests that this function is available
only on certain versions of the operating system; in fact, it is
implemented in every extant version of Win32 with the exception of
Win32s and Windows CE 1.0.  Since Visual C++ 6, on which the Tcl
distribution depends, will no longer compile code for those two
platforms, I assert that they may be safely ignored.

The documentation for Windows also states that
''QueryPerformanceCounter'' is available only on certain hardware.  In
practice, this is not an issue; I have never encountered a Windows
implementation on an x86 platform that lacks it, and Alpha has it as
well.  In any case, the reference implementation tests for the success
or failure of the system calls in question, and falls back on the old
way of getting time should they return an error indication.  Users of
any platform on which the performance counter is not supported should
therefore be no worse off than they have ever been.

A worse problem with the performance counter is that its frequency is
poorly calibrated, and is frequently off by as much as 200 parts per
million.  Moreover, the frequency drifts over time, frequently having
a sensitive dependency to temperatures inside the computer's case.

This problem is not insurmountable.  The fix is to maintain the
observed frequency of the performance counter (calibrated against the
system clock) as a variable at run time, and use that variable
together with the value of the performance counter to derive Tcl's
concept of the time.  This technique is well known to electronic
engineers as the "phase locked loop" and is used in network protocols
such as NTP[http://www.eecis.udel.edu/~ntp/].

One problem that is apparently insurmountable is that certain
multiprocessor systems have hardware abstraction layers that derive
the performance counter from the CPU timestamp counter in place of a
real-time clock reference.  This implementation causes the performance
counter on one CPU to drift with respect to the other over time; if a
thread is moved from one processor to another, it cannot derive a
meaningful result from comparing two successive values of the counter.
Moreover, if the CPU clock uses a "gearshift" technique for power
management (as on Intel SpeedStep or Transmeta machines), the CPU
timestamp counter ticks at a non-constant rate.

The proposed implementation addresses the problem by using the
performance counter only if its nominal frequency is either 1.193182
MHz or 3.579545 MHz.  These two frquencies are the common rates when
8254-compatible real-time clock chips are used; virtually all PCI bus
controllers have such chips on board.  This solution therefore adapts
to the vast majority of workstation-class Windows boxes, and is
virtually certain to exclude implementations derived from the CPU
clock since no modern CPU is that slow.  

The patch has been tested on several desktop and laptop machines from
Compaq, Dell, Gateway, HP, Micron, and Packard Bell, with processors
ranging from a 50 MHz 486 to a 750 MHz Pentium III, including laptops
using SpeedStep technology.  It passes the clock-related test cases on
all these platforms; it falls back to the old clocks with 10-ms
precision on multiprocessor servers from Compaq and HP.  (Using the
performance counter actually would have worked on the HP server, which
apparently has some way of making sure that the results of
''QueryPerformanceCounter'' are consistent from one CPU to another.
The performance counter on the Compaq machine was observed to be
inconsistent between the two CPU's.)

~ Specification

This document proposes the following changes to the Tcl core:

   1.  (tclWinTime.c) Add to the static data a set of variables that
       manage the phase-locked techniques, including a
       ''CRITICAL_SECTION'' to guard them so that multi-threaded code
       is stable.

   2.  (tclWinTime.c) Modify ''TclpGetSeconds'' to call
       ''TclpGetTime'' and return the 'seconds' portion of the result.
       This change is necessary to make sure that the two times are
       consistent near the rollover from one second to another.

   3.  (tclWinTime.c) Modify ''TclpGetClicks'' to use
       TclpGetTime to determine the click count as a number of
       microseconds.

   4.  (tclWinTime.c) Modify ''TclpGetTime'' to return the time as
       M*Q+B, where Q is the result of ''QueryPerformanceCounter'',
       and M and B are variables maintained by the phase-locked loop
       to keep the result as close as possible to the system clock.
       The ''TclpGetTime'' call will also launch the phase-lock
       management in a separate thread the first time that it is
       invoked.  If the performance counter is unavailable,
       or if its frequency is not one of the two common 8254-compatible
       rates, then
       ''TclpGetTime'' will return the result of ''ftime'' as it does
       in Tcl 8.3.2.

   5.  (tclWinTime.c) Add the clock calibration procedure.  The
       calibration is somewhat complex; to save space, the reader is
       referred to the reference implementation for the details of how
       the time base and frequency are maintained.

   6.  (tclWinNotify.c) Modify ''Tcl_Sleep'' to test that the process
       has, in fact, slept for the requisite time by calling
       ''TclpGetTime'' and comparing with the desired time.
       Otherwise, roundoff errors may cause the process to awaken
       early.

   7.  (tclWinTest.c) Add a ''testwinclock'' command.  This command
       returns a four element list comprising the seconds and
       microseconds portions of the system clock and the seconds and
       microseconds portions of the Tcl clock.

   8.  (winTime.test) Add to the test suite a test that makes sure
       that the Tcl clock stays within 1.1 ms of the system clock over
       the duration of the test.

~ Reference implementation

This change was submitted as a patch to the old bug-tracking system at
Scriptics [http://www.deja.com/getdoc.xp?AN=666545441&fmt=text].  It
is being recycled as a TIP now that the Tcl Core Team is in place,
since the process for advancing the old patches to the Core is not
well defined.  The link above should not be used to retrieve
the current version of the patch, which appears below as an Appendix.

Tests on several Wintel boxes have shown that the initial startup
transient is less than about 10 seconds (during which time the Tcl
clock may be running 500 ppm fast or slow to bring it into step);
following this period, the motion of the Tcl clock is highly
repeatable and uniform.

If the system clock changes by more than 1 second during a run, as
when the operator sets it using the eyeball-and-wristwatch method, the
method of adjusting the performance frequency to preserve monotonicity
and accuracy of interval measurements is hopeless.  This is the only
case where the Tcl clock is allowed to jump.

The startup of the calibration loop does not introduce new
instabilities in the behavior of [[clock clocks]] or ''TclpGetTime''.

[[clock clicks]] and other times that derive from
''TclpGetTime'' also ought to be reliable from the beginning -
assuming that ''QueryPerformanceFrequency'' actually matches the
crystal.  The worst case while the initial calibration is going on
ought to be that the Tcl clock runs 0.1% fast or slow.  The point of
the calibration loop is to correct for long-term drift.

The problem, otherwise, is that ''QueryPerformanceFrequency'' may be
off by some tens of parts per million with respect to the system
clock.  Over a period of days, that would cause the Tcl clock to veer
off from the system clock.  For instance, once my machine is warmed up
(temperature is significant, believe it or not),
''QueryPerformanceFrequency'' is consistently 0.99985 of the correct
value; without calibration, the performance-counter-derived clock
drifts 13 seconds per day.

The ''capture transient'' of the calibration loop is a little
different every time, but the one shown below is typical.  The Tcl
time starts out 2 ms fast with respect to the system time, and the
initial estimate of performance frequency is off, too.  At 2 seconds
in, the calibration loop takes over and makes the clock run 0.1% slow
to bring it in line; by 5 seconds in, it's lined up.  There's some
phase noise over the next 40 seconds or so, by which time the
performance frequency is locked on quite closely. The outliers above
the line represent the fact that [[after]] events sometimes arrive
late because of various other things going on in Windows.

#image:7capture Typical capture transient

The script that gathered the raw data plotted above appears below.

|foreach { syssecs sysusec tclsecs tclusec } [testwinclock] {}
|set basesecs $syssecs
|set baseusec $sysusec
|set nTrials 10000
|for { set i 0 } { $i < $nTrials } { incr i } {
|    set values {}
|    for { set j 0 } { $j < 5 } { incr j } {
|	foreach { syssecs sysusec tclsecs tclusec } [testwinclock] {}
|	set systime [expr { ($syssecs - $basesecs)
|			    + 1.0e-6 * $sysusec - 1.0e-6 * $baseusec }]
|	set tcltime [expr { ($tclsecs - $basesecs)
|			    + 1.0e-6 * $tclusec - 1.0e-6 * $baseusec }]
|	set timediff [expr { $tcltime - $systime }]
|	lappend values [list $systime $timediff $tcltime]
|	after 1
|    }

|    foreach { elapsed timediff tcltime } \
|	[lindex [lsort -real -index 1 $values] 0] {}
|    lappend history $elapsed $timediff $tcltime
|}

|set f [open ~/test2.dat w]
|foreach { elapsed timediff tcltime} $history {
|    puts $f "$elapsed\t$timediff\t$tcltime"
|}

|close $f

To quantify how reproducible the measurements are, I threw a patched
tclsh the torture test of executing [[time {}]] ten million times, and
made a histogram of the results.  The figure below shows the results.
The dots represent individual sample bins, and the solid line is the
cumulative count of samples.  The vast majority of samples show either
five or six microseconds. 99.9% take fewer than nine.  There are many
samples that take longer, owing to either servicing interrupts or
losing the processor to other processes.

The lines at 21, 31 and 42 microseconds show up in repeated runs on my
machine; I suspect that they represent time spent servicing different
sorts of video interrupts.  It's less clear to me what the other
outliers might be; Windows has a tremendous amount of stuff going on
even when it's apparently idle.

#image:7histogram Histogram of results of [time {}].

All tests in the test suite continue to pass with the patch applied.

~ Notes

If you care about time to the absolute precision that this change can
achieve, it is of course necessary to discipline the Windows system
clock as well.  Perhaps the best way is to use one of the available
NTP packages ([http://www.eecis.udel.edu/~ntp/] for further
information).

~ Copyright

This document has been placed in the public domain.

~ Appendix

The proposed set of patches to the Tcl 8.3.2 code base appears here.

|*** ../tcl8.3.2base/src/tcl8.3.2/win/tclWinNotify.c Fri Jul  2 18:08:30 1999
|--- ./src/tcl8.3.2/win/tclWinNotify.c Thu Aug 24 23:29:12 2000
|***************
|*** 510,514 ****
|  Tcl_Sleep(ms)
|      int ms;			/* Number of milliseconds to sleep. */
|  {

|!     Sleep(ms);
|  }

|--- 510,548 ----
|  Tcl_Sleep(ms)
|      int ms;			/* Number of milliseconds to sleep. */
|  {

|!     /*
|!      * Simply calling 'Sleep' for the requisite number of milliseconds
|!      * can make the process appear to wake up early because it isn't
|!      * synchronized with the CPU performance counter that is used in
|!      * tclWinTime.c.  This behavior is probably benign, but messes
|!      * up some of the corner cases in the test suite.  We get around
|!      * this problem by repeating the 'Sleep' call as many times
|!      * as necessary to make the clock advance by the requisite amount.
|!      */
|! 

|!     Tcl_Time now;		/* Current wall clock time */
|!     Tcl_Time desired;		/* Desired wakeup time */
|!     int sleepTime = ms;		/* Time to sleep */
|! 

|!     TclpGetTime( &now );
|!     desired.sec = now.sec + ( ms / 1000 );
|!     desired.usec = now.usec + 1000 * ( ms % 1000 );
|!     if ( desired.usec > 1000000 ) {
|! 	++desired.sec;
|! 	desired.usec -= 1000000;
|!     }
|! 	

|!     for ( ; ; ) {
|! 	Sleep( sleepTime );
|! 	TclpGetTime( &now );
|! 	if ( now.sec > desired.sec ) {
|! 	    break;
|! 	} else if ( ( now.sec == desired.sec )
|! 	     && ( now.usec >= desired.usec ) ) {
|! 	    break;
|! 	}
|! 	sleepTime = ( ( 1000 * ( desired.sec - now.sec ) )
|! 		      + ( ( desired.usec - now.usec ) / 1000 ) );
|!     }
|! 
|  }

|*** ../tcl8.3.2base/src/tcl8.3.2/win/tclWinTest.c Thu Oct 28 23:05:14 1999
|--- ./src/tcl8.3.2/win/tclWinTest.c Mon Sep  4 22:45:56 2000
|***************
|*** 22,27 ****
|--- 22,31 ----
|  static int	TestvolumetypeCmd _ANSI_ARGS_((ClientData dummy,
|	 Tcl_Interp *interp, int objc,
|	 Tcl_Obj *CONST objv[]));
|+ static int      TestwinclockCmd _ANSI_ARGS_(( ClientData dummy,
|+ 					      Tcl_Interp* interp,
|+ 					      int objc,
|+ 					      Tcl_Obj *CONST objv[] ));
|  
|  /*
|   *----------------------------------------------------------------------
|***************
|*** 52,57 ****
|--- 56,63 ----
|	       (ClientData) 0, (Tcl_CmdDeleteProc *) NULL);
|      Tcl_CreateObjCommand(interp, "testvolumetype", TestvolumetypeCmd,
|	       (ClientData) 0, (Tcl_CmdDeleteProc *) NULL);
|+     Tcl_CreateObjCommand(interp, "testwinclock", TestwinclockCmd,
|+             (ClientData) 0, (Tcl_CmdDeleteProc *) NULL);
|      return TCL_OK;
|  }

|  
|***************
|*** 187,190 ****
|--- 193,267 ----
|      Tcl_SetResult(interp, volType, TCL_VOLATILE);
|      return TCL_OK;
|  #undef VOL_BUF_SIZE
|+ }
|+ 

|+ /*
|+  *----------------------------------------------------------------------
|+  *
|+  * TestclockCmd --
|+  *
|+  *	Command that returns the seconds and microseconds portions of
|+  *	the system clock and of the Tcl clock so that they can be
|+  *	compared to validate that the Tcl clock is staying in sync.
|+  *
|+  * Usage:
|+  *	testclock
|+  *
|+  * Parameters:
|+  *	None.
|+  *
|+  * Results:
|+  *	Returns a standard Tcl result comprising a four-element list:
|+  *	the seconds and microseconds portions of the system clock,
|+  *	and the seconds and microseconds portions of the Tcl clock.
|+  *
|+  * Side effects:
|+  *	None.
|+  *
|+  *----------------------------------------------------------------------
|+  */
|+ 

|+ static int
|+ TestwinclockCmd( ClientData dummy,
|+ 				/* Unused */
|+ 		 Tcl_Interp* interp,
|+ 				/* Tcl interpreter */
|+ 		 int objc,
|+ 				/* Argument count */
|+ 		 Tcl_Obj *CONST objv[] )
|+ 				/* Argument vector */
|+ {
|+     CONST static FILETIME posixEpoch = { 0xD53E8000, 0x019DB1DE };
|+ 				/* The Posix epoch, expressed as a
|+ 				 * Windows FILETIME */
|+     Tcl_Time tclTime;		/* Tcl clock */
|+     FILETIME sysTime;		/* System clock */
|+     Tcl_Obj* result;		/* Result of the command */
|+     LARGE_INTEGER t1, t2;
|+ 

|+     if ( objc != 1 ) {
|+ 	Tcl_WrongNumArgs( interp, 1, objv, "" );
|+ 	return TCL_ERROR;
|+     }
|+ 

|+     TclpGetTime( &tclTime );
|+     GetSystemTimeAsFileTime( &sysTime );
|+     t1.LowPart = posixEpoch.dwLowDateTime;
|+     t1.HighPart = posixEpoch.dwHighDateTime;
|+     t2.LowPart = sysTime.dwLowDateTime;
|+     t2.HighPart = sysTime.dwHighDateTime;
|+     t2.QuadPart -= t1.QuadPart;
|+ 

|+     result = Tcl_NewObj();
|+     Tcl_ListObjAppendElement
|+ 	( interp, result, Tcl_NewIntObj( (int) (t2.QuadPart / 10000000 ) ) );
|+     Tcl_ListObjAppendElement
|+ 	( interp, result,
|+ 	  Tcl_NewIntObj( (int) ( (t2.QuadPart / 10 ) % 1000000 ) ) );
|+     Tcl_ListObjAppendElement( interp, result, Tcl_NewIntObj( tclTime.sec ) );
|+     Tcl_ListObjAppendElement( interp, result, Tcl_NewIntObj( tclTime.usec ) );
|+ 

|+     Tcl_SetObjResult( interp, result );
|+ 
|+     return TCL_OK;
|  }
|*** ../tcl8.3.2base/src/tcl8.3.2/win/tclWinTime.c Tue Nov 30 19:08:44 1999
|--- ./src/tcl8.3.2/win/tclWinTime.c Thu Nov  2 14:25:56 2000
|***************
|*** 38,47 ****
|--- 38,114 ----
|  static Tcl_ThreadDataKey dataKey;
|  
|  /*
|+  * Calibration interval for the high-resolution timer, in msec

|+  */
|+ 

|+ static CONST unsigned long clockCalibrateWakeupInterval = 10000;
|+ 				/* FIXME: 10 s -- should be about 10 min! */
|+ 

|+ /*
|+  * Data for managing high-resolution timers.

|+  */
|+ 
|+ typedef struct TimeInfo {
|+ 

|+     CRITICAL_SECTION cs;	/* Mutex guarding this structure */
|+ 

|+     int initialized;		/* Flag == 1 if this structure is
|+ 				 * initialized. */
|+ 

|+     int perfCounterAvailable;	/* Flag == 1 if the hardware has a
|+ 				 * performance counter */
|+ 

|+     HANDLE calibrationThread;	/* Handle to the thread that keeps the
|+ 				 * virtual clock calibrated. */
|+ 

|+     HANDLE readyEvent;		/* System event used to
|+ 				 * trigger the requesting thread
|+ 				 * when the clock calibration procedure
|+ 				 * is initialized for the first time */
|+ 

|+     /*
|+      * The following values are used for calculating virtual time.
|+      * Virtual time is always equal to:
|+      *    lastFileTime + (current perf counter - lastCounter) 
|+      *				* 10000000 / curCounterFreq
|+      * and lastFileTime and lastCounter are updated any time that
|+      * virtual time is returned to a caller.
|+      */
|+ 

|+     ULARGE_INTEGER lastFileTime;
|+     LARGE_INTEGER lastCounter;
|+     LARGE_INTEGER curCounterFreq;
|+ 

|+     /* 
|+      * The next two values are used only in the calibration thread, to track
|+      * the frequency of the performance counter.
|+      */
|+ 

|+     LONGLONG lastPerfCounter;	/* Performance counter the last time
|+ 				 * that UpdateClockEachSecond was called */
|+     LONGLONG lastSysTime;	/* System clock at the last time
|+ 				 * that UpdateClockEachSecond was called */
|+     LONGLONG estPerfCounterFreq;
|+ 				/* Current estimate of the counter frequency
|+ 				 * using the system clock as the standard */
|+ 

|+ } TimeInfo;
|+ 

|+ static TimeInfo timeInfo = {
|+     NULL, 0, 0, NULL, NULL, 0, 0, 0, 0, 0
|+ };
|+ 

|+ CONST static FILETIME posixEpoch = { 0xD53E8000, 0x019DB1DE };
|+     

|+ /*
|   * Declarations for functions defined later in this file.
|   */
|  
|  static struct tm *	ComputeGMT _ANSI_ARGS_((const time_t *tp));
|+ 

|+ static DWORD WINAPI     CalibrationThread _ANSI_ARGS_(( LPVOID arg ));
|+ 

|+ static void 		UpdateTimeEachSecond _ANSI_ARGS_(( void ));
|  
|  /*
|   *----------------------------------------------------------------------
|***************
|*** 63,69 ****
|  unsigned long
|  TclpGetSeconds()
|  {

|!     return (unsigned long) time((time_t *) NULL);
|  }

|  
|  /*
|--- 130,138 ----
|  unsigned long
|  TclpGetSeconds()
|  {

|!     Tcl_Time t;
|!     TclpGetTime( &t );
|!     return t.sec;
|  }

|  
|  /*
|***************
|*** 89,95 ****
|  unsigned long
|  TclpGetClicks()
|  {

|!     return GetTickCount();
|  }

|  
|  /*
|--- 158,175 ----
|  unsigned long
|  TclpGetClicks()
|  {

|!     /*
|!      * Use the TclpGetTime abstraction to get the time in microseconds,
|!      * as nearly as we can, and return it.
|!      */
|! 

|!     Tcl_Time now;		/* Current Tcl time */
|!     unsigned long retval;	/* Value to return */
|! 

|!     TclpGetTime( &now );
|!     retval = ( now.sec * 1000000 ) + now.usec;
|!     return retval;
|! 

|  }

|  
|  /*
|***************
|*** 134,140 ****
|   *	Returns the current time in timePtr.
|   *

|   * Side effects:
|!  *	None.
|   *

|   *----------------------------------------------------------------------
|   */
|--- 214,226 ----
|   *	Returns the current time in timePtr.
|   *

|   * Side effects:
|!  *	On the first call, initializes a set of static variables to
|!  *	keep track of the base value of the performance counter, the
|!  *	corresponding wall clock (obtained through ftime) and the
|!  *	frequency of the performance counter.  Also spins a thread
|!  *	whose function is to wake up periodically and monitor these
|!  *	values, adjusting them as necessary to correct for drift
|!  *	in the performance counter's oscillator.
|   *

|   *----------------------------------------------------------------------
|   */
|***************
|*** 143,153 ****
|  TclpGetTime(timePtr)
|      Tcl_Time *timePtr;		/* Location to store time information. */
|  {

|      struct timeb t;
|  
|!     ftime(&t);
|!     timePtr->sec = t.time;
|!     timePtr->usec = t.millitm * 1000;
|  }

|  
|  /*
|--- 229,342 ----
|  TclpGetTime(timePtr)
|      Tcl_Time *timePtr;		/* Location to store time information. */
|  {
|+ 	

|      struct timeb t;
|  
|!     /* Initialize static storage on the first trip through. */
|! 

|!     /*
|!      * Note: Outer check for 'initialized' is a performance win
|!      * since it avoids an extra mutex lock in the common case.
|!      */
|! 

|!     if ( !timeInfo.initialized ) { 
|! 	TclpInitLock();
|! 	if ( !timeInfo.initialized ) {
|! 	    timeInfo.perfCounterAvailable
|! 		= QueryPerformanceFrequency( &timeInfo.curCounterFreq );
|! 

|! 	    /*
|! 	     * Some hardware abstraction layers use the CPU clock
|! 	     * in place of the real-time clock as a performance counter
|! 	     * reference.  This results in:
|! 	     *    - inconsistent results among the processors on
|! 	     *      multi-processor systems.
|! 	     *    - unpredictable changes in performance counter frequency
|! 	     *      on "gearshift" processors such as Transmeta and
|! 	     *      SpeedStep.
|! 	     * There seems to be no way to test whether the performance
|! 	     * counter is reliable, but a useful heuristic is that
|! 	     * if its frequency is 1.193182 MHz or 3.579545 MHz, it's
|! 	     * derived from a colorburst crystal and is therefore
|! 	     * the RTC rather than the TSC.  If it's anything else, we
|! 	     * presume that the performance counter is unreliable.
|! 	     */
|! 

|! 	    if ( timeInfo.perfCounterAvailable
|! 		 && timeInfo.curCounterFreq.QuadPart != 1193182ui64
|! 		 && timeInfo.curCounterFreq.QuadPart != 3579545ui64 ) {
|! 		timeInfo.perfCounterAvailable = FALSE;
|! 	    }
|! 

|! 	    /*
|! 	     * If the performance counter is available, start a thread to
|! 	     * calibrate it.
|! 	     */
|! 

|! 	    if ( timeInfo.perfCounterAvailable ) {
|! 		DWORD id;
|! 		InitializeCriticalSection( &timeInfo.cs );
|! 		timeInfo.readyEvent = CreateEvent( NULL, FALSE, FALSE, NULL );
|! 		timeInfo.calibrationThread = CreateThread( NULL,
|! 							   8192,
|! 							   CalibrationThread,
|! 							   (LPVOID) NULL,
|! 							   0,
|! 							   &id );
|! 		SetThreadPriority( timeInfo.calibrationThread,
|! 				   THREAD_PRIORITY_HIGHEST );
|! 		WaitForSingleObject( timeInfo.readyEvent, INFINITE );
|! 		CloseHandle( timeInfo.readyEvent );
|! 	    }
|! 	    timeInfo.initialized = TRUE;
|! 	}
|! 	TclpInitUnlock();
|!     }
|! 

|!     if ( timeInfo.perfCounterAvailable ) {
|! 	

|! 	/*
|! 	 * Query the performance counter and use it to calculate the
|! 	 * current time.
|! 	 */
|! 

|! 	LARGE_INTEGER curCounter;
|! 				/* Current performance counter */
|! 

|! 	LONGLONG curFileTime;
|! 				/* Current estimated time, expressed
|! 				 * as 100-ns ticks since the Windows epoch */
|! 

|! 	static const LARGE_INTEGER posixEpoch = { 0xD53E8000, 0x019DB1DE };
|! 				/* Posix epoch expressed as 100-ns ticks
|! 				 * since the windows epoch */
|! 

|! 	LONGLONG usecSincePosixEpoch;
|! 				/* Current microseconds since Posix epoch */
|! 

|! 	EnterCriticalSection( &timeInfo.cs );
|! 

|! 	QueryPerformanceCounter( &curCounter );
|! 	curFileTime = timeInfo.lastFileTime.QuadPart
|! 	    + ( ( curCounter.QuadPart - timeInfo.lastCounter.QuadPart )
|! 		* 10000000 / timeInfo.curCounterFreq.QuadPart );
|! 	timeInfo.lastFileTime.QuadPart = curFileTime;
|! 	timeInfo.lastCounter.QuadPart = curCounter.QuadPart;
|! 	usecSincePosixEpoch = ( curFileTime - posixEpoch.QuadPart ) / 10;
|! 	timePtr->sec = (time_t) ( usecSincePosixEpoch / 1000000 );
|! 	timePtr->usec = (unsigned long ) ( usecSincePosixEpoch % 1000000 );
|! 	

|! 	LeaveCriticalSection( &timeInfo.cs );
|! 
|! 	

|!     } else {
|! 	

|! 	/* High resolution timer is not available.  Just use ftime */
|! 	

|! 	ftime(&t);
|! 	timePtr->sec = t.time;
|! 	timePtr->usec = t.millitm * 1000;
|!     }
|  }

|  
|  /*
|***************
|*** 439,442 ****
|--- 628,843 ----
|      }

|  
|      return tmPtr;
|+ }
|+ 

|+ /*
|+  *----------------------------------------------------------------------
|+  *
|+  * CalibrationThread --
|+  *
|+  *	Thread that manages calibration of the hi-resolution time
|+  *	derived from the performance counter, to keep it synchronized
|+  *	with the system clock.
|+  *
|+  * Parameters:
|+  *	arg -- Client data from the CreateThread call.  This parameter
|+  *             points to the static TimeInfo structure.
|+  *
|+  * Return value:
|+  *	None.  This thread embeds an infinite loop.
|+  *
|+  * Side effects:
|+  *	At an interval of clockCalibrateWakeupInterval ms, this thread
|+  *	performs virtual time discipline.
|+  *
|+  * Note: When this thread is entered, TclpInitLock has been called
|+  * to safeguard the static storage.  There is therefore no synchronization
|+  * in the body of this procedure.
|+  *
|+  *----------------------------------------------------------------------
|+  */
|+ 

|+ static DWORD WINAPI
|+ CalibrationThread( LPVOID arg )

|+ {
|+     FILETIME curFileTime;
|+ 
|+     /* Get initial system time and performance counter */
|+ 

|+     GetSystemTimeAsFileTime( &curFileTime );
|+     QueryPerformanceCounter( &timeInfo.lastCounter );
|+     QueryPerformanceFrequency( &timeInfo.curCounterFreq );
|+     timeInfo.lastFileTime.LowPart = curFileTime.dwLowDateTime;
|+     timeInfo.lastFileTime.HighPart = curFileTime.dwHighDateTime;
|+ 

|+     /* Initialize the working storage for the calibration callback */
|+ 

|+     timeInfo.lastPerfCounter = timeInfo.lastCounter.QuadPart;
|+     timeInfo.estPerfCounterFreq = timeInfo.curCounterFreq.QuadPart;
|+ 

|+     /*
|+      * Wake up the calling thread.  When it wakes up, it will release the
|+      * initialization lock.
|+      */
|+ 

|+     SetEvent( timeInfo.readyEvent );
|+ 

|+     /* Run the calibration once a second */
|+ 

|+     for ( ; ; ) {
|+ 

|+ 	Sleep( 1000 );
|+ 	UpdateTimeEachSecond();
|+ 	

|+     }
|+ }
|+ 

|+ /*
|+  *----------------------------------------------------------------------
|+  *
|+  * UpdateTimeEachSecond --
|+  *
|+  *	Callback from the waitable timer in the clock calibration thread
|+  *	that updates system time.
|+  *
|+  * Parameters:
|+  *	info -- Pointer to the static TimeInfo structure
|+  *
|+  * Results:
|+  *	None.
|+  *
|+  * Side effects:
|+  *	Performs virtual time calibration discipline.
|+  *
|+  *----------------------------------------------------------------------
|+  */
|+ 

|+ static void
|+ UpdateTimeEachSecond()

|+ {
|+ 
|+     LARGE_INTEGER curPerfCounter;
|+ 				/* Current value returned from
|+ 				 * QueryPerformanceCounter */
|+ 
|+     LONGLONG perfCounterDiff;	/* Difference between the current value
|+ 				 * and the value of 1 second ago */
|+ 
|+     FILETIME curSysTime;	/* Current system time */
|+ 
|+     LARGE_INTEGER curFileTime;	/* File time at the time this callback
|+ 				 * was scheduled. */
|+ 

|+     LONGLONG fileTimeDiff;	/* Elapsed time on the system clock
|+ 				 * since the last time this procedure
|+ 				 * was called */
|+ 

|+     LONGLONG instantFreq;	/* Instantaneous estimate of the
|+ 				 * performance counter frequency */
|+ 

|+     LONGLONG delta;		/* Increment to add to the estimated
|+ 				 * performance counter frequency in the
|+ 				 * loop filter */
|+ 

|+     LONGLONG fuzz;		/* Tolerance for the perf counter frequency */
|+ 

|+     LONGLONG lowBound;		/* Lower bound for the frequency assuming
|+ 				 * 1000 ppm tolerance */
|+ 

|+     LONGLONG hiBound;		/* Upper bound for the frequency */
|+ 

|+     /*
|+      * Get current performance counter and system time.

|+      */
|+ 

|+     QueryPerformanceCounter( &curPerfCounter );
|+     GetSystemTimeAsFileTime( &curSysTime );
|+     curFileTime.LowPart = curSysTime.dwLowDateTime;
|+     curFileTime.HighPart = curSysTime.dwHighDateTime;
|+ 

|+     EnterCriticalSection( &timeInfo.cs );
|+ 

|+     /*
|+      * Find out how many ticks of the performance counter and the
|+      * system clock have elapsed since we got into this procedure.
|+      * Estimate the current frequency.
|+      */
|+ 

|+     perfCounterDiff = curPerfCounter.QuadPart - timeInfo.lastPerfCounter;
|+     timeInfo.lastPerfCounter = curPerfCounter.QuadPart;
|+     fileTimeDiff = curFileTime.QuadPart - timeInfo.lastSysTime;
|+     timeInfo.lastSysTime = curFileTime.QuadPart;
|+     instantFreq = ( 10000000 * perfCounterDiff / fileTimeDiff );
|+ 

|+     /*
|+      * Consider this a timing glitch if instant frequency varies
|+      * significantly from the current estimate.
|+      */
|+ 

|+     fuzz = timeInfo.estPerfCounterFreq >> 10;
|+     lowBound = timeInfo.estPerfCounterFreq - fuzz;
|+     hiBound = timeInfo.estPerfCounterFreq + fuzz;
|+     if ( instantFreq < lowBound || instantFreq > hiBound ) {
|+ 	LeaveCriticalSection( &timeInfo.cs );
|+ 	return;
|+     }
|+ 

|+     /*
|+      * Update the current estimate of performance counter frequency.
|+      * This code is equivalent to the loop filter of a phase locked
|+      * loop.
|+      */
|+ 

|+     delta = ( instantFreq - timeInfo.estPerfCounterFreq ) >> 6;
|+     timeInfo.estPerfCounterFreq += delta;
|+ 

|+     /*
|+      * Update the current virtual time.
|+      */
|+ 

|+     timeInfo.lastFileTime.QuadPart
|+ 	+= ( ( curPerfCounter.QuadPart - timeInfo.lastCounter.QuadPart )
|+ 	     * 10000000 / timeInfo.curCounterFreq.QuadPart );
|+     timeInfo.lastCounter.QuadPart = curPerfCounter.QuadPart;
|+ 

|+     delta = curFileTime.QuadPart - timeInfo.lastFileTime.QuadPart;
|+     if ( delta > 10000000 || delta < -10000000 ) {
|+ 

|+ 	/*
|+ 	 * If the virtual time slip exceeds one second, then adjusting
|+ 	 * the counter frequency is hopeless (it'll take over fifteen
|+ 	 * minutes to line up with the system clock).  The most likely
|+ 	 * cause of this large a slip is a sudden change to the system
|+ 	 * clock, perhaps because it was being corrected by wristwatch
|+ 	 * and eyeball.  Accept the system time, and set the performance
|+ 	 * counter frequency to the current estimate.
|+ 	 */
|+ 

|+ 	timeInfo.lastFileTime.QuadPart = curFileTime.QuadPart;
|+ 	timeInfo.curCounterFreq.QuadPart = timeInfo.estPerfCounterFreq;
|+ 

|+     } else {
|+ 

|+ 	/*
|+ 	 * Compute a counter frequency that will cause virtual time to line
|+ 	 * up with system time one second from now, assuming that the
|+ 	 * performance counter continues to tick at timeInfo.estPerfCounterFreq.
|+ 	 */
|+ 	

|+ 	timeInfo.curCounterFreq.QuadPart
|+ 	    = 10000000 * timeInfo.estPerfCounterFreq / ( delta + 10000000 );
|+ 

|+ 	/*
|+ 	 * Limit frequency excursions to 1000 ppm from estimate
|+ 	 */
|+ 	

|+ 	if ( timeInfo.curCounterFreq.QuadPart < lowBound ) {
|+ 	    timeInfo.curCounterFreq.QuadPart = lowBound;
|+ 	} else if ( timeInfo.curCounterFreq.QuadPart > hiBound ) {
|+ 	    timeInfo.curCounterFreq.QuadPart = hiBound;
|+ 	}
|+     }
|+ 

|+     LeaveCriticalSection( &timeInfo.cs );
|+ 
|  }

|*** ../tcl8.3.2base/src/tcl8.3.2/test/winTime.test Mon Apr 10 13:19:08 2000
|--- ./tcl8.3.2/src/tcl8.3.2/test/winTime.test Wed Sep  6 14:55:30 2000
|***************
|*** 33,38 ****
|--- 33,64 ----
|      set result
|  } {1969}
|  
|+ # Next test tries to make sure that the Tcl clock stays in step
|+ # with the Windows clock.  3000 iterations really isn't enough,
|+ # but how many does a tester have patience for?
|+ 

|+ test winTime-2.1 {Synchronization of Tcl and Windows clocks} {pcOnly} {
|+     set failed 0
|+     foreach { sys_sec sys_usec tcl_sec tcl_usec } [testwinclock] {}
|+     set olddiff [expr { abs ( $tcl_sec - $sys_sec
|+ 			   + 1.0e-6 * ( $tcl_usec - $sys_usec ) ) }]
|+     set ok 1
|+     for { set i 0 } { $i < 3000 } { incr i } {
|+ 	foreach { sys_sec sys_usec tcl_sec tcl_usec } [testwinclock] {}
|+ 	set diff [expr { abs ( $tcl_sec - $sys_sec
|+ 			       + 1.0e-6 * ( $tcl_usec - $sys_usec ) ) }]
|+ 	if { ( $diff > $olddiff + 1000 )
|+ 	     || ( $diff > 11000 ) } {
|+ 	    set failed 1
|+ 	    break
|+ 	} else {
|+ 	    set olddiff $diff
|+ 	    after 1
|+ 	}
|+     }
|+     set failed
|+ } {0}
|+ 

|  # cleanup
|  ::tcltest::cleanupTests
|  return

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|
|

|

|

|

|
|

|
|

|
|
|
|

|

|

|
|

|

|

|

|

|

|

|

|

|
|

|

|
|

|

|

|

|

|

|

|

|

|

|
|

|

|
|
|

|

|
|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
<
>
|
|
|
<
>
|

|

|

|

|
|

|

|

|
|
|
|
|
|
<
>
|
<
>
|
|
|
<
>
|
|
|
|
|
|
|
|
|
<
>
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
<
<
>
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
<
>
|
<
<
<
<
<
<
<
<
<
<
<
<
>
>
>
>
>
>
>
>
>
>
>
>
|
<
>
|
|
<
>
|
|
>
>
>
<
<
<
<
>
|
<
>
|
|
<
>
|
|
<
>
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
|
<
>
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
<
>
|
|
|
<
>
|
<
>
|
|
|
|
|
<
>
|
<
>
|
|
|
|
|
|
|
|
<
>
|
<
>
|
|
|
|
|
<
>
|
|
|
<
>
|
|
|
|
|
|
<
>
|
<
>
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|
<
>
|
|
|
<
>
<
>
|
|
|
|
|
<
>
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
|
|
|
<
<
>
>
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
<
>
|
|
|
|
<
>
|
|
<
>
|
|
|
<
>
|
|
|
<
>
|
|
<
>
|
<
>
|
|
|
|
|
|
|
|
|
<
>
|
<
<
>
>
|
<
>
|
<
>
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
>
>
>
>
<
<
<
<
<
>
|
|
|
|
|
<
>
|
<
>
|
|
<
>
|
|
|
|
<
>
|
<
>
|
<
>
|
<
>
|
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
>
>
>
>
>
>
>
>
>
>
>
>
>
<
<
<
<
<
<
<
<
<
<
<
<
<
<
>
|
|
|
<
>
|
|
<
>
|
|
|
<
>
|
<
>
|
|
<
>
|
<
>
|
|
>
<
<
>
|
|
|
|
<
>
|
<
>
|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
<
>
|
|
|
<
>
|
|
|
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
|
<
>
|
|
<
>
|
<
>
|
|
|
|
|
<
>
|
|
<
>
|
|
|
<
>
|
|
|
|
|
|
<
>
|
<
<
>
>
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234

235
236
237
238

239
240
241
242

243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286

287
288

289
290
291
292

293
294
295
296
297
298
299
300
301
302

303
304
305
306

307
308
309
310
311
312
313
314

315
316
317
318
319
320
321
322
323
324
325
326
327

328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353

354
355
356
357
358
359
360
361
362

363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388

389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406

407
408
409
410
411

412
413
414
415
416
417
418
419

420
421
422
423
424
425
426
427
428

429
430

431
432
433
434
435
436
437
438
439
440
441
442
443

444
445
446

447
448
449
450
451
452

453
454

455
456
457

458
459
460

461
462
463

464
465
466
467
468

469
470
471
472
473
474
475
476
477

478
479
480
481

482
483
484
485
486

487
488
489
490
491
492
493
494

495
496

497
498
499
500

501
502

503
504
505
506
507
508

509
510

511
512
513
514
515
516
517
518
519

520
521

522
523
524
525
526
527

528
529
530
531

532
533
534
535
536
537
538

539
540

541
542
543
544
545
546

547
548
549
550
551

552
553
554

555
556
557
558

559

560
561
562
563
564
565

566
567
568

569
570
571
572
573

574
575
576
577
578
579
580
581
582

583
584
585
586
587
588
589

590
591
592
593
594
595

596
597
598
599
600
601

602
603
604
605
606

607
608
609
610
611

612
613
614
615
616
617

618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634

635
636
637
638
639
640

641
642
643
644
645

646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665

666
667

668
669
670
671
672

673
674
675

676
677
678
679

680
681
682
683

684
685
686

687
688

689
690
691
692
693
694
695
696
697
698

699
700

701
702
703

704
705

706
707
708
709
710

711
712
713
714
715
716

717
718
719
720

721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747

748
749
750
751
752
753
754

755
756
757
758
759
760

761
762

763
764
765

766
767
768
769
770

771
772

773
774

775
776

777
778
779

780
781
782

783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802

803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818

819
820
821
822

823
824
825

826
827
828
829

830
831

832
833
834

835
836

837
838
839
840

841
842
843
844
845

846
847

848
849
850
851
852
853

854
855
856
857
858
859

860
861
862
863
864

865
866
867
868
869
870
871
872

873
874
875
876
877
878

879
880
881

882
883
884
885

886
887
888
889
890

891
892
893

894
895
896
897
898
899
900
901
902
903

904
905
906

907
908

909
910
911
912
913
914

915
916
917

918
919
920
921

922
923
924
925
926
927
928

929
930

931
932
933
934
935
936
937
938
939
940
941
942
943

944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965

966
967
968
969
970

# TIP 7: Increased resolution for TclpGetTime on Windows

	Author:         Kevin Kenny <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        26-Oct-2000
	Tcl-Version:    8.4
	Discussions-To: news:comp.lang.tcl
	Post-History: 
-----

# Abstract

Tcl users on the Windows platform have long been at a disadvantage in
attempting to do code timing studies, owing to the poor resolution of
the Windows system clock.  The _time_ command, the _clock clicks_
command, and all related functions are limited to a resolution of
\(typically\) 10 milliseconds.  This proposal offers a solution based on
the Windows performance counter.  It presents a means of disciplining
this counter to the system clock so that _TclpGetTime_ \(the
underlying call that the above commands use\) can return times to
microsecond precision with accuracy in the tens of microseconds.

# Change history

_2 November 2000:_ Modified the TIP to discuss the issues surrounding
the fact that some multiprocessor kernels on Windows NT use the CPU timestamp
counter as a performance counter.  Modified the proposed patch to test for
the two frequencies in common use on 8254-compatible real-time clocks, and
enable using the performance counter only if its frequency matches one of
them.  Included the proposed patch inline for review rather than as a
pointer off to dejanews.

# Rationale

The Windows implementation of _TclpGetTime_, as of Tcl 8.3.2, uses
the _ftime_ call in the C library to extract the current system
clock in seconds and milliseconds.  While this time value has
millisecond precision, its actual resolution is limited by the tick
rate of the Windows system clock, normally 100 Hz.  Similarly,
_TclpGetClicks_ uses the _GetTickCount_ function of
_kernel32.dll_ to get the number of milliseconds since bootload;
once again, the actual resolution of this call is limited to the tick
rate of the system clock.

The Windows Platform APIs offer several timers of different accuracy.
The most precise of these is _QueryPerformanceCounter_, which
operates at an unspecified frequency \(returned by
_QueryPerformanceFrequency_\) that is typically about 1.19 MHz.
<http://support.microsoft.com/support/kb/articles/Q172/3/38.asp>  has
details of the call, with sample code.

The documentation for Windows suggests that this function is available
only on certain versions of the operating system; in fact, it is
implemented in every extant version of Win32 with the exception of
Win32s and Windows CE 1.0.  Since Visual C\+\+ 6, on which the Tcl
distribution depends, will no longer compile code for those two
platforms, I assert that they may be safely ignored.

The documentation for Windows also states that
_QueryPerformanceCounter_ is available only on certain hardware.  In
practice, this is not an issue; I have never encountered a Windows
implementation on an x86 platform that lacks it, and Alpha has it as
well.  In any case, the reference implementation tests for the success
or failure of the system calls in question, and falls back on the old
way of getting time should they return an error indication.  Users of
any platform on which the performance counter is not supported should
therefore be no worse off than they have ever been.

A worse problem with the performance counter is that its frequency is
poorly calibrated, and is frequently off by as much as 200 parts per
million.  Moreover, the frequency drifts over time, frequently having
a sensitive dependency to temperatures inside the computer's case.

This problem is not insurmountable.  The fix is to maintain the
observed frequency of the performance counter \(calibrated against the
system clock\) as a variable at run time, and use that variable
together with the value of the performance counter to derive Tcl's
concept of the time.  This technique is well known to electronic
engineers as the "phase locked loop" and is used in network protocols
such as NTP<http://www.eecis.udel.edu/~ntp/> .

One problem that is apparently insurmountable is that certain
multiprocessor systems have hardware abstraction layers that derive
the performance counter from the CPU timestamp counter in place of a
real-time clock reference.  This implementation causes the performance
counter on one CPU to drift with respect to the other over time; if a
thread is moved from one processor to another, it cannot derive a
meaningful result from comparing two successive values of the counter.
Moreover, if the CPU clock uses a "gearshift" technique for power
management \(as on Intel SpeedStep or Transmeta machines\), the CPU
timestamp counter ticks at a non-constant rate.

The proposed implementation addresses the problem by using the
performance counter only if its nominal frequency is either 1.193182
MHz or 3.579545 MHz.  These two frquencies are the common rates when
8254-compatible real-time clock chips are used; virtually all PCI bus
controllers have such chips on board.  This solution therefore adapts
to the vast majority of workstation-class Windows boxes, and is
virtually certain to exclude implementations derived from the CPU
clock since no modern CPU is that slow.  

The patch has been tested on several desktop and laptop machines from
Compaq, Dell, Gateway, HP, Micron, and Packard Bell, with processors
ranging from a 50 MHz 486 to a 750 MHz Pentium III, including laptops
using SpeedStep technology.  It passes the clock-related test cases on
all these platforms; it falls back to the old clocks with 10-ms
precision on multiprocessor servers from Compaq and HP.  \(Using the
performance counter actually would have worked on the HP server, which
apparently has some way of making sure that the results of
_QueryPerformanceCounter_ are consistent from one CPU to another.
The performance counter on the Compaq machine was observed to be
inconsistent between the two CPU's.\)

# Specification

This document proposes the following changes to the Tcl core:

   1.  \(tclWinTime.c\) Add to the static data a set of variables that
       manage the phase-locked techniques, including a
       _CRITICAL\_SECTION_ to guard them so that multi-threaded code
       is stable.

   2.  \(tclWinTime.c\) Modify _TclpGetSeconds_ to call
       _TclpGetTime_ and return the 'seconds' portion of the result.
       This change is necessary to make sure that the two times are
       consistent near the rollover from one second to another.

   3.  \(tclWinTime.c\) Modify _TclpGetClicks_ to use
       TclpGetTime to determine the click count as a number of
       microseconds.

   4.  \(tclWinTime.c\) Modify _TclpGetTime_ to return the time as
       M\*Q\+B, where Q is the result of _QueryPerformanceCounter_,
       and M and B are variables maintained by the phase-locked loop
       to keep the result as close as possible to the system clock.
       The _TclpGetTime_ call will also launch the phase-lock
       management in a separate thread the first time that it is
       invoked.  If the performance counter is unavailable,
       or if its frequency is not one of the two common 8254-compatible
       rates, then
       _TclpGetTime_ will return the result of _ftime_ as it does
       in Tcl 8.3.2.

   5.  \(tclWinTime.c\) Add the clock calibration procedure.  The
       calibration is somewhat complex; to save space, the reader is
       referred to the reference implementation for the details of how
       the time base and frequency are maintained.

   6.  \(tclWinNotify.c\) Modify _Tcl\_Sleep_ to test that the process
       has, in fact, slept for the requisite time by calling
       _TclpGetTime_ and comparing with the desired time.
       Otherwise, roundoff errors may cause the process to awaken
       early.

   7.  \(tclWinTest.c\) Add a _testwinclock_ command.  This command
       returns a four element list comprising the seconds and
       microseconds portions of the system clock and the seconds and
       microseconds portions of the Tcl clock.

   8.  \(winTime.test\) Add to the test suite a test that makes sure
       that the Tcl clock stays within 1.1 ms of the system clock over
       the duration of the test.

# Reference implementation

This change was submitted as a patch to the old bug-tracking system at
Scriptics <http://www.deja.com/getdoc.xp?AN=666545441&fmt=text> .  It
is being recycled as a TIP now that the Tcl Core Team is in place,
since the process for advancing the old patches to the Core is not
well defined.  The link above should not be used to retrieve
the current version of the patch, which appears below as an Appendix.

Tests on several Wintel boxes have shown that the initial startup
transient is less than about 10 seconds \(during which time the Tcl
clock may be running 500 ppm fast or slow to bring it into step\);
following this period, the motion of the Tcl clock is highly
repeatable and uniform.

If the system clock changes by more than 1 second during a run, as
when the operator sets it using the eyeball-and-wristwatch method, the
method of adjusting the performance frequency to preserve monotonicity
and accuracy of interval measurements is hopeless.  This is the only
case where the Tcl clock is allowed to jump.

The startup of the calibration loop does not introduce new
instabilities in the behavior of [clock clocks] or _TclpGetTime_.

[clock clicks] and other times that derive from
_TclpGetTime_ also ought to be reliable from the beginning -
assuming that _QueryPerformanceFrequency_ actually matches the
crystal.  The worst case while the initial calibration is going on
ought to be that the Tcl clock runs 0.1% fast or slow.  The point of
the calibration loop is to correct for long-term drift.

The problem, otherwise, is that _QueryPerformanceFrequency_ may be
off by some tens of parts per million with respect to the system
clock.  Over a period of days, that would cause the Tcl clock to veer
off from the system clock.  For instance, once my machine is warmed up
\(temperature is significant, believe it or not\),
_QueryPerformanceFrequency_ is consistently 0.99985 of the correct
value; without calibration, the performance-counter-derived clock
drifts 13 seconds per day.

The _capture transient_ of the calibration loop is a little
different every time, but the one shown below is typical.  The Tcl
time starts out 2 ms fast with respect to the system time, and the
initial estimate of performance frequency is off, too.  At 2 seconds
in, the calibration loop takes over and makes the clock run 0.1% slow
to bring it in line; by 5 seconds in, it's lined up.  There's some
phase noise over the next 40 seconds or so, by which time the
performance frequency is locked on quite closely. The outliers above
the line represent the fact that [after] events sometimes arrive
late because of various other things going on in Windows.

![Typical capture transient](../assets/7capture.gif)

The script that gathered the raw data plotted above appears below.

	foreach { syssecs sysusec tclsecs tclusec } [testwinclock] {}
	set basesecs $syssecs
	set baseusec $sysusec
	set nTrials 10000
	for { set i 0 } { $i < $nTrials } { incr i } {
	    set values {}
	    for { set j 0 } { $j < 5 } { incr j } {
		foreach { syssecs sysusec tclsecs tclusec } [testwinclock] {}
		set systime [expr { ($syssecs - $basesecs)
				    + 1.0e-6 * $sysusec - 1.0e-6 * $baseusec }]
		set tcltime [expr { ($tclsecs - $basesecs)
				    + 1.0e-6 * $tclusec - 1.0e-6 * $baseusec }]
		set timediff [expr { $tcltime - $systime }]
		lappend values [list $systime $timediff $tcltime]
		after 1

	    }
	    foreach { elapsed timediff tcltime } \
		[lindex [lsort -real -index 1 $values] 0] {}
	    lappend history $elapsed $timediff $tcltime

	}
	set f [open ~/test2.dat w]
	foreach { elapsed timediff tcltime} $history {
	    puts $f "$elapsed\t$timediff\t$tcltime"

	}
	close $f

To quantify how reproducible the measurements are, I threw a patched
tclsh the torture test of executing [time {}] ten million times, and
made a histogram of the results.  The figure below shows the results.
The dots represent individual sample bins, and the solid line is the
cumulative count of samples.  The vast majority of samples show either
five or six microseconds. 99.9% take fewer than nine.  There are many
samples that take longer, owing to either servicing interrupts or
losing the processor to other processes.

The lines at 21, 31 and 42 microseconds show up in repeated runs on my
machine; I suspect that they represent time spent servicing different
sorts of video interrupts.  It's less clear to me what the other
outliers might be; Windows has a tremendous amount of stuff going on
even when it's apparently idle.

![Histogram of results of {[time} {{}].}](../assets/7histogram.gif)

All tests in the test suite continue to pass with the patch applied.

# Notes

If you care about time to the absolute precision that this change can
achieve, it is of course necessary to discipline the Windows system
clock as well.  Perhaps the best way is to use one of the available
NTP packages \(<http://www.eecis.udel.edu/~ntp/>  for further
information\).

# Copyright

This document has been placed in the public domain.

# Appendix

The proposed set of patches to the Tcl 8.3.2 code base appears here.

	*** ../tcl8.3.2base/src/tcl8.3.2/win/tclWinNotify.c Fri Jul  2 18:08:30 1999
	--- ./src/tcl8.3.2/win/tclWinNotify.c Thu Aug 24 23:29:12 2000
	***************
	*** 510,514 ****
	  Tcl_Sleep(ms)
	      int ms;			/* Number of milliseconds to sleep. */

	  {
	!     Sleep(ms);

	  }
	--- 510,548 ----
	  Tcl_Sleep(ms)
	      int ms;			/* Number of milliseconds to sleep. */

	  {
	!     /*
	!      * Simply calling 'Sleep' for the requisite number of milliseconds
	!      * can make the process appear to wake up early because it isn't
	!      * synchronized with the CPU performance counter that is used in
	!      * tclWinTime.c.  This behavior is probably benign, but messes
	!      * up some of the corner cases in the test suite.  We get around
	!      * this problem by repeating the 'Sleep' call as many times
	!      * as necessary to make the clock advance by the requisite amount.
	!      */

	! 
	!     Tcl_Time now;		/* Current wall clock time */
	!     Tcl_Time desired;		/* Desired wakeup time */
	!     int sleepTime = ms;		/* Time to sleep */

	! 
	!     TclpGetTime( &now );
	!     desired.sec = now.sec + ( ms / 1000 );
	!     desired.usec = now.usec + 1000 * ( ms % 1000 );
	!     if ( desired.usec > 1000000 ) {
	! 	++desired.sec;
	! 	desired.usec -= 1000000;
	!     }

	! 	
	!     for ( ; ; ) {
	! 	Sleep( sleepTime );
	! 	TclpGetTime( &now );
	! 	if ( now.sec > desired.sec ) {
	! 	    break;
	! 	} else if ( ( now.sec == desired.sec )
	! 	     && ( now.usec >= desired.usec ) ) {
	! 	    break;
	! 	}
	! 	sleepTime = ( ( 1000 * ( desired.sec - now.sec ) )
	! 		      + ( ( desired.usec - now.usec ) / 1000 ) );
	!     }

	! 
	  }
	*** ../tcl8.3.2base/src/tcl8.3.2/win/tclWinTest.c Thu Oct 28 23:05:14 1999
	--- ./src/tcl8.3.2/win/tclWinTest.c Mon Sep  4 22:45:56 2000
	***************
	*** 22,27 ****
	--- 22,31 ----
	  static int	TestvolumetypeCmd _ANSI_ARGS_((ClientData dummy,
		 Tcl_Interp *interp, int objc,
		 Tcl_Obj *CONST objv[]));
	+ static int      TestwinclockCmd _ANSI_ARGS_(( ClientData dummy,
	+ 					      Tcl_Interp* interp,
	+ 					      int objc,
	+ 					      Tcl_Obj *CONST objv[] ));

	  /*
	   *----------------------------------------------------------------------
	***************
	*** 52,57 ****
	--- 56,63 ----
		       (ClientData) 0, (Tcl_CmdDeleteProc *) NULL);
	      Tcl_CreateObjCommand(interp, "testvolumetype", TestvolumetypeCmd,
		       (ClientData) 0, (Tcl_CmdDeleteProc *) NULL);
	+     Tcl_CreateObjCommand(interp, "testwinclock", TestwinclockCmd,
	+             (ClientData) 0, (Tcl_CmdDeleteProc *) NULL);
	      return TCL_OK;

	  }

	***************
	*** 187,190 ****
	--- 193,267 ----
	      Tcl_SetResult(interp, volType, TCL_VOLATILE);
	      return TCL_OK;
	  #undef VOL_BUF_SIZE
	+ }

	+ 
	+ /*
	+  *----------------------------------------------------------------------
	+  *
	+  * TestclockCmd --
	+  *
	+  *	Command that returns the seconds and microseconds portions of
	+  *	the system clock and of the Tcl clock so that they can be
	+  *	compared to validate that the Tcl clock is staying in sync.
	+  *
	+  * Usage:
	+  *	testclock
	+  *
	+  * Parameters:
	+  *	None.
	+  *
	+  * Results:
	+  *	Returns a standard Tcl result comprising a four-element list:
	+  *	the seconds and microseconds portions of the system clock,
	+  *	and the seconds and microseconds portions of the Tcl clock.
	+  *
	+  * Side effects:
	+  *	None.
	+  *
	+  *----------------------------------------------------------------------
	+  */

	+ 
	+ static int
	+ TestwinclockCmd( ClientData dummy,
	+ 				/* Unused */
	+ 		 Tcl_Interp* interp,
	+ 				/* Tcl interpreter */
	+ 		 int objc,
	+ 				/* Argument count */
	+ 		 Tcl_Obj *CONST objv[] )
	+ 				/* Argument vector */
	+ {
	+     CONST static FILETIME posixEpoch = { 0xD53E8000, 0x019DB1DE };
	+ 				/* The Posix epoch, expressed as a
	+ 				 * Windows FILETIME */
	+     Tcl_Time tclTime;		/* Tcl clock */
	+     FILETIME sysTime;		/* System clock */
	+     Tcl_Obj* result;		/* Result of the command */
	+     LARGE_INTEGER t1, t2;

	+ 
	+     if ( objc != 1 ) {
	+ 	Tcl_WrongNumArgs( interp, 1, objv, "" );
	+ 	return TCL_ERROR;
	+     }

	+ 
	+     TclpGetTime( &tclTime );
	+     GetSystemTimeAsFileTime( &sysTime );
	+     t1.LowPart = posixEpoch.dwLowDateTime;
	+     t1.HighPart = posixEpoch.dwHighDateTime;
	+     t2.LowPart = sysTime.dwLowDateTime;
	+     t2.HighPart = sysTime.dwHighDateTime;
	+     t2.QuadPart -= t1.QuadPart;

	+ 
	+     result = Tcl_NewObj();
	+     Tcl_ListObjAppendElement
	+ 	( interp, result, Tcl_NewIntObj( (int) (t2.QuadPart / 10000000 ) ) );
	+     Tcl_ListObjAppendElement
	+ 	( interp, result,
	+ 	  Tcl_NewIntObj( (int) ( (t2.QuadPart / 10 ) % 1000000 ) ) );
	+     Tcl_ListObjAppendElement( interp, result, Tcl_NewIntObj( tclTime.sec ) );
	+     Tcl_ListObjAppendElement( interp, result, Tcl_NewIntObj( tclTime.usec ) );

	+ 
	+     Tcl_SetObjResult( interp, result );

	+ 
	+     return TCL_OK;
	  }
	*** ../tcl8.3.2base/src/tcl8.3.2/win/tclWinTime.c Tue Nov 30 19:08:44 1999
	--- ./src/tcl8.3.2/win/tclWinTime.c Thu Nov  2 14:25:56 2000
	***************
	*** 38,47 ****
	--- 38,114 ----
	  static Tcl_ThreadDataKey dataKey;

	  /*
	+  * Calibration interval for the high-resolution timer, in msec
	+  */

	+ 
	+ static CONST unsigned long clockCalibrateWakeupInterval = 10000;
	+ 				/* FIXME: 10 s -- should be about 10 min! */

	+ 
	+ /*
	+  * Data for managing high-resolution timers.
	+  */
	+ 
	+ typedef struct TimeInfo {

	+ 
	+     CRITICAL_SECTION cs;	/* Mutex guarding this structure */

	+ 
	+     int initialized;		/* Flag == 1 if this structure is
	+ 				 * initialized. */

	+ 
	+     int perfCounterAvailable;	/* Flag == 1 if the hardware has a
	+ 				 * performance counter */

	+ 
	+     HANDLE calibrationThread;	/* Handle to the thread that keeps the
	+ 				 * virtual clock calibrated. */

	+ 
	+     HANDLE readyEvent;		/* System event used to
	+ 				 * trigger the requesting thread
	+ 				 * when the clock calibration procedure
	+ 				 * is initialized for the first time */

	+ 
	+     /*
	+      * The following values are used for calculating virtual time.
	+      * Virtual time is always equal to:
	+      *    lastFileTime + (current perf counter - lastCounter) 
	+      *				* 10000000 / curCounterFreq
	+      * and lastFileTime and lastCounter are updated any time that
	+      * virtual time is returned to a caller.
	+      */

	+ 
	+     ULARGE_INTEGER lastFileTime;
	+     LARGE_INTEGER lastCounter;
	+     LARGE_INTEGER curCounterFreq;

	+ 
	+     /* 
	+      * The next two values are used only in the calibration thread, to track
	+      * the frequency of the performance counter.
	+      */

	+ 
	+     LONGLONG lastPerfCounter;	/* Performance counter the last time
	+ 				 * that UpdateClockEachSecond was called */
	+     LONGLONG lastSysTime;	/* System clock at the last time
	+ 				 * that UpdateClockEachSecond was called */
	+     LONGLONG estPerfCounterFreq;
	+ 				/* Current estimate of the counter frequency
	+ 				 * using the system clock as the standard */

	+ 
	+ } TimeInfo;

	+ 
	+ static TimeInfo timeInfo = {
	+     NULL, 0, 0, NULL, NULL, 0, 0, 0, 0, 0
	+ };

	+ 
	+ CONST static FILETIME posixEpoch = { 0xD53E8000, 0x019DB1DE };

	+     
	+ /*
	   * Declarations for functions defined later in this file.
	   */

	  static struct tm *	ComputeGMT _ANSI_ARGS_((const time_t *tp));

	+ 
	+ static DWORD WINAPI     CalibrationThread _ANSI_ARGS_(( LPVOID arg ));

	+ 
	+ static void 		UpdateTimeEachSecond _ANSI_ARGS_(( void ));

	  /*
	   *----------------------------------------------------------------------
	***************
	*** 63,69 ****
	  unsigned long
	  TclpGetSeconds()

	  {
	!     return (unsigned long) time((time_t *) NULL);

	  }

	  /*
	--- 130,138 ----
	  unsigned long
	  TclpGetSeconds()

	  {
	!     Tcl_Time t;
	!     TclpGetTime( &t );
	!     return t.sec;

	  }

	  /*
	***************
	*** 89,95 ****
	  unsigned long
	  TclpGetClicks()

	  {
	!     return GetTickCount();

	  }

	  /*
	--- 158,175 ----
	  unsigned long
	  TclpGetClicks()

	  {
	!     /*
	!      * Use the TclpGetTime abstraction to get the time in microseconds,
	!      * as nearly as we can, and return it.
	!      */

	! 
	!     Tcl_Time now;		/* Current Tcl time */
	!     unsigned long retval;	/* Value to return */

	! 
	!     TclpGetTime( &now );
	!     retval = ( now.sec * 1000000 ) + now.usec;
	!     return retval;

	! 

	  }

	  /*
	***************
	*** 134,140 ****
	   *	Returns the current time in timePtr.

	   *
	   * Side effects:
	!  *	None.

	   *
	   *----------------------------------------------------------------------
	   */
	--- 214,226 ----
	   *	Returns the current time in timePtr.

	   *
	   * Side effects:
	!  *	On the first call, initializes a set of static variables to
	!  *	keep track of the base value of the performance counter, the
	!  *	corresponding wall clock (obtained through ftime) and the
	!  *	frequency of the performance counter.  Also spins a thread
	!  *	whose function is to wake up periodically and monitor these
	!  *	values, adjusting them as necessary to correct for drift
	!  *	in the performance counter's oscillator.

	   *
	   *----------------------------------------------------------------------
	   */
	***************
	*** 143,153 ****
	  TclpGetTime(timePtr)
	      Tcl_Time *timePtr;		/* Location to store time information. */

	  {
	      struct timeb t;

	!     ftime(&t);
	!     timePtr->sec = t.time;
	!     timePtr->usec = t.millitm * 1000;

	  }

	  /*
	--- 229,342 ----
	  TclpGetTime(timePtr)
	      Tcl_Time *timePtr;		/* Location to store time information. */

	  {
	+ 	
	      struct timeb t;

	!     /* Initialize static storage on the first trip through. */

	! 
	!     /*
	!      * Note: Outer check for 'initialized' is a performance win
	!      * since it avoids an extra mutex lock in the common case.
	!      */

	! 
	!     if ( !timeInfo.initialized ) { 
	! 	TclpInitLock();
	! 	if ( !timeInfo.initialized ) {
	! 	    timeInfo.perfCounterAvailable
	! 		= QueryPerformanceFrequency( &timeInfo.curCounterFreq );

	! 
	! 	    /*
	! 	     * Some hardware abstraction layers use the CPU clock
	! 	     * in place of the real-time clock as a performance counter
	! 	     * reference.  This results in:
	! 	     *    - inconsistent results among the processors on
	! 	     *      multi-processor systems.
	! 	     *    - unpredictable changes in performance counter frequency
	! 	     *      on "gearshift" processors such as Transmeta and
	! 	     *      SpeedStep.
	! 	     * There seems to be no way to test whether the performance
	! 	     * counter is reliable, but a useful heuristic is that
	! 	     * if its frequency is 1.193182 MHz or 3.579545 MHz, it's
	! 	     * derived from a colorburst crystal and is therefore
	! 	     * the RTC rather than the TSC.  If it's anything else, we
	! 	     * presume that the performance counter is unreliable.
	! 	     */

	! 
	! 	    if ( timeInfo.perfCounterAvailable
	! 		 && timeInfo.curCounterFreq.QuadPart != 1193182ui64
	! 		 && timeInfo.curCounterFreq.QuadPart != 3579545ui64 ) {
	! 		timeInfo.perfCounterAvailable = FALSE;
	! 	    }

	! 
	! 	    /*
	! 	     * If the performance counter is available, start a thread to
	! 	     * calibrate it.
	! 	     */

	! 
	! 	    if ( timeInfo.perfCounterAvailable ) {
	! 		DWORD id;
	! 		InitializeCriticalSection( &timeInfo.cs );
	! 		timeInfo.readyEvent = CreateEvent( NULL, FALSE, FALSE, NULL );
	! 		timeInfo.calibrationThread = CreateThread( NULL,
	! 							   8192,
	! 							   CalibrationThread,
	! 							   (LPVOID) NULL,
	! 							   0,
	! 							   &id );
	! 		SetThreadPriority( timeInfo.calibrationThread,
	! 				   THREAD_PRIORITY_HIGHEST );
	! 		WaitForSingleObject( timeInfo.readyEvent, INFINITE );
	! 		CloseHandle( timeInfo.readyEvent );
	! 	    }
	! 	    timeInfo.initialized = TRUE;
	! 	}
	! 	TclpInitUnlock();
	!     }

	! 
	!     if ( timeInfo.perfCounterAvailable ) {

	! 	
	! 	/*
	! 	 * Query the performance counter and use it to calculate the
	! 	 * current time.
	! 	 */

	! 
	! 	LARGE_INTEGER curCounter;
	! 				/* Current performance counter */

	! 
	! 	LONGLONG curFileTime;
	! 				/* Current estimated time, expressed
	! 				 * as 100-ns ticks since the Windows epoch */

	! 
	! 	static const LARGE_INTEGER posixEpoch = { 0xD53E8000, 0x019DB1DE };
	! 				/* Posix epoch expressed as 100-ns ticks
	! 				 * since the windows epoch */

	! 
	! 	LONGLONG usecSincePosixEpoch;
	! 				/* Current microseconds since Posix epoch */

	! 
	! 	EnterCriticalSection( &timeInfo.cs );

	! 
	! 	QueryPerformanceCounter( &curCounter );
	! 	curFileTime = timeInfo.lastFileTime.QuadPart
	! 	    + ( ( curCounter.QuadPart - timeInfo.lastCounter.QuadPart )
	! 		* 10000000 / timeInfo.curCounterFreq.QuadPart );
	! 	timeInfo.lastFileTime.QuadPart = curFileTime;
	! 	timeInfo.lastCounter.QuadPart = curCounter.QuadPart;
	! 	usecSincePosixEpoch = ( curFileTime - posixEpoch.QuadPart ) / 10;
	! 	timePtr->sec = (time_t) ( usecSincePosixEpoch / 1000000 );
	! 	timePtr->usec = (unsigned long ) ( usecSincePosixEpoch % 1000000 );

	! 	
	! 	LeaveCriticalSection( &timeInfo.cs );

	! 
	! 	
	!     } else {

	! 	
	! 	/* High resolution timer is not available.  Just use ftime */

	! 	
	! 	ftime(&t);
	! 	timePtr->sec = t.time;
	! 	timePtr->usec = t.millitm * 1000;
	!     }

	  }

	  /*
	***************
	*** 439,442 ****
	--- 628,843 ----

	      }

	      return tmPtr;
	+ }

	+ 
	+ /*
	+  *----------------------------------------------------------------------
	+  *
	+  * CalibrationThread --
	+  *
	+  *	Thread that manages calibration of the hi-resolution time
	+  *	derived from the performance counter, to keep it synchronized
	+  *	with the system clock.
	+  *
	+  * Parameters:
	+  *	arg -- Client data from the CreateThread call.  This parameter
	+  *             points to the static TimeInfo structure.
	+  *
	+  * Return value:
	+  *	None.  This thread embeds an infinite loop.
	+  *
	+  * Side effects:
	+  *	At an interval of clockCalibrateWakeupInterval ms, this thread
	+  *	performs virtual time discipline.
	+  *
	+  * Note: When this thread is entered, TclpInitLock has been called
	+  * to safeguard the static storage.  There is therefore no synchronization
	+  * in the body of this procedure.
	+  *
	+  *----------------------------------------------------------------------
	+  */

	+ 
	+ static DWORD WINAPI
	+ CalibrationThread( LPVOID arg )
	+ {
	+     FILETIME curFileTime;
	+ 
	+     /* Get initial system time and performance counter */

	+ 
	+     GetSystemTimeAsFileTime( &curFileTime );
	+     QueryPerformanceCounter( &timeInfo.lastCounter );
	+     QueryPerformanceFrequency( &timeInfo.curCounterFreq );
	+     timeInfo.lastFileTime.LowPart = curFileTime.dwLowDateTime;
	+     timeInfo.lastFileTime.HighPart = curFileTime.dwHighDateTime;

	+ 
	+     /* Initialize the working storage for the calibration callback */

	+ 
	+     timeInfo.lastPerfCounter = timeInfo.lastCounter.QuadPart;
	+     timeInfo.estPerfCounterFreq = timeInfo.curCounterFreq.QuadPart;

	+ 
	+     /*
	+      * Wake up the calling thread.  When it wakes up, it will release the
	+      * initialization lock.
	+      */

	+ 
	+     SetEvent( timeInfo.readyEvent );

	+ 
	+     /* Run the calibration once a second */

	+ 
	+     for ( ; ; ) {

	+ 
	+ 	Sleep( 1000 );
	+ 	UpdateTimeEachSecond();

	+ 	
	+     }
	+ }

	+ 
	+ /*
	+  *----------------------------------------------------------------------
	+  *
	+  * UpdateTimeEachSecond --
	+  *
	+  *	Callback from the waitable timer in the clock calibration thread
	+  *	that updates system time.
	+  *
	+  * Parameters:
	+  *	info -- Pointer to the static TimeInfo structure
	+  *
	+  * Results:
	+  *	None.
	+  *
	+  * Side effects:
	+  *	Performs virtual time calibration discipline.
	+  *
	+  *----------------------------------------------------------------------
	+  */

	+ 
	+ static void
	+ UpdateTimeEachSecond()
	+ {
	+ 
	+     LARGE_INTEGER curPerfCounter;
	+ 				/* Current value returned from
	+ 				 * QueryPerformanceCounter */
	+ 
	+     LONGLONG perfCounterDiff;	/* Difference between the current value
	+ 				 * and the value of 1 second ago */
	+ 
	+     FILETIME curSysTime;	/* Current system time */
	+ 
	+     LARGE_INTEGER curFileTime;	/* File time at the time this callback
	+ 				 * was scheduled. */

	+ 
	+     LONGLONG fileTimeDiff;	/* Elapsed time on the system clock
	+ 				 * since the last time this procedure
	+ 				 * was called */

	+ 
	+     LONGLONG instantFreq;	/* Instantaneous estimate of the
	+ 				 * performance counter frequency */

	+ 
	+     LONGLONG delta;		/* Increment to add to the estimated
	+ 				 * performance counter frequency in the
	+ 				 * loop filter */

	+ 
	+     LONGLONG fuzz;		/* Tolerance for the perf counter frequency */

	+ 
	+     LONGLONG lowBound;		/* Lower bound for the frequency assuming
	+ 				 * 1000 ppm tolerance */

	+ 
	+     LONGLONG hiBound;		/* Upper bound for the frequency */

	+ 
	+     /*
	+      * Get current performance counter and system time.
	+      */

	+ 
	+     QueryPerformanceCounter( &curPerfCounter );
	+     GetSystemTimeAsFileTime( &curSysTime );
	+     curFileTime.LowPart = curSysTime.dwLowDateTime;
	+     curFileTime.HighPart = curSysTime.dwHighDateTime;

	+ 
	+     EnterCriticalSection( &timeInfo.cs );

	+ 
	+     /*
	+      * Find out how many ticks of the performance counter and the
	+      * system clock have elapsed since we got into this procedure.
	+      * Estimate the current frequency.
	+      */

	+ 
	+     perfCounterDiff = curPerfCounter.QuadPart - timeInfo.lastPerfCounter;
	+     timeInfo.lastPerfCounter = curPerfCounter.QuadPart;
	+     fileTimeDiff = curFileTime.QuadPart - timeInfo.lastSysTime;
	+     timeInfo.lastSysTime = curFileTime.QuadPart;
	+     instantFreq = ( 10000000 * perfCounterDiff / fileTimeDiff );

	+ 
	+     /*
	+      * Consider this a timing glitch if instant frequency varies
	+      * significantly from the current estimate.
	+      */

	+ 
	+     fuzz = timeInfo.estPerfCounterFreq >> 10;
	+     lowBound = timeInfo.estPerfCounterFreq - fuzz;
	+     hiBound = timeInfo.estPerfCounterFreq + fuzz;
	+     if ( instantFreq < lowBound || instantFreq > hiBound ) {
	+ 	LeaveCriticalSection( &timeInfo.cs );
	+ 	return;
	+     }

	+ 
	+     /*
	+      * Update the current estimate of performance counter frequency.
	+      * This code is equivalent to the loop filter of a phase locked
	+      * loop.
	+      */

	+ 
	+     delta = ( instantFreq - timeInfo.estPerfCounterFreq ) >> 6;
	+     timeInfo.estPerfCounterFreq += delta;

	+ 
	+     /*
	+      * Update the current virtual time.
	+      */

	+ 
	+     timeInfo.lastFileTime.QuadPart
	+ 	+= ( ( curPerfCounter.QuadPart - timeInfo.lastCounter.QuadPart )
	+ 	     * 10000000 / timeInfo.curCounterFreq.QuadPart );
	+     timeInfo.lastCounter.QuadPart = curPerfCounter.QuadPart;

	+ 
	+     delta = curFileTime.QuadPart - timeInfo.lastFileTime.QuadPart;
	+     if ( delta > 10000000 || delta < -10000000 ) {

	+ 
	+ 	/*
	+ 	 * If the virtual time slip exceeds one second, then adjusting
	+ 	 * the counter frequency is hopeless (it'll take over fifteen
	+ 	 * minutes to line up with the system clock).  The most likely
	+ 	 * cause of this large a slip is a sudden change to the system
	+ 	 * clock, perhaps because it was being corrected by wristwatch
	+ 	 * and eyeball.  Accept the system time, and set the performance
	+ 	 * counter frequency to the current estimate.
	+ 	 */

	+ 
	+ 	timeInfo.lastFileTime.QuadPart = curFileTime.QuadPart;
	+ 	timeInfo.curCounterFreq.QuadPart = timeInfo.estPerfCounterFreq;

	+ 
	+     } else {

	+ 
	+ 	/*
	+ 	 * Compute a counter frequency that will cause virtual time to line
	+ 	 * up with system time one second from now, assuming that the
	+ 	 * performance counter continues to tick at timeInfo.estPerfCounterFreq.
	+ 	 */

	+ 	
	+ 	timeInfo.curCounterFreq.QuadPart
	+ 	    = 10000000 * timeInfo.estPerfCounterFreq / ( delta + 10000000 );

	+ 
	+ 	/*
	+ 	 * Limit frequency excursions to 1000 ppm from estimate
	+ 	 */

	+ 	
	+ 	if ( timeInfo.curCounterFreq.QuadPart < lowBound ) {
	+ 	    timeInfo.curCounterFreq.QuadPart = lowBound;
	+ 	} else if ( timeInfo.curCounterFreq.QuadPart > hiBound ) {
	+ 	    timeInfo.curCounterFreq.QuadPart = hiBound;
	+ 	}
	+     }

	+ 
	+     LeaveCriticalSection( &timeInfo.cs );

	+ 
	  }
	*** ../tcl8.3.2base/src/tcl8.3.2/test/winTime.test Mon Apr 10 13:19:08 2000
	--- ./tcl8.3.2/src/tcl8.3.2/test/winTime.test Wed Sep  6 14:55:30 2000
	***************
	*** 33,38 ****
	--- 33,64 ----
	      set result
	  } {1969}

	+ # Next test tries to make sure that the Tcl clock stays in step
	+ # with the Windows clock.  3000 iterations really isn't enough,
	+ # but how many does a tester have patience for?

	+ 
	+ test winTime-2.1 {Synchronization of Tcl and Windows clocks} {pcOnly} {
	+     set failed 0
	+     foreach { sys_sec sys_usec tcl_sec tcl_usec } [testwinclock] {}
	+     set olddiff [expr { abs ( $tcl_sec - $sys_sec
	+ 			   + 1.0e-6 * ( $tcl_usec - $sys_usec ) ) }]
	+     set ok 1
	+     for { set i 0 } { $i < 3000 } { incr i } {
	+ 	foreach { sys_sec sys_usec tcl_sec tcl_usec } [testwinclock] {}
	+ 	set diff [expr { abs ( $tcl_sec - $sys_sec
	+ 			       + 1.0e-6 * ( $tcl_usec - $sys_usec ) ) }]
	+ 	if { ( $diff > $olddiff + 1000 )
	+ 	     || ( $diff > 11000 ) } {
	+ 	    set failed 1
	+ 	    break
	+ 	} else {
	+ 	    set olddiff $diff
	+ 	    after 1
	+ 	}
	+     }
	+     set failed
	+ } {0}

	+ 
	  # cleanup
	  ::tcltest::cleanupTests
	  return

Name change from tip/70.tip to tip/70.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112

113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149

150
151

152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186

187
188
189
190
191
192

193
194
195

196
197
198
199
200
201

202
203
204

205
206
207
208
209

210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231

232
233
234
235
236
237

238
239
240
241
242
243
244
245
TIP:            70
Title:          A Relational Switch Control Structure
Version:        $Revision: 1.8 $
Author:         Bhushit Joshipura <[email protected]>
Author:         Donal K. Fellows <[email protected]>
State:          Withdrawn
Type:           Project
Vote:           Pending
Created:        20-Oct-2001
Post-History:   
Tcl-Version:    8.5

~ Abstract

This TIP proposes the introduction of a new control structure, ''rswitch'',
which is a relational parallel to switch-case control structure. It
consists of two lists: condition list and situation-reaction list. At
the maximum two conditions can be specified. Based on situation,
reaction is executed. The situation is selected on "first true and
only the first true" basis.

~Rationale

Theoretically only two controls - ''if'' and ''goto'' - are sufficient
to implement all algorithms. However, languages provide more control
structures for better representation of algorithms. To many structural
programmers like me, a ''switch'' statement gives much better picture of
the program than equivalent ''if''-''elseif''-...-''else'' chain. It
pronounces the course of decision of a big chunk of program in a single
statement, making understanding and maintaining software easier. It
also helps to optimize the software better if it is written in
''switch'' form. However, ''switch'' is strictly data based i.e.
''switch'' happens strictly on data value. 

The proposed ''rswitch'' command is a control structure similar to (and
more general than) ''switch''. (As a matter of fact, Tcl's ''foreach''
control structure is a special case of its for control structure.) Using
''rswitch'' it should be possible to take decisions based on relations
between entities.

In response to comments on Revision: 1.2 of the draft, Bhushit
Joshipura wrote [Re-edited]:

Why rswitch? [Re-written] 

 1. if..elseif..else contains elements of surprize spread in the code.
    From maintenance point of view, once the
    ''if''..''elseif''..''else'' code goes beyond horizon (one display
    length), burden of reference retention comes on human brain making
    maintenance bug-prone.

 2. In case of ''rswitch'', if a situation refers to one or more
    conditions, we had reduced this burden by stating upfront which
    variable is in the spotlight. The maintainer can then easily jump
    irrelavant cases. 

 3. In ''if''..''elseif''..''else'', a string of three conditions (with
    one of them being a `not' string) can mystify me for at least an
    hour. 

 4. In case of an ''rswitch'' (even in a situation which does not
    refer to any conditions):

 > 1. `and' is nested rswitch

 > 2. `or' is a fall through case

 > 3. `not' could be written as "default" case. `not' free from `and'
    and `or' is less confusing. We can write almost a `not-less' code
    using "default". 

 > Logical connectives result into visual presentation.

 5. Object Oriented Programming is trying to eliminate ''switch-case''
    statements by identifying localization of references and finding a
    heirarchy of data-actions. 

 6. In similar way one can think of ''rswitch'' cases (situations).
    Identifying localized references (w.r.t. situations) and a
    heirarchy of situations-reactions. However, this is a research
    issue.

Moreover, the writer does not need to be an artist to be able to write understandable code.

~Implementation in Other Languages

I queried about proposal of such a control structure in C to Dr. Dennis
M. Ritchie in February 2001. (At that time I thought only of
bi-conditional relational switch. See a few pages down for currently
proposed control structure.)

 >  Absence of relational switch I know this can be odd for other languages - but not for C.
    C is so near to machine and a relational switch could be ideal for many many
    machine-cycle saving situations.

 >  Apart from machine-orientedness, it could avoid many usages of not-so-structured ?:
    operator.

 >  It could simplify a lot of control and signal processing code too.

 >  Why did C become more data-biased for a control structure?

|    relational-switch(expr1,expr2){
|    case ==: statements;
|            break;
|    case > : statements;
|            break;
|    case < : statements;
|            break;
|    default: statements;
|            break;
|    }

TCL need not be so optimized, as C has to be. However, clarity and
maintainability remain formidable reasons for relational switch
implementation.

In a quick reply, Dr. Ritchie wrote back:

 >  The relational switch idea is (so far as I know) for C a new suggestion, although I have
    no idea of all things that were proposed for the 1989 or 1999 C standards. If seems to
    hark back to the old Fortran GOTO statement

|    IF (expression) 2, 3, 4

 >  which went to various places depending on whether the expression was -, 0 or +. It's
    also a bit strange syntactically (though it might work in the grammar) in that the case
    values aren't expressions, but just operators.

 >  Regards, Dennis"

Thus the structure is absent from C and its whole family. It is absent
from Pascal, PERL, BASIC, shell scripts - and of course, TCL.

Fortran's computed goto is near to bi-conditional rswitch. (That way
Chimpanzees are near to Homo sapiens too.) However, clarity of
presentation of default and fall through are not achievable through
computed goto. Mono-conditional rswitch, however, does not have a
parallel in languages of my knowledge (C, C++, Java, Pascal, BASIC,
Fortran, shell scripts). 

~Grammar and Behavior

Overall:

|rswitch {[condition(s)]} {
|    <situation-1> {
|        <reaction-block-1>
|    }

|    ...
|}

The condition list may have no, one or two variables.

A situation is legal if:

 1. It is a valid expression or

 2. If the condition list had at least one element 'x', {$x $situation} is a valid expression or

 > 1. If the condition list had exactly one element 'x', {$situation $x} is a valid expression or

 > 2. If the condition list had two elements 'x' and 'y', {$x $situation $y} is a valid expression or

 3. If the condition list had two elements 'x' and 'y' and {$situation $y} is a valid expression or

 4. If $situation == "default"

A reaction block is legal if:

 1. It is not the last block and $reactionBlock == "-" (fall through) or

 2. It is a valid TCL action block

Let us call a non-default extracted valid expression a SITUATION.

At execution, reaction block following or fell through by the first and only the first SITUATION that becomes true, is executed. In case no SITUATION becomes true and default situation is present, reaction block following or fell through by default statement is executed. Default situation is not necessary for operation of rswitch. An rswitch without any situation-reaction block is grammatically valid.

~Sample Invocations

|# Full length condition block. Second condition perhaps got redundant with maintenance.
|rswitch {$a $b} {
|   {> $c} -
|   {< $d} {
|      puts "$a is either > $c or < $d or both"
|   }

|   {< $c} {
|      # Full length condition block. Second condition is used.
|      rswitch {$a $d} {
|         > {
|            puts "$a is < $c AND > $d"
|         }

|         == {
|            puts "$a and $d are equal and they are < $c"
|         }

|         default {
|            puts "$a is < $c BUT <= $d"
|            puts "should never come here"
|         }
|      }
|   }

|   {3 > } {
|      puts "$a == $c, $a >= $d and $b <= 3"
|   }

|   default {
|      puts "$a == $c, $a >= $d and $b >= 3"
|   }
|}

~Contrast

Contrast above code with its if-elseif-else equivalent. Notice that:

 1. Both the examples have same effect.

 2. Both examples are indented with the same style.

|if {($a > $c) || ($a < $d)} {
|	# could you see a maintenance nightmare that could have arisen when
|	# reference to $b were eliminated?
|	puts "$a is either > $c or < $d or both"
|} elseif {$a < $c} {
|	if {$a > $d} {
|		puts "$a is < $c AND > $d"
|	} elseif {$a == $d} {
|		puts "$a and $d are equal and they are < $c"
|	} else {
|		puts "Pop-up question: What should we have here?
|		puts "$a is < $c BUT <= $d"
|		puts "should never come here"
|	}

|} elseif {3 > $b} {
|		puts "$a == $c, $a >= $d and $b <= 3"
|} else {
|		puts "$a == $c, $a >= $d and $b >= 3"
|}

~Responses to Revision 1.2

Revision 1.3 tries to reflect suggestions from various of the following contributors. Thanks.

John Ousterhaut wrote:

 > This is certainly a novel suggestion, but I'm not sure how useful it
   is. The proposed new command doesn't seem much clearer or much more
<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|
|

|
|
|
|

|
|

|

|
|

|
|

|

|

|
|

|

|

|
|

|

|

|

|
|
|
|
|
|
|
|
|
<
>

|

|

|
|

|
|

|
|

|

|
|
|
<
>
|
<
>

|

|

|

|

|

|

|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
<
>
|
|
|
<
<
<
>
>
>
|
|
<
>
|
|
<
<
|
>
>
|

|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
|
>
|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110

111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147

148
149

150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184

185
186
187
188
189
190

191
192
193

194
195
196
197

198
199
200
201
202

203
204
205

206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229

230
231
232
233
234

235
236
237
238
239
240
241
242
243
244

# TIP 70: A Relational Switch Control Structure

	Author:         Bhushit Joshipura <[email protected]>
	Author:         Donal K. Fellows <[email protected]>
	State:          Withdrawn
	Type:           Project
	Vote:           Pending
	Created:        20-Oct-2001
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

This TIP proposes the introduction of a new control structure, _rswitch_,
which is a relational parallel to switch-case control structure. It
consists of two lists: condition list and situation-reaction list. At
the maximum two conditions can be specified. Based on situation,
reaction is executed. The situation is selected on "first true and
only the first true" basis.

# Rationale

Theoretically only two controls - _if_ and _goto_ - are sufficient
to implement all algorithms. However, languages provide more control
structures for better representation of algorithms. To many structural
programmers like me, a _switch_ statement gives much better picture of
the program than equivalent _if_-_elseif_-...-_else_ chain. It
pronounces the course of decision of a big chunk of program in a single
statement, making understanding and maintaining software easier. It
also helps to optimize the software better if it is written in
_switch_ form. However, _switch_ is strictly data based i.e.
_switch_ happens strictly on data value. 

The proposed _rswitch_ command is a control structure similar to \(and
more general than\) _switch_. \(As a matter of fact, Tcl's _foreach_
control structure is a special case of its for control structure.\) Using
_rswitch_ it should be possible to take decisions based on relations
between entities.

In response to comments on Revision: 1.2 of the draft, Bhushit
Joshipura wrote [Re-edited]:

Why rswitch? [Re-written] 

 1. if..elseif..else contains elements of surprize spread in the code.
    From maintenance point of view, once the
    _if_.._elseif_.._else_ code goes beyond horizon \(one display
    length\), burden of reference retention comes on human brain making
    maintenance bug-prone.

 2. In case of _rswitch_, if a situation refers to one or more
    conditions, we had reduced this burden by stating upfront which
    variable is in the spotlight. The maintainer can then easily jump
    irrelavant cases. 

 3. In _if_.._elseif_.._else_, a string of three conditions \(with
    one of them being a \`not' string\) can mystify me for at least an
    hour. 

 4. In case of an _rswitch_ \(even in a situation which does not
    refer to any conditions\):

	 > 1. \`and' is nested rswitch

	 > 2. \`or' is a fall through case

	 > 3. \`not' could be written as "default" case. \`not' free from \`and'
    and \`or' is less confusing. We can write almost a \`not-less' code
    using "default". 

	 > Logical connectives result into visual presentation.

 5. Object Oriented Programming is trying to eliminate _switch-case_
    statements by identifying localization of references and finding a
    heirarchy of data-actions. 

 6. In similar way one can think of _rswitch_ cases \(situations\).
    Identifying localized references \(w.r.t. situations\) and a
    heirarchy of situations-reactions. However, this is a research
    issue.

Moreover, the writer does not need to be an artist to be able to write understandable code.

# Implementation in Other Languages

I queried about proposal of such a control structure in C to Dr. Dennis
M. Ritchie in February 2001. \(At that time I thought only of
bi-conditional relational switch. See a few pages down for currently
proposed control structure.\)

 >  Absence of relational switch I know this can be odd for other languages - but not for C.
    C is so near to machine and a relational switch could be ideal for many many
    machine-cycle saving situations.

 >  Apart from machine-orientedness, it could avoid many usages of not-so-structured ?:
    operator.

 >  It could simplify a lot of control and signal processing code too.

 >  Why did C become more data-biased for a control structure?

	    relational-switch(expr1,expr2){
	    case ==: statements;
	            break;
	    case > : statements;
	            break;
	    case < : statements;
	            break;
	    default: statements;
	            break;

	    }

TCL need not be so optimized, as C has to be. However, clarity and
maintainability remain formidable reasons for relational switch
implementation.

In a quick reply, Dr. Ritchie wrote back:

 >  The relational switch idea is \(so far as I know\) for C a new suggestion, although I have
    no idea of all things that were proposed for the 1989 or 1999 C standards. If seems to
    hark back to the old Fortran GOTO statement

	    IF (expression) 2, 3, 4

 >  which went to various places depending on whether the expression was -, 0 or \+. It's
    also a bit strange syntactically \(though it might work in the grammar\) in that the case
    values aren't expressions, but just operators.

 >  Regards, Dennis"

Thus the structure is absent from C and its whole family. It is absent
from Pascal, PERL, BASIC, shell scripts - and of course, TCL.

Fortran's computed goto is near to bi-conditional rswitch. \(That way
Chimpanzees are near to Homo sapiens too.\) However, clarity of
presentation of default and fall through are not achievable through
computed goto. Mono-conditional rswitch, however, does not have a
parallel in languages of my knowledge \(C, C\+\+, Java, Pascal, BASIC,
Fortran, shell scripts\). 

# Grammar and Behavior

Overall:

	rswitch {[condition(s)]} {
	    <situation-1> {
	        <reaction-block-1>

	    }
	    ...

	}

The condition list may have no, one or two variables.

A situation is legal if:

 1. It is a valid expression or

 2. If the condition list had at least one element 'x', \{$x $situation\} is a valid expression or

	 > 1. If the condition list had exactly one element 'x', \{$situation $x\} is a valid expression or

	 > 2. If the condition list had two elements 'x' and 'y', \{$x $situation $y\} is a valid expression or

 3. If the condition list had two elements 'x' and 'y' and \{$situation $y\} is a valid expression or

 4. If $situation == "default"

A reaction block is legal if:

 1. It is not the last block and $reactionBlock == "-" \(fall through\) or

 2. It is a valid TCL action block

Let us call a non-default extracted valid expression a SITUATION.

At execution, reaction block following or fell through by the first and only the first SITUATION that becomes true, is executed. In case no SITUATION becomes true and default situation is present, reaction block following or fell through by default statement is executed. Default situation is not necessary for operation of rswitch. An rswitch without any situation-reaction block is grammatically valid.

# Sample Invocations

	# Full length condition block. Second condition perhaps got redundant with maintenance.
	rswitch {$a $b} {
	   {> $c} -
	   {< $d} {
	      puts "$a is either > $c or < $d or both"

	   }
	   {< $c} {
	      # Full length condition block. Second condition is used.
	      rswitch {$a $d} {
	         > {
	            puts "$a is < $c AND > $d"

	         }
	         == {
	            puts "$a and $d are equal and they are < $c"

	         }
	         default {
	            puts "$a is < $c BUT <= $d"
	            puts "should never come here"

	         }
	      }
	   }
	   {3 > } {
	      puts "$a == $c, $a >= $d and $b <= 3"

	   }
	   default {
	      puts "$a == $c, $a >= $d and $b >= 3"

	   }
	}

# Contrast

Contrast above code with its if-elseif-else equivalent. Notice that:

 1. Both the examples have same effect.

 2. Both examples are indented with the same style.

		if {($a > $c) || ($a < $d)} {
			# could you see a maintenance nightmare that could have arisen when
			# reference to $b were eliminated?
			puts "$a is either > $c or < $d or both"
		} elseif {$a < $c} {
			if {$a > $d} {
				puts "$a is < $c AND > $d"
			} elseif {$a == $d} {
				puts "$a and $d are equal and they are < $c"
			} else {
				puts "Pop-up question: What should we have here?
				puts "$a is < $c BUT <= $d"
				puts "should never come here"

			}
		} elseif {3 > $b} {
				puts "$a == $c, $a >= $d and $b <= 3"
		} else {
				puts "$a == $c, $a >= $d and $b >= 3"

		}

# Responses to Revision 1.2

Revision 1.3 tries to reflect suggestions from various of the following contributors. Thanks.

John Ousterhaut wrote:

 > This is certainly a novel suggestion, but I'm not sure how useful it
   is. The proposed new command doesn't seem much clearer or much more

︙ ︙ 
273
274
275
276
277
278
279
280
281
282
283
284
285

286
287
288
289
290
291
292
293
294
295
296
297
298
299

300
301
302
303
304
305
306
307
308
309
310
311
312
313

314
315
316
317
318
319
320
321
322
323

324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347

348
349
350
351
352
353
354
355
356
357
358
359
360

   can accept multiple conditional relationships in a single statement.

 > I can also think of another usage of 'rswitch' which doesn't take
   any arguments at all. A vanilla version which is just replacement to
   if-elseif-elseif-..-else structure but only that code is more easier
   to read...

|rswitch { 
|	($a > 4): /* block 1 */ 
|	($b < 100): /* block 2 */
|	($c > 5 ): /* block 3 */ 
|	($d == 10): /*block 4 */ 
|}

Don Porter wrote:

 > Rather than defining two forms of the command, mono-conditional and
   bi-conditional, why not use the power of Tcl to allow for both and
   even more possibilities within a singleform? 

 > Consider:

|rswitch $formatString { 
|	$sub1 $body1
|	...
|	$subN $bodyN 
|} 

 > Then have [rswitch] construct the Tcl expressions to be tested 
   using [format]: 

|format $formatString $sub1 

 > So you could have: 

|rswitch {$a %s $b} { 
|	> {puts "$a is greater than $b"} 
|	< {puts "$a is less than $b"} 
|	== {puts "$a equals $b"} 
|} 

 > or

|rswitch {$a %s} {
|	1 {puts "$a > 1"}
|	5 {puts "$a > 5"}
|	15 {puts "$a > 15"}
|	{>$b} {puts "$a > $b"}
|	{<$b} - > {==$b} {puts "$a <= $b"} 
|} 

 > Extending this idea further, consider the possibility of using
   [[format]] to create the expression like so:

|eval [linsert $sub1 0 format $formatString] 

 > Then the substitutions could be lists of multiple values to
   substitute into multiple %-conversion specifiers in the format
   string, allowing for the construction of quite elaborate
   expressions.

Brent Welch wrote

 > I like Don's suggestion. I'm reminded of the switch statement in
   the tclhttpd state machine, crafted by Steve Uhler:

|set state [string compare $readCount 0],$data(state) 
|switch -glob -- $state {
|	1,start { # Read something in the start state } 
|	0,start { # Read empty line in the start state 
|	1,mime  { # Read something in the mime state }
|	0,mime  { # Read blank line in the mime state }
|	-1,*    { # Read error in any state }
|	default { # Unexpected condition }
|}

 > I had had a bunch of nested if-then-else's, of course. With an
   artful creation of the switch value and the power of things like
   glob, you can really create compact, expressive switch statements
   already.

~Sample Implementation

Will be provided later.

~Copyright

This document is placed in public domain.

|
|
|
|
|
<
>

|
|
|
|
<
>

|

|
|
|
|
<
|
>

|
|
|
|
|
|
<
|
>

|

|

|
|
|
|
|
|
|
|
<
>

|

|

>
272
273
274
275
276
277
278
279
280
281
282
283

284
285
286
287
288
289
290
291
292
293
294
295
296
297

298
299
300
301
302
303
304
305
306
307
308
309
310

311
312
313
314
315
316
317
318
319
320

321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345

346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
   can accept multiple conditional relationships in a single statement.

 > I can also think of another usage of 'rswitch' which doesn't take
   any arguments at all. A vanilla version which is just replacement to
   if-elseif-elseif-..-else structure but only that code is more easier
   to read...

	rswitch { 
		($a > 4): /* block 1 */ 
		($b < 100): /* block 2 */
		($c > 5 ): /* block 3 */ 
		($d == 10): /*block 4 */ 

	}

Don Porter wrote:

 > Rather than defining two forms of the command, mono-conditional and
   bi-conditional, why not use the power of Tcl to allow for both and
   even more possibilities within a singleform? 

 > Consider:

	rswitch $formatString { 
		$sub1 $body1
		...
		$subN $bodyN 

	} 

 > Then have [rswitch] construct the Tcl expressions to be tested 
   using [format]: 

	format $formatString $sub1 

 > So you could have: 

	rswitch {$a %s $b} { 
		> {puts "$a is greater than $b"} 
		< {puts "$a is less than $b"} 
		== {puts "$a equals $b"} 

	} 

 > or

	rswitch {$a %s} {
		1 {puts "$a > 1"}
		5 {puts "$a > 5"}
		15 {puts "$a > 15"}
		{>$b} {puts "$a > $b"}
		{<$b} - > {==$b} {puts "$a <= $b"} 

	} 

 > Extending this idea further, consider the possibility of using
   [format] to create the expression like so:

	eval [linsert $sub1 0 format $formatString] 

 > Then the substitutions could be lists of multiple values to
   substitute into multiple %-conversion specifiers in the format
   string, allowing for the construction of quite elaborate
   expressions.

Brent Welch wrote

 > I like Don's suggestion. I'm reminded of the switch statement in
   the tclhttpd state machine, crafted by Steve Uhler:

	set state [string compare $readCount 0],$data(state) 
	switch -glob -- $state {
		1,start { # Read something in the start state } 
		0,start { # Read empty line in the start state 
		1,mime  { # Read something in the mime state }
		0,mime  { # Read blank line in the mime state }
		-1,*    { # Read error in any state }
		default { # Unexpected condition }

	}

 > I had had a bunch of nested if-then-else's, of course. With an
   artful creation of the switch value and the power of things like
   glob, you can really create compact, expressive switch statements
   already.

# Sample Implementation

Will be provided later.

# Copyright

This document is placed in public domain.

Name change from tip/71.tip to tip/71.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92

TIP:            71
Title:          Tk Bitmap Image Improvements
Version:        $Revision: 1.14 $
Author:         Chris Nelson <[email protected]>
Author:         Kevin Kenny <[email protected]>
Author:         Eric Melski <[email protected]>
Author:         Donal K. Fellows <[email protected]>
State:          Withdrawn
Type:           Project
Vote:           Pending
Created:        26-Oct-2001
Post-History:   
Tcl-Version:    8.5

~ Abstract

Tk has a number of pre-defined bitmaps (10 on all platforms) but it
lacks a number of bitmaps useful for creating GUI elements.  This TIP
adds several such bitmaps (as bitmap images).

~ New Bitmaps

Many complex widgets like comboboxes, spinboxes, etc. require arrows
pictures on buttons.  While newer releases of Tk have added more
widgets, there will always be some unforeseen need for new or
customized widgets.  One example is a menubutton which, according to
the Microsoft Windows User Experience
[http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnwue/html/welcome.asp],
should have a downward arrow on the right side.  With compound
buttons, it is not hard to do:

|   button .mb -text Tools -image downarrow -compound right

but there is no stock down-arrow image.  

I propose to add 12 bitmap images providing all four directions (up,
down, left, and right) in three sizes (3x2, 5x3, and 7x4) in black.
The down arrows would look something like:

|   @@@@@@@   @@@@@      @@@
|   .@@@@@.   .@@@.      .@.
|   ..@@@..   ..@..    
|   ...@...

I propose the following names:

|   arrow_u7x4      arrow_u5x3      arrow_u3x2
|   arrow_d7x4      arrow_d5x3      arrow_d3x2
|   arrow_l7x4      arrow_l5x3      arrow_l3x2
|   arrow_r7x4      arrow_r5x3      arrow_r3x2

I'm mindful of the fact that adding new predefined bitmap images has
the potential to collide with application-defined images or other
commands but I'm unsure of the workaround for that.

~ Reference Implementation

SourceForge patch 475332 provided a reference implementation of a
previous version of this proposal
[http://sf.net/tracker/?func=detail&aid=475332&group_id=12997&atid=312997].
This version is not implemented yet.

~ Commentary

''Donal K. Fellows <[email protected]> writes:''

 > Previous versions of this TIP proposed fixing the problem using
   bitmaps instead of bitmap images and added an infrastructure for
   tracking those bitmaps.  Since I think that ultimately we should be
   getting rid of bitmaps and instead using something based on the
   image infrastructure (which already has proper introspection
   support) those parts of this TIP have been removed.  However,
   making the changes to effect the switch to using bitmap images
   instead of bitmaps for things like stippes, cursors, etc. lies
   outside the scope of this TIP.

''Donal K. Fellows <[email protected]> writes:''

 > In the long period since this TIP was proposed, the world of GUIs
   has moved on somewhat.  Although the requirement for arrows remains
   the same, the solutions proposed in this TIP (both originally and
   as it now stands) do not permit the sort of graphical snazziness
   that modern users tend to expect.  Nor is there a sufficient range
   of sizes for a reasonable selection to be available for a modern
   display; even the largest of those arrows would look unusably tiny
   on my desktop!  This indicates that a completely different solution
   is required, which in turn would be better stated as a separate
   TIP.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|
|

|
|
|
|

|
|
|
|

|

|

|

|

|
|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92

# TIP 71: Tk Bitmap Image Improvements

	Author:         Chris Nelson <[email protected]>
	Author:         Kevin Kenny <[email protected]>
	Author:         Eric Melski <[email protected]>
	Author:         Donal K. Fellows <[email protected]>
	State:          Withdrawn
	Type:           Project
	Vote:           Pending
	Created:        26-Oct-2001
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

Tk has a number of pre-defined bitmaps \(10 on all platforms\) but it
lacks a number of bitmaps useful for creating GUI elements.  This TIP
adds several such bitmaps \(as bitmap images\).

# New Bitmaps

Many complex widgets like comboboxes, spinboxes, etc. require arrows
pictures on buttons.  While newer releases of Tk have added more
widgets, there will always be some unforeseen need for new or
customized widgets.  One example is a menubutton which, according to
the Microsoft Windows User Experience
<http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnwue/html/welcome.asp> ,
should have a downward arrow on the right side.  With compound
buttons, it is not hard to do:

	   button .mb -text Tools -image downarrow -compound right

but there is no stock down-arrow image.  

I propose to add 12 bitmap images providing all four directions \(up,
down, left, and right\) in three sizes \(3x2, 5x3, and 7x4\) in black.
The down arrows would look something like:

	   @@@@@@@   @@@@@      @@@
	   .@@@@@.   .@@@.      .@.
	   ..@@@..   ..@..    
	   ...@...

I propose the following names:

	   arrow_u7x4      arrow_u5x3      arrow_u3x2
	   arrow_d7x4      arrow_d5x3      arrow_d3x2
	   arrow_l7x4      arrow_l5x3      arrow_l3x2
	   arrow_r7x4      arrow_r5x3      arrow_r3x2

I'm mindful of the fact that adding new predefined bitmap images has
the potential to collide with application-defined images or other
commands but I'm unsure of the workaround for that.

# Reference Implementation

SourceForge patch 475332 provided a reference implementation of a
previous version of this proposal
<http://sf.net/tracker/?func=detail&aid=475332&group_id=12997&atid=312997> .
This version is not implemented yet.

# Commentary

_Donal K. Fellows <[email protected]> writes:_

 > Previous versions of this TIP proposed fixing the problem using
   bitmaps instead of bitmap images and added an infrastructure for
   tracking those bitmaps.  Since I think that ultimately we should be
   getting rid of bitmaps and instead using something based on the
   image infrastructure \(which already has proper introspection
   support\) those parts of this TIP have been removed.  However,
   making the changes to effect the switch to using bitmap images
   instead of bitmaps for things like stippes, cursors, etc. lies
   outside the scope of this TIP.

_Donal K. Fellows <[email protected]> writes:_

 > In the long period since this TIP was proposed, the world of GUIs
   has moved on somewhat.  Although the requirement for arrows remains
   the same, the solutions proposed in this TIP \(both originally and
   as it now stands\) do not permit the sort of graphical snazziness
   that modern users tend to expect.  Nor is there a sufficient range
   of sizes for a reasonable selection to be available for a modern
   display; even the largest of those arrows would look unusably tiny
   on my desktop!  This indicates that a completely different solution
   is required, which in turn would be better stated as a separate
   TIP.

# Copyright

This document has been placed in the public domain.

Name change from tip/72.tip to tip/72.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223

TIP:            72
Title:          64-Bit Value Support for Tcl on 32-Bit Platforms
Version:        $Revision: 1.10 $
Author:         Donal K. Fellows <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        05-Nov-2001
Post-History:   
Tcl-Version:    8.4

~ Abstract

This TIP adds the capability to perform computations on values that
are (at least) 64-bits wide even on 32-bit platforms.  It also adds
support for handling files that are larger than 2GB large on those
platforms (where supported by the underlying platform and filing
system).

~ Rationale

There have been a number of requests, and from a whole range of
application areas, for Tcl to be enhanced to handle 64-bit values even
on platforms where that is larger than the native word size, and the
vast majority of C compilers support a large enough arithmetic type
(often called ''long long'' though other names are common on the
Windows platform.)  Such areas include:

 * ''large-file support'' for people working with lots of data.  Note
   that at the moment Tcl cannot even report the file type for a file
   that is larger than 2GB in size.

 * ''large value support'' for people working with network addresses
   (this is likely to come up more in the future with IPv6.)

However, a number of existing algorithms assume that integer
arithmetic operations wrap at 32-bits (demonstrating the need for
''semantic backward-compatibility'' so termed because a recompile of
the C portions of the relevant code will not fix the problem) and
there are many existing extensions that assume a particular word-size
too (requiring ''syntactic backward-compatibility'' because
recompilation will probably cure the problem.)  Hence any upgrade of
Tcl's functionality must be done carefully so as to preserve as much
backward compatibility as possible.

~ Proposed for Changes

To resolve this problem, I will introduce:

 1. A new pair of types at the C level to represent signed and
    unsigned values with a width of at least 64-bits.  These types
    will be called ''Tcl_WideInt'' and ''Tcl_WideUInt'' respectively.
    On 64-bit platforms (and 32-bit platforms where there is no
    compiler support for arithmetic 64-bit types) these will be
    typedef'ed to ''long'' to preserve as much inter-platform
    compatibility as possible.

 >  The type names are based on the term ''Wide'' as opposed to either
    ''Long'' or ''LongLong'' because the first causes a problem with
    existing Tcl APIs (''Tcl_GetLongFromObj'' for example) and the
    second because it is both longer and less mnemonic.  Not all Tcl
    platforms are built with compilers that understand ''long long''
    in the first place, and the major factor in its favour at the C
    level was almost certainly the fact that it did not introduce any
    new reserved words into the C syntax which would have had a major
    backward-compatibility impact - we are not bound by such things
    and can choose to suit ourselves.

 2. A new field of type ''Tcl_WideInt'' in the internalRep union of
    the ''Tcl_Obj'' type.  Note that this is 100% backward compatible
    since the union already contains a field that is a pair of
    pointers (each of which I assume to be at least 32-bits wide.)  

 3. A new object type of 64-bit wide values together with accessor
    functions to create, modify and retrieve from objects of that type
    called ''Tcl_NewWideIntObj'', ''Tcl_SetWideIntObj'' and
    ''Tcl_GetWideIntFromObj'' (on platforms where ''Tcl_WideInt'' is
    not distinct from ''long'', these will be all redirected to the
    previously existing integer type.)

 4. The [[expr]] command shall be reworked so that:

 >  * If a constant looks like a signed integer (i.e. it lies between
      INT_MIN and INT_MAX inclusive) it is treated as such.  Otherwise
      if it looks like an integer of any size, an attempt will be made
      to treat it like a wide integer, and if that fails or it doesn't
      look like an integer at all, it will be treated as a double.
      ''Note'' that this will be a source of a potential backwards
      incompatibility with scripts that include values that are meant
      to be unsigned integers.

 >  * With arithmetic operations, the output will be a double if at
      least one of the operands is a double, a wide integer if at
      least one of the operands is a wide integer, and a normal
      integer otherwise.  (The main exception to this will be the left
      and right shift operations where the type of the second operand
      will not affect the type of the result.)

 >  * The ''int()'' pseudo-function will always return a non-wide
      integer (converting by dropping the high bits) and the new
      pseudo-function ''wide()'' will always return a wide integer
      (converting by sign-extending.)  On platforms without a distinct
      64-bit type, these operations will behave identically.

 >  * User-defined functions will be able to gain access to the wide
      integer through an extra ''wideValue'' field in the
      ''Tcl_Value'' structure and TCL_WIDE_INT (which will be the same
      as TCL_INT on platforms without a distinct 64-bit type) value in
      the ''Tcl_ValueType'' enumeration.

 5. The [[incr]] command will be able to increment variables
    containing 64-bit values correctly, but will only accept 32-bit
    values as amounts to increment by.

 6. ''Tcl_Seek'' and ''Tcl_Tell'' (together will all channel drivers)
    will be updated to use the new 64-bit type for offsets (which will
    reflect at the Tcl level in the [[seek]] and [[tell]] commands)
    though a compatibility interface for old extensions that do not
    supply a channel driver will be maintained (though the size of
    offset reportable through the interface will naturally be limited.)

 7. ''Tcl_FSStat'' and ''Tcl_FSLstat'' will all be
    updated to use a stat structure reference that can contain 64-bit
    wide values.  This will enable various [[file]] subcommands (and
    [[glob]] with some options) to work correctly with files over 2GB
    in size.  Note that there is no neat way to do this in a backward
    compatible way as there is currently no guarantee on which fields
    will actually be present in the structure, but those functions
    have never been available outside an alpha...

 >  Because the name of a suitable structure varies considerably
    between platforms, a new type, ''Tcl_StatBuf'', will be declared
    to be the type of the structure which a pointer to should be
    passed to the stat-related functions. A new function,
    ''Tcl_AllocStatBuf'', will be provided to allow extensions to
    allocate a buffer of the correct size whatever the platform.

 >  Note that ''Tcl_Stat'' will written to contain
    backward-compatability code so that code that references it will
    work unchanged.

 8. The ''format'' and ''scan'' commands will gain a significance to
    the ''l'' modifier to their integer-handling conversion specifiers
    (d, u, i, o and x) which will tell them to work with 64-bit values
    (if those are not the default for the platform anyway.)

 9. The ''binary'' command will gain new ''w'' and ''W'' specifiers
    for its ''format'' and ''scan'' subcommands.  These will operate
    on 64-bit wide values in a fashion analogous to the existing ''i''
    and ''I'' specifiers (i.e. smallest byte to largest, and largest
    byte to smallest respectively.)

 10. New compatibility functions will also be provided, because not
     all platforms have convenient equivalent functions to ''strtoll''
     and ''strtoull''.

 11. ''Tcl_LinkVar'' will be extended to be given the ability to link
     with a wide C variable (via a TCL_LINK_WIDE_INT flag).

 12. The ''tcl_platform'' array will gain a new member, ''wordSize'',
     which will give the native size of machine words on the host
     platform (actually whatever ''sizeof(long)'' returns.)

~ Summary of Incompatibilities and Fixes

The behaviour of expressions containing constants that appear positive
but which have a negative internal representation will change, as
these will now usually be interpreted as wide integers.  This is
always fixable by replacing the constant with ''int(''constant'')''.

Extensions creating new channel types will need
to be altered as different types are now in use in those areas.  The
change to the declaration of ''Tcl_FSStat'' and ''Tcl_FSLstat'' (which
are the new preferred API in any case) are less serious as no
non-alpha releases have been made yet with those API functions.

Scripts that are lax about the use of the ''l'' modifier in ''format''
and ''scan'' will probably need to be rewritten.  This should be very
uncommon though as previously it had absolutely no effect.

Extensions that create new math functions that take more than one
argument will need to be recompiled (the size of ''Tcl_Value''
changes), and functions that accept arguments of any type
(''TCL_EITHER'') will need to be rewritten to handle wide integer
values.  (I do not expect this to affect many extensions at all.)

~ Why Tcl_WideInt?

I chose the name ''Tcl_WideInt'' for the type because it represents a
wider-than-normal integer.  Alternatives that were considered and
rejected were:

 Tcl_LongLong: This takes its name from the name of the underlying C
    type used in many UNIX compilers, but that in turn was chosen
    because it meant that no new keywords would be added to the
    language, and not out of any feeling that the type name itself is
    of any wider merit. Seeing as Tcl is a keyword-less language, there
    is no particular reason for going down this route (which would
    lead to things like a ''longlong()'' type conversion function added
    to the [[expr]] command, which is really very ugly indeed...) It
    is also not universally the name of the underlying type; the Windows
    world is different (as usual.)

 Tcl_Int64: This name, by contrast, comes more from the Windows world.
    It's major problem is that it specifies eternally what the size of
    the type is, whereas at some point in the future (when 64-bit words
    are the norm) we may want to support something wider still (though
    I do not yet know what uses we would put 128-bit integers to.)  I
    believe that the name of a type is part of its specification, but
    that the size of the type is less so.  ''Tcl_Int64'' is also ugly
    when it comes to derivations of the name for things like the type
    converter in [[expr]] (again) and the names of variables containing
    values of the type (internally, as formal parameters, and as fields
    of structures) and may well clash on systems where the C compiler
    gives real meaning to ''int64'' by default.  By contrast,
    ''Tcl_WideInt'' lends itself well to generating variable names
    (''wideValue'', ''widePtr'', etc., and even just plain ''w'' in the
    implementation of the bytecode execution engine) which, as the
    person implementing the changes, was a major consideration.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|
|

|

|
|

|

|
|

|
|
|

|
|

|

|
|
|
|

|
|
|

|

|
|

|

|
|
|
|

|

|
|

|

|

|

|

|
|
|
|

|
|
|
|
|

|

|
|
|

|
|

|

|
|

|
|

|

|

|
|
|
|

|
|
|
|
|

|
|

|
|

|

|

|

|

|
|

|
|

|
|
|
|

|

|

|

|
|
|

|

|

|
|
|

|

|
|
|
|
|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223

# TIP 72: 64-Bit Value Support for Tcl on 32-Bit Platforms

	Author:         Donal K. Fellows <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        05-Nov-2001
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

This TIP adds the capability to perform computations on values that
are \(at least\) 64-bits wide even on 32-bit platforms.  It also adds
support for handling files that are larger than 2GB large on those
platforms \(where supported by the underlying platform and filing
system\).

# Rationale

There have been a number of requests, and from a whole range of
application areas, for Tcl to be enhanced to handle 64-bit values even
on platforms where that is larger than the native word size, and the
vast majority of C compilers support a large enough arithmetic type
\(often called _long long_ though other names are common on the
Windows platform.\)  Such areas include:

 * _large-file support_ for people working with lots of data.  Note
   that at the moment Tcl cannot even report the file type for a file
   that is larger than 2GB in size.

 * _large value support_ for people working with network addresses
   \(this is likely to come up more in the future with IPv6.\)

However, a number of existing algorithms assume that integer
arithmetic operations wrap at 32-bits \(demonstrating the need for
_semantic backward-compatibility_ so termed because a recompile of
the C portions of the relevant code will not fix the problem\) and
there are many existing extensions that assume a particular word-size
too \(requiring _syntactic backward-compatibility_ because
recompilation will probably cure the problem.\)  Hence any upgrade of
Tcl's functionality must be done carefully so as to preserve as much
backward compatibility as possible.

# Proposed for Changes

To resolve this problem, I will introduce:

 1. A new pair of types at the C level to represent signed and
    unsigned values with a width of at least 64-bits.  These types
    will be called _Tcl\_WideInt_ and _Tcl\_WideUInt_ respectively.
    On 64-bit platforms \(and 32-bit platforms where there is no
    compiler support for arithmetic 64-bit types\) these will be
    typedef'ed to _long_ to preserve as much inter-platform
    compatibility as possible.

	 >  The type names are based on the term _Wide_ as opposed to either
    _Long_ or _LongLong_ because the first causes a problem with
    existing Tcl APIs \(_Tcl\_GetLongFromObj_ for example\) and the
    second because it is both longer and less mnemonic.  Not all Tcl
    platforms are built with compilers that understand _long long_
    in the first place, and the major factor in its favour at the C
    level was almost certainly the fact that it did not introduce any
    new reserved words into the C syntax which would have had a major
    backward-compatibility impact - we are not bound by such things
    and can choose to suit ourselves.

 2. A new field of type _Tcl\_WideInt_ in the internalRep union of
    the _Tcl\_Obj_ type.  Note that this is 100% backward compatible
    since the union already contains a field that is a pair of
    pointers \(each of which I assume to be at least 32-bits wide.\)  

 3. A new object type of 64-bit wide values together with accessor
    functions to create, modify and retrieve from objects of that type
    called _Tcl\_NewWideIntObj_, _Tcl\_SetWideIntObj_ and
    _Tcl\_GetWideIntFromObj_ \(on platforms where _Tcl\_WideInt_ is
    not distinct from _long_, these will be all redirected to the
    previously existing integer type.\)

 4. The [expr] command shall be reworked so that:

	 >  \* If a constant looks like a signed integer \(i.e. it lies between
      INT\_MIN and INT\_MAX inclusive\) it is treated as such.  Otherwise
      if it looks like an integer of any size, an attempt will be made
      to treat it like a wide integer, and if that fails or it doesn't
      look like an integer at all, it will be treated as a double.
      _Note_ that this will be a source of a potential backwards
      incompatibility with scripts that include values that are meant
      to be unsigned integers.

	 >  \* With arithmetic operations, the output will be a double if at
      least one of the operands is a double, a wide integer if at
      least one of the operands is a wide integer, and a normal
      integer otherwise.  \(The main exception to this will be the left
      and right shift operations where the type of the second operand
      will not affect the type of the result.\)

	 >  \* The _int\(\)_ pseudo-function will always return a non-wide
      integer \(converting by dropping the high bits\) and the new
      pseudo-function _wide\(\)_ will always return a wide integer
      \(converting by sign-extending.\)  On platforms without a distinct
      64-bit type, these operations will behave identically.

	 >  \* User-defined functions will be able to gain access to the wide
      integer through an extra _wideValue_ field in the
      _Tcl\_Value_ structure and TCL\_WIDE\_INT \(which will be the same
      as TCL\_INT on platforms without a distinct 64-bit type\) value in
      the _Tcl\_ValueType_ enumeration.

 5. The [incr] command will be able to increment variables
    containing 64-bit values correctly, but will only accept 32-bit
    values as amounts to increment by.

 6. _Tcl\_Seek_ and _Tcl\_Tell_ \(together will all channel drivers\)
    will be updated to use the new 64-bit type for offsets \(which will
    reflect at the Tcl level in the [seek] and [tell] commands\)
    though a compatibility interface for old extensions that do not
    supply a channel driver will be maintained \(though the size of
    offset reportable through the interface will naturally be limited.\)

 7. _Tcl\_FSStat_ and _Tcl\_FSLstat_ will all be
    updated to use a stat structure reference that can contain 64-bit
    wide values.  This will enable various [file] subcommands \(and
    [glob] with some options\) to work correctly with files over 2GB
    in size.  Note that there is no neat way to do this in a backward
    compatible way as there is currently no guarantee on which fields
    will actually be present in the structure, but those functions
    have never been available outside an alpha...

	 >  Because the name of a suitable structure varies considerably
    between platforms, a new type, _Tcl\_StatBuf_, will be declared
    to be the type of the structure which a pointer to should be
    passed to the stat-related functions. A new function,
    _Tcl\_AllocStatBuf_, will be provided to allow extensions to
    allocate a buffer of the correct size whatever the platform.

	 >  Note that _Tcl\_Stat_ will written to contain
    backward-compatability code so that code that references it will
    work unchanged.

 8. The _format_ and _scan_ commands will gain a significance to
    the _l_ modifier to their integer-handling conversion specifiers
    \(d, u, i, o and x\) which will tell them to work with 64-bit values
    \(if those are not the default for the platform anyway.\)

 9. The _binary_ command will gain new _w_ and _W_ specifiers
    for its _format_ and _scan_ subcommands.  These will operate
    on 64-bit wide values in a fashion analogous to the existing _i_
    and _I_ specifiers \(i.e. smallest byte to largest, and largest
    byte to smallest respectively.\)

 10. New compatibility functions will also be provided, because not
     all platforms have convenient equivalent functions to _strtoll_
     and _strtoull_.

 11. _Tcl\_LinkVar_ will be extended to be given the ability to link
     with a wide C variable \(via a TCL\_LINK\_WIDE\_INT flag\).

 12. The _tcl\_platform_ array will gain a new member, _wordSize_,
     which will give the native size of machine words on the host
     platform \(actually whatever _sizeof\(long\)_ returns.\)

# Summary of Incompatibilities and Fixes

The behaviour of expressions containing constants that appear positive
but which have a negative internal representation will change, as
these will now usually be interpreted as wide integers.  This is
always fixable by replacing the constant with _int\(_constant_\)_.

Extensions creating new channel types will need
to be altered as different types are now in use in those areas.  The
change to the declaration of _Tcl\_FSStat_ and _Tcl\_FSLstat_ \(which
are the new preferred API in any case\) are less serious as no
non-alpha releases have been made yet with those API functions.

Scripts that are lax about the use of the _l_ modifier in _format_
and _scan_ will probably need to be rewritten.  This should be very
uncommon though as previously it had absolutely no effect.

Extensions that create new math functions that take more than one
argument will need to be recompiled \(the size of _Tcl\_Value_
changes\), and functions that accept arguments of any type
\(_TCL\_EITHER_\) will need to be rewritten to handle wide integer
values.  \(I do not expect this to affect many extensions at all.\)

# Why Tcl\_WideInt?

I chose the name _Tcl\_WideInt_ for the type because it represents a
wider-than-normal integer.  Alternatives that were considered and
rejected were:

 Tcl\_LongLong: This takes its name from the name of the underlying C
    type used in many UNIX compilers, but that in turn was chosen
    because it meant that no new keywords would be added to the
    language, and not out of any feeling that the type name itself is
    of any wider merit. Seeing as Tcl is a keyword-less language, there
    is no particular reason for going down this route \(which would
    lead to things like a _longlong\(\)_ type conversion function added
    to the [expr] command, which is really very ugly indeed...\) It
    is also not universally the name of the underlying type; the Windows
    world is different \(as usual.\)

 Tcl\_Int64: This name, by contrast, comes more from the Windows world.
    It's major problem is that it specifies eternally what the size of
    the type is, whereas at some point in the future \(when 64-bit words
    are the norm\) we may want to support something wider still \(though
    I do not yet know what uses we would put 128-bit integers to.\)  I
    believe that the name of a type is part of its specification, but
    that the size of the type is less so.  _Tcl\_Int64_ is also ugly
    when it comes to derivations of the name for things like the type
    converter in [expr] \(again\) and the names of variables containing
    values of the type \(internally, as formal parameters, and as fields
    of structures\) and may well clash on systems where the C compiler
    gives real meaning to _int64_ by default.  By contrast,
    _Tcl\_WideInt_ lends itself well to generating variable names
    \(_wideValue_, _widePtr_, etc., and even just plain _w_ in the
    implementation of the bytecode execution engine\) which, as the
    person implementing the changes, was a major consideration.

# Copyright

This document has been placed in the public domain.

Name change from tip/73.tip to tip/73.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45

TIP:		73
Title:          Export Tcl_GetTime in the Public API
State:          Final
Type:           Project
Tcl-Version:    8.4
Vote:           Done
Post-History:   
Version:	$Revision: 1.4 $
Author:		Kevin Kenny <[email protected]>
Created:	03-Nov-2001

~ Abstract

This TIP proposes that the existing ''TclpGetTime'' function be
renamed to be ''Tcl_GetTime'' and included in the published API.

~ Rationale

The Tcl library provides a uniform abstraction, ''TclpGetTime'' that
is implemented on each of the platforms to retrieve absolute time in a
''Tcl_Time'' object.  This function is highly useful outside the Tcl
library itself, since it hides a very complex set of interfaces,
particularly on Windows, where several hundred lines of code enable
its use for high-precision measurements.  For this reason, it ought to
be made part of the public API.

~ Proposed Change

The existing ''TclpGetTime'' procedure shall be renamed to be
''Tcl_GetTime'', and its declaration shall be added to
''tcl.decls''.

| void TclpGetTime( Tcl_Time* timePtr );

A definition of ''TclpGetTime'' as a stub procedure that simply
invokes ''Tcl_GetTime'' shall be retained in ''tclInt.decls'' for
compatibility with existing Stubs-enabled extensions that invoke it.

This change requires no other change to the public headers; the
''Tcl_Time'' structure is already exported in ''tcl.h''.

~ Copyright

Copyright � 2001 by Kevin B. Kenny.  Distribution in whole or part,
with or without annotations, is unlimited.

<
|
|
|
|
|
|
<
|
|
>

|

|
|

|

|

|

|

|
|
|

|

|
|

|

|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45

# TIP 73: Export Tcl_GetTime in the Public API
	State:          Final
	Type:           Project
	Tcl-Version:    8.4
	Vote:           Done
	Post-History:   

	Author:		Kevin Kenny <[email protected]>
	Created:	03-Nov-2001
-----

# Abstract

This TIP proposes that the existing _TclpGetTime_ function be
renamed to be _Tcl\_GetTime_ and included in the published API.

# Rationale

The Tcl library provides a uniform abstraction, _TclpGetTime_ that
is implemented on each of the platforms to retrieve absolute time in a
_Tcl\_Time_ object.  This function is highly useful outside the Tcl
library itself, since it hides a very complex set of interfaces,
particularly on Windows, where several hundred lines of code enable
its use for high-precision measurements.  For this reason, it ought to
be made part of the public API.

# Proposed Change

The existing _TclpGetTime_ procedure shall be renamed to be
_Tcl\_GetTime_, and its declaration shall be added to
_tcl.decls_.

	 void TclpGetTime( Tcl_Time* timePtr );

A definition of _TclpGetTime_ as a stub procedure that simply
invokes _Tcl\_GetTime_ shall be retained in _tclInt.decls_ for
compatibility with existing Stubs-enabled extensions that invoke it.

This change requires no other change to the public headers; the
_Tcl\_Time_ structure is already exported in _tcl.h_.

# Copyright

Copyright © 2001 by Kevin B. Kenny.  Distribution in whole or part,
with or without annotations, is unlimited.

Name change from tip/74.tip to tip/74.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227

TIP:            74
Title:          wm stackorder command
Version:        $Revision: 1.6 $
Author:         Mo DeJong <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        12-Nov-2001
Post-History:   
Tcl-Version:    8.4

~ Abstract

Tk provides no means to query the stacking order of toplevel windows.
This functionality would be useful to applications that wished to save
and restore the state and relative order of each toplevel.  This
functionality would also make it possible to write test cases for window
manager related commands like focus, raise, and lower.  This document
suggests a new ''wm stackorder'' command to address this deficiency.

~ Specification

|wm stackorder window ?isabove|isbelow? ?window?

The following would return a list of all the toplevels
on the display:

|% wm stackorder .

The returned list would include the passed in window and its children.
Only those toplevel windows that are mapped would be returned.  The
stacking order is from lowest to highest, so the last element in the
list is the window on top of the display.

The ''wm stackorder'' command could also be used to compare the
relative position in the stackorder.  The following command would
return true if ''.a'' was higher in the stacking order compared to
''.b''.

|% wm stackorder .a isabove .b

The ''isbelow'' usage is analogous:

|% wm stackorder .b isbelow .a

One additional C API would be added.  It would accept a Tk window and
return an array of Tk windows in stacking order.  This function would
be implemented in the platform specific window manager code, such as
''tkUnixWm.c''.  This function signature is subject to change.

|TkWindow ** TkWmStackorderToplevel(TkWindow *parentPtr);

~ Rationale

Tk exposes a number of features related to toplevel windows through
the ''wm'' command.  While a user can set the relative position of a
toplevel in the stacking order, it is not currently possible to query
the stacking order for toplevel windows.

Users are frustrated by the lack of access to this information.  This
is a posting to news:comp.lang.tcl by Jim Ingham is typical:

 > ''This seems pretty basic, but for the life of me I can't figure
   out how to determine the stacking order of Tk toplevels.  I want to
   save away the currently open windows in my application, and I would
   like to preserve both positions ''and'' window stacking order.  I
   know how to get the positions of toplevels, but I can't figure out
   how to get the window manager's stacking order.  Should be in the
   ''wm'' commands, but nothing leaps out at me.  What am I missing?''

It is simply not logical to provide a means to manipulate the stacking
order of toplevel windows without also providing a way to query the
stacking order.  This functionality is needed, if only to help with
the authoring of test cases.  For example, one could verify that a
call to ''wm raise'' actually worked by checking to see if the
stacking order was changed.

The second form of the wm stackorder command provides an easy way to
compare the relative position of windows in the stacking order.  This
sort of boolean check is commonly needed in test cases.  One could
implement the same logic by querying the whole list, searching it
twice to find the indices, and then comparing the indices, but the
code would not be as easy to understand and it would not be as
efficient.

The ''wm stackorder'' command also has an extra benefit, it provides
an easy way to query the currently mapped toplevel windows.  It is not
difficult to write a procedure that recursively descends through each
window and filters out those windows that are not mapped toplevels.
This ''wm stackorder'' command would just make it easier to query this
list.

~ Reference Implementation

A reference implementation has been created for X windows and Win32
systems. The X version makes us of the ''XQueryTree()'' function
while the Windows version depends on the ''EnumWindows()'' Win32 API.
Both implementations query the stacking order of toplevel windows
in the root window.  The patch, test cases, and documentation changes
can be found in Tk patch 481148 at SourceForge.  Porting to MacOS
and MacOS X will require assistance from area maintainers.

~ Alternatives

Instead of adding a new ''wm stackorder'' command, one could
adjust the behavior of ''winfo children''. The
documentation currently reads:

 winfo children window:
    Returns a list containing the path names of all the children of
    window.  The list is in stacking order, with the lowest window
    first.  Top-level windows are returned as children of their
    logical parents.

A user would no doubt conclude that the stacking order was maintained
for both toplevels and contained widgets.  Unfortunately, the
implementation only tracks the stacking order for contained
widgets.

|% toplevel .t
|% pack [button .t.b1]
|% pack [button .t.b2]
|% winfo children .t
|.t.b1 .t.b2
|% raise .t.b1
|% winfo children .t
|.t.b2 .t.b1

Tk does not track stacking order changes for toplevels.

|% toplevel .t2
|% winfo children .
|.t .t2
|% raise .t
|% winfo children .
|.t .t2

There are two possible ways to "fix" the ''winfo children'' command
so that it would return toplevels in stacking order. One could call
the ''TkWmStackorderToplevel()'' function and use the results to
sort any toplevels that would be returned by ''winfo children''.
The other option would be to resort the ''TkWindow->childList''
as toplevels are moved up and down in the stacking order.

Both of these alternatives have some serious implementation issues.
The ''TkWmStackorderToplevel()'' function is very slow. The X based
implementation recurses through each window in the Tk hierarchy to
create a mapping of wrapper window ids to ''TkWindow''s. The function
then queries the X server to find each X window that is a child of
the root screen and checks to see if the window exists in the
mapping. When compared to ''winfo children'', which just loops
over an in-memory list, it is easy to see why ''wm stackorder''
is so much slower.

|% for {set i 0} {$i < 10} {incr i} {toplevel .t$i}
|% time {winfo children .} 100
|34 microseconds per iteration
|% time {wm stackorder .} 100
|394 microseconds per iteration

It would not be wise to make the ''winfo children'' command
an order of magnitude slower just to add a new capability.
One also needs to remember that the ''winfo children'' command
is often used recursively, so any slowdown would be multiplied
by the depth of the window hierarchy.

The second option would be to keep the ''TkWindow->childList''
sorted as toplevels are raised and lowered either by the
application or the window manager. This would imply binding
to the <Circulate> event under X. While the <Circulate> event
is defined in Tk, it does not seem to actually be delivered
by the window manager. In any event, this seems like an area
ripe for incompatibility and error.

Even if one of the above fixes for the ''winfo children'' command
was doable, it still would not satisfy user's needs. One would
be able to compare the relative stacking order of two toplevels
that have the same parent.

|% toplevel .t1
|% toplevel .t2
|% winfo children .
|{.t1 .t2}

Unfortunately, it would not be possible to compare the stacking
order of two toplevel windows that have different parents.

|% toplevel .t1
|% toplevel .t1.t2
|# No help here!
|% winfo children .
|.t1
|% winfo children .t1
|.t1.t2

It would not even be possible to query the position of the
. window in the stacking order since it does not have a
parent window that can be passed to ''winfo children''.

Since modifying ''winfo children'' could cause some
serious problems and would ultimately be ineffective,
this alternative was rejected. Instead, the documentation
for the ''winfo children'' command should be updated to
indicate that toplevel windows are not returned in stacking
order.

~ Risks

It is not entirely clear what risks would be associated with this TIP.
The logic of the ''wm stackorder'' command is rather insulated from
the rest of the Tk core.  Changing the implementation of
''Tk_RestackWindow()'' and keeping the ''TkWindow->childList'' up to
date w.r.t. external changes would be more risky since it could affect
other parts of the core. Doing an explicit query to find the stacking
order seems a lot less error prone when compared to monitoring events
from the window manager. We might speed things up by
also storing wrapper pointers in the map table so that a call to
''Tk_IdToWindow()'' with a wrapper id would work, but it is not
clear that would help since the ''XQueryTree()'' likely takes up
most of the function processing time.

It is also not known how difficult or costly this functionality will
be to implement on Mac OS, or Mac OS X.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|
|

|
|
|
|
|
|
|
|

|
|
|
|
|
|

|

|
|
|

|

|

|
|

|
|
|
|
|

|

|

|

|

|
|
|
|

|
|
|
|
|
|
|

|

|

|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227

# TIP 74: wm stackorder command

	Author:         Mo DeJong <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        12-Nov-2001
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

Tk provides no means to query the stacking order of toplevel windows.
This functionality would be useful to applications that wished to save
and restore the state and relative order of each toplevel.  This
functionality would also make it possible to write test cases for window
manager related commands like focus, raise, and lower.  This document
suggests a new _wm stackorder_ command to address this deficiency.

# Specification

	wm stackorder window ?isabove|isbelow? ?window?

The following would return a list of all the toplevels
on the display:

	% wm stackorder .

The returned list would include the passed in window and its children.
Only those toplevel windows that are mapped would be returned.  The
stacking order is from lowest to highest, so the last element in the
list is the window on top of the display.

The _wm stackorder_ command could also be used to compare the
relative position in the stackorder.  The following command would
return true if _.a_ was higher in the stacking order compared to
_.b_.

	% wm stackorder .a isabove .b

The _isbelow_ usage is analogous:

	% wm stackorder .b isbelow .a

One additional C API would be added.  It would accept a Tk window and
return an array of Tk windows in stacking order.  This function would
be implemented in the platform specific window manager code, such as
_tkUnixWm.c_.  This function signature is subject to change.

	TkWindow ** TkWmStackorderToplevel(TkWindow *parentPtr);

# Rationale

Tk exposes a number of features related to toplevel windows through
the _wm_ command.  While a user can set the relative position of a
toplevel in the stacking order, it is not currently possible to query
the stacking order for toplevel windows.

Users are frustrated by the lack of access to this information.  This
is a posting to news:comp.lang.tcl by Jim Ingham is typical:

 > _This seems pretty basic, but for the life of me I can't figure
   out how to determine the stacking order of Tk toplevels.  I want to
   save away the currently open windows in my application, and I would
   like to preserve both positions _and_ window stacking order.  I
   know how to get the positions of toplevels, but I can't figure out
   how to get the window manager's stacking order.  Should be in the
   _wm_ commands, but nothing leaps out at me.  What am I missing?_

It is simply not logical to provide a means to manipulate the stacking
order of toplevel windows without also providing a way to query the
stacking order.  This functionality is needed, if only to help with
the authoring of test cases.  For example, one could verify that a
call to _wm raise_ actually worked by checking to see if the
stacking order was changed.

The second form of the wm stackorder command provides an easy way to
compare the relative position of windows in the stacking order.  This
sort of boolean check is commonly needed in test cases.  One could
implement the same logic by querying the whole list, searching it
twice to find the indices, and then comparing the indices, but the
code would not be as easy to understand and it would not be as
efficient.

The _wm stackorder_ command also has an extra benefit, it provides
an easy way to query the currently mapped toplevel windows.  It is not
difficult to write a procedure that recursively descends through each
window and filters out those windows that are not mapped toplevels.
This _wm stackorder_ command would just make it easier to query this
list.

# Reference Implementation

A reference implementation has been created for X windows and Win32
systems. The X version makes us of the _XQueryTree\(\)_ function
while the Windows version depends on the _EnumWindows\(\)_ Win32 API.
Both implementations query the stacking order of toplevel windows
in the root window.  The patch, test cases, and documentation changes
can be found in Tk patch 481148 at SourceForge.  Porting to MacOS
and MacOS X will require assistance from area maintainers.

# Alternatives

Instead of adding a new _wm stackorder_ command, one could
adjust the behavior of _winfo children_. The
documentation currently reads:

 winfo children window:
    Returns a list containing the path names of all the children of
    window.  The list is in stacking order, with the lowest window
    first.  Top-level windows are returned as children of their
    logical parents.

A user would no doubt conclude that the stacking order was maintained
for both toplevels and contained widgets.  Unfortunately, the
implementation only tracks the stacking order for contained
widgets.

	% toplevel .t
	% pack [button .t.b1]
	% pack [button .t.b2]
	% winfo children .t
	.t.b1 .t.b2
	% raise .t.b1
	% winfo children .t
	.t.b2 .t.b1

Tk does not track stacking order changes for toplevels.

	% toplevel .t2
	% winfo children .
	.t .t2
	% raise .t
	% winfo children .
	.t .t2

There are two possible ways to "fix" the _winfo children_ command
so that it would return toplevels in stacking order. One could call
the _TkWmStackorderToplevel\(\)_ function and use the results to
sort any toplevels that would be returned by _winfo children_.
The other option would be to resort the _TkWindow->childList_
as toplevels are moved up and down in the stacking order.

Both of these alternatives have some serious implementation issues.
The _TkWmStackorderToplevel\(\)_ function is very slow. The X based
implementation recurses through each window in the Tk hierarchy to
create a mapping of wrapper window ids to _TkWindow_s. The function
then queries the X server to find each X window that is a child of
the root screen and checks to see if the window exists in the
mapping. When compared to _winfo children_, which just loops
over an in-memory list, it is easy to see why _wm stackorder_
is so much slower.

	% for {set i 0} {$i < 10} {incr i} {toplevel .t$i}
	% time {winfo children .} 100
	34 microseconds per iteration
	% time {wm stackorder .} 100
	394 microseconds per iteration

It would not be wise to make the _winfo children_ command
an order of magnitude slower just to add a new capability.
One also needs to remember that the _winfo children_ command
is often used recursively, so any slowdown would be multiplied
by the depth of the window hierarchy.

The second option would be to keep the _TkWindow->childList_
sorted as toplevels are raised and lowered either by the
application or the window manager. This would imply binding
to the <Circulate> event under X. While the <Circulate> event
is defined in Tk, it does not seem to actually be delivered
by the window manager. In any event, this seems like an area
ripe for incompatibility and error.

Even if one of the above fixes for the _winfo children_ command
was doable, it still would not satisfy user's needs. One would
be able to compare the relative stacking order of two toplevels
that have the same parent.

	% toplevel .t1
	% toplevel .t2
	% winfo children .
	{.t1 .t2}

Unfortunately, it would not be possible to compare the stacking
order of two toplevel windows that have different parents.

	% toplevel .t1
	% toplevel .t1.t2
	# No help here!
	% winfo children .
	.t1
	% winfo children .t1
	.t1.t2

It would not even be possible to query the position of the
. window in the stacking order since it does not have a
parent window that can be passed to _winfo children_.

Since modifying _winfo children_ could cause some
serious problems and would ultimately be ineffective,
this alternative was rejected. Instead, the documentation
for the _winfo children_ command should be updated to
indicate that toplevel windows are not returned in stacking
order.

# Risks

It is not entirely clear what risks would be associated with this TIP.
The logic of the _wm stackorder_ command is rather insulated from
the rest of the Tk core.  Changing the implementation of
_Tk\_RestackWindow\(\)_ and keeping the _TkWindow->childList_ up to
date w.r.t. external changes would be more risky since it could affect
other parts of the core. Doing an explicit query to find the stacking
order seems a lot less error prone when compared to monitoring events
from the window manager. We might speed things up by
also storing wrapper pointers in the map table so that a call to
_Tk\_IdToWindow\(\)_ with a wrapper id would work, but it is not
clear that would help since the _XQueryTree\(\)_ likely takes up
most of the function processing time.

It is also not known how difficult or costly this functionality will
be to implement on Mac OS, or Mac OS X.

# Copyright

This document has been placed in the public domain.

Name change from tip/75.tip to tip/75.md.

1
2
3
4
5
6
7
8
9
10
11
12
13
14

15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34

35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72

73
74
75
76
77

78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105

106
107
108

109
110
111

112
113
114
115

116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138

TIP:            75
Title:          Refer to Sub-RegExps Inside 'switch -regexp' Bodies
Version:        $Revision: 1.14 $
Author:         Donal K. Fellows <[email protected]>
Author:         J�nos Hol�nyi <[email protected]>
Author:         Salvatore Sanfilippo <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        28-Nov-2001
Post-History:   
Discussions-To: http://purl.org/mini/cgi-bin/chat.cgi
Keywords:       switch,regexp,parentheses
Tcl-Version:    8.5

~ Abstract

Currently, it is necessary to match a regular expression against a
string twice in order to get the sub-expressions out of the matched
string.  This TIP alters that so that those sub-exps can be
substituted directly into the body of the script to be executed.

~Rationale

Similarly to the

|   regexp -- <RE> $string matchvar submatchvar ...

of Tcl and the

|   interact -re <RE> {
|      set matches "$interact_out(0,string) $interact_out(1,string) ..."
|   }

of Tcl/Expect, it would be very helpful and would also make Tcl more
consistent if the [[switch]] command of Tcl would support references
to parenthesized REs inside the switch patterns from the bodies
associated to each of the patterns.  As it is, it is currently
necessary to match the regular expression against the string twice to
obtain this information.

~Specification

The easiest way to get the information is to place it into a variable.
All that remains is a way to specify which variable should receive the
information.  This is done by a new option to the [[switch]] command:
''-matchvar''.  The argument to this optiongives the name of a
variable in which will be placed a Tcl list of the matches discovered
by the RE engine, such that the part of the string that was matched is
given by [[lindex $var 0]], the first parenthesis by [[lindex $var
1]], etc.  The alternative to this is to use the name of an array, but
this is more expensive.

The indices which the match occurred at can also be sometimes useful.
Therefore, the new option ''-indexvar'' will also be provided which
will name a variable into which a list of match indices (each a two
item list of values in the same way that [[regexp -indices]] computes)
will be placed.  It will be legal for both -matchvar and -indexvar to
be specified in the same [[switch]] command, but only if the matching
mode is -regexp.  (The other kinds of match modes always match against
the whole string anyway.)

Both variables (if specified, of course) will contain the empty list
if the ''default'' branch is taken.

~Example

|set string "some long complicated message"
|switch -matchvar foo -indexvar bar -regexp -- $string {
|   {\w*(e)\w*} {
|      puts "matched [lindex $foo 0] with 'e' at [lindex $bar 1 0]"
|   }

|   default {
|      puts "no words containing a letter 'e' at all"
|   }
|}

~Alternatives

Actually, no new syntax is needed to achieve the mentioned ability.
The solution could adopt the behavior of [[regsub]] ''(description
taken from regsub(n))'':

 > If subSpec contains a `&' or `\0', then it is replaced in the
   substitution with the portion of string that matched exp.  If
   subSpec contains a `\''n''', where ''n'' is a digit between 1 and
   9, then it is replaced in the substitution with the portion of
   string that matched the ''n''-th parenthesized subexpression of
   exp.  Additional backslashes may be used in subSpec to prevent
   special interpretation of `&' or `\0' or `\n' or backslash.

This has the disadvantage of being incompatible with existing code
that makes use of the -regexp option to [[switch]] and which may well
have characters matching the above sequences inside already.

Another alternative can be to specify either -submatches, or -subindexes and
use three elements for every switch case. The first is the regexp,
the second the list of vars like in the [regexp] command, and the
last the script to execute.

|set string [getSomeComplexProtocolLine]
|switch -regexp -submatches -- $string {
|    {EHLO (.*)} {match heloarg} {
|       puts "Helo $heloarg"
|    }

|    {MAIL FROM: <(.*)@(.*)>} {match user host} {
|       puts "Mail from $user at $host"
|    }

|    {QUIT} {} {
|       exit
|    }

|    default {} {
|       puts "What a strange SMTP command!"
|    }
|}  

Usually submatches have quite logical names, so it is possible
that to refer they by name instead of to use [lindex] can be
more comfortable. Another minor advantage of this is that variable
names are very near the script, so it shouldn't be hard to follow
what the script is doing.

On the other side this changes a well-known fact of switch getting
as input two elements for every case; the main proposal of this TIP
has the advantage of leaving that feature of the [[switch]] command as
an invariant.  This makes the overall implementation of the feature
easier, and also makes it easier to tell people how to use.  And it
allows for trivial obtaining of both the matched string and the range
of the input string that matched.  Of course, in that case you could
just have four values for each entry, but that is getting baroque.

~Reference Implementation

http://sf.net/tracker/?func=detail&aid=848578&group_id=10894&atid=310894

~Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|
|
<
|
>

|

|

|
|

|
|

|
|
|

|
|
|

|
|

|

|
|
|
|
<
>
|
|
<
<
|
>
>
|

|
|

|

|

|

|

|

|
|
|
|
<
>
|
|
<
>
|
|
<
>
|
|
<
<
>
>

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31

32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70

71
72
73

74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103

104
105
106

107
108
109

110
111
112

113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138

# TIP 75: Refer to Sub-RegExps Inside 'switch -regexp' Bodies

	Author:         Donal K. Fellows <[email protected]>
	Author:         János Holányi <[email protected]>
	Author:         Salvatore Sanfilippo <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        28-Nov-2001
	Post-History:   
	Discussions-To: http://purl.org/mini/cgi-bin/chat.cgi
	Keywords:       switch,regexp,parentheses
	Tcl-Version:    8.5
-----

# Abstract

Currently, it is necessary to match a regular expression against a
string twice in order to get the sub-expressions out of the matched
string.  This TIP alters that so that those sub-exps can be
substituted directly into the body of the script to be executed.

# Rationale

Similarly to the

	   regexp -- <RE> $string matchvar submatchvar ...

of Tcl and the

	   interact -re <RE> {
	      set matches "$interact_out(0,string) $interact_out(1,string) ..."

	   }

of Tcl/Expect, it would be very helpful and would also make Tcl more
consistent if the [switch] command of Tcl would support references
to parenthesized REs inside the switch patterns from the bodies
associated to each of the patterns.  As it is, it is currently
necessary to match the regular expression against the string twice to
obtain this information.

# Specification

The easiest way to get the information is to place it into a variable.
All that remains is a way to specify which variable should receive the
information.  This is done by a new option to the [switch] command:
_-matchvar_.  The argument to this optiongives the name of a
variable in which will be placed a Tcl list of the matches discovered
by the RE engine, such that the part of the string that was matched is
given by [lindex $var 0], the first parenthesis by [lindex $var
1], etc.  The alternative to this is to use the name of an array, but
this is more expensive.

The indices which the match occurred at can also be sometimes useful.
Therefore, the new option _-indexvar_ will also be provided which
will name a variable into which a list of match indices \(each a two
item list of values in the same way that [regexp -indices] computes\)
will be placed.  It will be legal for both -matchvar and -indexvar to
be specified in the same [switch] command, but only if the matching
mode is -regexp.  \(The other kinds of match modes always match against
the whole string anyway.\)

Both variables \(if specified, of course\) will contain the empty list
if the _default_ branch is taken.

# Example

	set string "some long complicated message"
	switch -matchvar foo -indexvar bar -regexp -- $string {
	   {\w*(e)\w*} {
	      puts "matched [lindex $foo 0] with 'e' at [lindex $bar 1 0]"

	   }
	   default {
	      puts "no words containing a letter 'e' at all"

	   }
	}

# Alternatives

Actually, no new syntax is needed to achieve the mentioned ability.
The solution could adopt the behavior of [regsub] _\(description
taken from regsub\(n\)\)_:

 > If subSpec contains a \`&' or \`\\0', then it is replaced in the
   substitution with the portion of string that matched exp.  If
   subSpec contains a \`\\_n**, where _n_ is a digit between 1 and
   9, then it is replaced in the substitution with the portion of
   string that matched the _n_-th parenthesized subexpression of
   exp.  Additional backslashes may be used in subSpec to prevent
   special interpretation of \`&' or \`\\0' or \`\\n' or backslash.

This has the disadvantage of being incompatible with existing code
that makes use of the -regexp option to [switch] and which may well
have characters matching the above sequences inside already.

Another alternative can be to specify either -submatches, or -subindexes and
use three elements for every switch case. The first is the regexp,
the second the list of vars like in the [regexp] command, and the
last the script to execute.

	set string [getSomeComplexProtocolLine]
	switch -regexp -submatches -- $string {
	    {EHLO (.*)} {match heloarg} {
	       puts "Helo $heloarg"

	    }
	    {MAIL FROM: <(.*)@(.*)>} {match user host} {
	       puts "Mail from $user at $host"

	    }
	    {QUIT} {} {
	       exit

	    }
	    default {} {
	       puts "What a strange SMTP command!"

	    }
	}  

Usually submatches have quite logical names, so it is possible
that to refer they by name instead of to use [lindex] can be
more comfortable. Another minor advantage of this is that variable
names are very near the script, so it shouldn't be hard to follow
what the script is doing.

On the other side this changes a well-known fact of switch getting
as input two elements for every case; the main proposal of this TIP
has the advantage of leaving that feature of the [switch] command as
an invariant.  This makes the overall implementation of the feature
easier, and also makes it easier to tell people how to use.  And it
allows for trivial obtaining of both the matched string and the range
of the input string that matched.  Of course, in that case you could
just have four values for each entry, but that is getting baroque.

# Reference Implementation

<http://sf.net/tracker/?func=detail&aid=848578&group\_id=10894&atid=310894>

# Copyright

This document has been placed in the public domain.

Name change from tip/76.tip to tip/76.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59

60
61
62
63
64
65
66
67

68
69
70
71
72
73
74
75
76
77
78
79
80
81
82

83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98

99
100
101
102
103
104
105

106
107
108
109
110
111

TIP:		76
Title:		Make 'regsub' Return a String
State:		Final
Type:		Project
Tcl-Version:	8.4
Vote:		Done
Post-History:	
Version:	$Revision: 1.3 $
Author:		Bruce Hartweg <[email protected]>
Author:		Donal K. Fellows <[email protected]>
Created:	29-Nov-2001

~ Abstract

This TIP proposes altering the [[regsub]] command so that it can
return the substituted string as the result of the command.

~ Rationale

In many of the most common uses of the [[regsub]] command, the
substituted string is used only once in the immediately following
command.  However, the [[regsub]] command only provides the
substituted string via a variable, with the result of the command
itself being the number of substitutions performed.  For many uses of
the command, it is the substituted string though that is the most
useful result, especially if some other transformation is going to be
applied to it (like further [[regsub]] commands or some other Tcl
command like one of the [[string]] subcommands or [[subst]].)  This
TIP proposes a mechanism for providing the ability to return the
string as the command's result, and in a way that is
backward-compatible with existing scripts.

~ Specification

|   regsub ?switches? exp string subSpec ?varName?

If ''varName'' is supplied the new string is written there and the
number of substitutions are returned (same as current behavior).  If
''varName'' is not supplied than the new string is returned as the
result of the [[regsub]] command.

~ Reference Implementation

This is a pretty easy change, although I do not currently have an
environment where I can actually build and test this the following
should create the desired behavior.

File: ''tcl/generic/tclCmdMZ.c''

Function: ''Tcl_RegsubObjCmd''

Currently (v 1.52):

|    if (objc - idx != 4) {
|	 Tcl_WrongNumArgs(interp, 1, objv,
|		 "?switches? exp string subSpec varName");
|	 return TCL_ERROR;
|    }

which should be changed to:

|    objc -= idx;
|    if (objc != 3 || objc != 4) {
|	 Tcl_WrongNumArgs(interp, 1, objv,
|		 "?switches? exp string subSpec ?varName?");
|	 return TCL_ERROR;
|    }

and then at the end change this:

|    if (Tcl_ObjSetVar2(interp, objv[3], NULL, resultPtr, 0) == NULL) {
|	 Tcl_AppendResult(interp, "couldn't set variable \"",
|		 Tcl_GetString(objv[3]), "\"", (char *) NULL);
|	 result = TCL_ERROR;
|    } else {
|	 /*
|	  * Set the interpreter's object result to an integer object
|	  * holding the number of matches.
|	  */
|
|	 Tcl_SetIntObj(Tcl_GetObjResult(interp), numMatches);
|    }

to this:

|    if (objc == 4) {
|        if (Tcl_ObjSetVar2(interp, objv[3], NULL, resultPtr, 0) == NULL) {
|            Tcl_AppendResult(interp, "couldn't set variable \"",
|        	     Tcl_GetString(objv[3]), "\"", (char *) NULL);
|            result = TCL_ERROR;
|        } else {
|            /*
|             * Set the interpreter's object result to an integer object
|             * holding the number of matches.
|             */
|
|            Tcl_SetIntObj(Tcl_GetObjResult(interp), numMatches);
|        }

|    } else {
|           /*
|            * No varname supplied, return string as result
|            */
|           Tcl_SetObjResult(interp, resultPtr);
|    }

And then minor updates to the man page to show that ''varName'' is
optional.

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
|
>

|

|

|

|

|

|
|

|

|

|
|
|
|

|

|

|

|

|
|
|
|
<
|
>

|
|
|
|
|
<
>

|
|
|
|
|
|
|
|
|
|
|
<
>

|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
<
|
>
|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56

57
58
59
60
61
62
63
64
65

66
67
68
69
70
71
72
73
74
75
76
77
78
79
80

81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96

97
98
99
100
101
102

103
104
105
106
107
108
109
110
111

# TIP 76: Make 'regsub' Return a String
	State:		Final
	Type:		Project
	Tcl-Version:	8.4
	Vote:		Done
	Post-History:	

	Author:		Bruce Hartweg <[email protected]>
	Author:		Donal K. Fellows <[email protected]>
	Created:	29-Nov-2001
-----

# Abstract

This TIP proposes altering the [regsub] command so that it can
return the substituted string as the result of the command.

# Rationale

In many of the most common uses of the [regsub] command, the
substituted string is used only once in the immediately following
command.  However, the [regsub] command only provides the
substituted string via a variable, with the result of the command
itself being the number of substitutions performed.  For many uses of
the command, it is the substituted string though that is the most
useful result, especially if some other transformation is going to be
applied to it \(like further [regsub] commands or some other Tcl
command like one of the [string] subcommands or [subst].\)  This
TIP proposes a mechanism for providing the ability to return the
string as the command's result, and in a way that is
backward-compatible with existing scripts.

# Specification

	   regsub ?switches? exp string subSpec ?varName?

If _varName_ is supplied the new string is written there and the
number of substitutions are returned \(same as current behavior\).  If
_varName_ is not supplied than the new string is returned as the
result of the [regsub] command.

# Reference Implementation

This is a pretty easy change, although I do not currently have an
environment where I can actually build and test this the following
should create the desired behavior.

File: _tcl/generic/tclCmdMZ.c_

Function: _Tcl\_RegsubObjCmd_

Currently \(v 1.52\):

	    if (objc - idx != 4) {
		 Tcl_WrongNumArgs(interp, 1, objv,
			 "?switches? exp string subSpec varName");
		 return TCL_ERROR;

	    }

which should be changed to:

	    objc -= idx;
	    if (objc != 3 || objc != 4) {
		 Tcl_WrongNumArgs(interp, 1, objv,
			 "?switches? exp string subSpec ?varName?");
		 return TCL_ERROR;

	    }

and then at the end change this:

	    if (Tcl_ObjSetVar2(interp, objv[3], NULL, resultPtr, 0) == NULL) {
		 Tcl_AppendResult(interp, "couldn't set variable \"",
			 Tcl_GetString(objv[3]), "\"", (char *) NULL);
		 result = TCL_ERROR;
	    } else {
		 /*
		  * Set the interpreter's object result to an integer object
		  * holding the number of matches.
		  */

		 Tcl_SetIntObj(Tcl_GetObjResult(interp), numMatches);

	    }

to this:

	    if (objc == 4) {
	        if (Tcl_ObjSetVar2(interp, objv[3], NULL, resultPtr, 0) == NULL) {
	            Tcl_AppendResult(interp, "couldn't set variable \"",
	        	     Tcl_GetString(objv[3]), "\"", (char *) NULL);
	            result = TCL_ERROR;
	        } else {
	            /*
	             * Set the interpreter's object result to an integer object
	             * holding the number of matches.
	             */

	            Tcl_SetIntObj(Tcl_GetObjResult(interp), numMatches);

	        }
	    } else {
	           /*
	            * No varname supplied, return string as result
	            */
	           Tcl_SetObjResult(interp, resultPtr);

	    }

And then minor updates to the man page to show that _varName_ is
optional.

# Copyright

This document has been placed in the public domain.

Name change from tip/77.tip to tip/77.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35

36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52

53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120

121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221

TIP:		77
Title:		Support for Nested Paired Item Lists
Version:	$Revision: 1.3 $
Author:		Christian Williams <[email protected]>
State:		Withdrawn
Type:		Project
Tcl-Version:	8.5
Vote:		Pending
Created:	07-Dec-2001
Obsoleted-By:	111
Post-History:	

~ Abstract

Tcl arrays can be transformed to and from lists using the ''array
get'' and ''array set'' commands.  This TIP proposes a new command for
working directly these paired lists, and extending them to allow
nesting in a manner analogous to [22].

~ Rationale

Tcl lists provide only ordinal access to their items; often it makes
more sense to access items by pre-assigned descriptive names.  This
can be easily accomplished with Tcl arrays.  Consider these
alternatives:

|  set urlList { http tcl.activestate.com 80 /index.html }
|
|  array set urlArray {
|       proto   http
|       host    tcl.activestate.com
|       port    80
|       uri     /index.html
|  }

Clearly the array approach promotes more readable code
(''$urlArray(host)'' versus ''[lindex $urlList 1]'').

However, it's quite unwieldy and sometimes expensive to use arrays to
access members of many sets of structured data, particularly when that
data contains nested structures.

Consider this structured data:

|  set data {
|       text    {ignored-data}
|       valid-styles {
|               justification {left centered right full}
|               font          {courier helvetica times}
|       }
|  }

Extracting items from structures like this can be accomplished by
multiple ''array set'' commands:

|  array set dataArray $data
|  array set validStylesArray $dataArray(valid-styles)
|  puts "Justification: $validStylesArray(justification)"

To modify an item in ''struct'', we need some pretty ugly code:

|  array set dataArray $struct
|  array set validStylesArray $dataArray(valid-styles)
|  set validStylesArray(justification) {left}
|  set dataArray(valid-styles) [array get validStylesArray]
|  set data [array get dataArray]

Clearly, all this setting and getting of arrays imposes a rather high
overhead; many variables are created and moved around.  Also, if this
is occurring in a loop, then care must be taken to unset the
''dataArray'' and ''validStylesArray'' arrays first.

In contrast, a C programmer may expect that code to look more like
this:

|  data->valid-styles->justification = 'left';

Extending Tcl with a command supporting nested, paired item lists
would permit very efficient and readable handling of these useful data
structures.

~ Specification

Under this proposal, a new command named ''pair'' (referring to the
pairs of name/value list items it works with) would be added to the
Tcl core.

A well-formed paired list is defined as a well-formed Tcl list whose
length is evenly divisible by two.  In each pair of list items, the
first item gives the name of the pair, and the second gives the value.
Paired lists may be nested by placing a valid paired list in the
second (value) item of any pair.  Note that the pairs are not grouped
together into a two-item list as in TclX's keyed lists.  Tcl's ''array
get'' command returns a well-formed paired list.

The syntax for the new ''pair'' command would be:

|  pair option variable node ?newValue?

Valid values for the ''options'' argument include ''get'', ''set'',
''unset'', ''exists'', and ''append''.  These subcommands are
equivalent to the existing Tcl commands of the same names.

The ''variable'' argument is the name of a Tcl variable; it is always
referred to by name, not by its value (that is, no ''$'').  Generally,
the variable would contain a well-formed, and optionally nested,
paired list.

The ''node'' argument is a well-formed Tcl list of zero or more items
specifying the route to the item we're interested in.

For example:

|  set data {
|       text    {ignored-data}
|       valid-styles {
|               justification {left centered right full}
|               font          {courier helvetica times}
|       }
|  }

|
|  puts "Justification: [pair get data {valid-styles justification}]"

displays "Justification: left centered right full".

If the ''data'' argument contains zero items, then the "root" node of
the variable is targeted -- that is, the entire variable:

|  pair set node {} new-value
|  puts $node

displays "new-value".

If a non-existent node is targeted using the ''get'' or ''unset''
options, an error is returned:

|  unset x
|  pair get x {first second third}
|  -> no such value

If a non-existent node is targeted using the ''set'' or ''append''
options, the node, and any parent nodes, are created.

|  unset x
|  pair set x {first second third} value
|  puts $x

displays "first {second {third value}}"

The ''exists'' option mimics Tcl's ''info exists'' command:

|  set x {name value}
|  pair exists x name
|  -> 1
|  pair exists x name2
|  -> 0

The ''set'' and ''append'' options return the value of the node that
has just been set, not the value of the variable.  This would seem to
be more in keeping with the intent of Tcl's ''set'' and ''append''
commands' return values than duplicating the exact behaviour:

|  puts [pair set x {first second third} value]

displays "value".

An error is returned if a variable is passed to the ''pair'' command
which doesn't contain a well-formed paired Tcl list at any point on
the way to the node specified by the ''node'' argument:

|  set x {name value thirdarg}
|  pair get x name
|  -> list must have an even number of elements

If there are traces registered on the variable passed to the ''pair''
command, they are triggered in the same manner as Tcl's ''set'' and
''append'' commands.  Note that the ''append'' option triggers only
write triggers, not read triggers.

Note that the ''set'' and ''append'' options both return the value of
the node specified, and the ''newValue'' argument is optional in both
cases, making the ''get'' option redundant.  The ''get'' command is
included to improve readability.

If the variable passed to ''pair'' doesn't exist, it will be created
if the option is 'set' or ''append''; the ''exists'' option will
always return a ''0''; the ''get'' option will return an error.

If a paired list contains multiple pairs with identical names, the
pair occurring later in the list is targeted.  This is specified to
mimic the behaviour of ''array set'':

|  set x "name value1 name value2"
|  pair get x name
|  -> value2
|
|  array set arrX $x
|  set arrX(name)
|  -> value2

~ Reference Implementation

http://sf.net/tracker/?func=detail&aid=491070&group_id=10894&atid=310894

There should be a public C API for working with nested paired lists.
The supplied reference code currently does not provide this.

~ Notes

It would be nice to mimic Tcl 8.4's new ''unset -nocomplain''
behaviour.

~ Side Effects

Whether the result of the pair operation is successful, the underlying
Tcl_Obj that represents the list argument may have its internal
representation invalidated or changed to that of a list.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|

|
|
|
|
|
|
|
<
|
>

|

|
|
|
|
|
<
<
|
>
>

|

|
|
|

|

|
|
|
|
|

|

|

|

|
|

|
|
|

|

|

|
|

|
|

|

|
|
|
|
|
<
<
>
>
|
|

|

|
|

|

|
|
|

|

|
|
|

|

|

|
|
|
|
|

|

|

|

|

|

|
|
|

|
|
|

|
|
|

|
|
|

|

|
|
|
|
|
|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32

33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48

49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117

118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221

# TIP 77: Support for Nested Paired Item Lists

	Author:		Christian Williams <[email protected]>
	State:		Withdrawn
	Type:		Project
	Tcl-Version:	8.5
	Vote:		Pending
	Created:	07-Dec-2001
	Obsoleted-By:	111
	Post-History:	
-----

# Abstract

Tcl arrays can be transformed to and from lists using the _array
get_ and _array set_ commands.  This TIP proposes a new command for
working directly these paired lists, and extending them to allow
nesting in a manner analogous to [[22]](22.md).

# Rationale

Tcl lists provide only ordinal access to their items; often it makes
more sense to access items by pre-assigned descriptive names.  This
can be easily accomplished with Tcl arrays.  Consider these
alternatives:

	  set urlList { http tcl.activestate.com 80 /index.html }

	  array set urlArray {
	       proto   http
	       host    tcl.activestate.com
	       port    80
	       uri     /index.html

	  }

Clearly the array approach promotes more readable code
\(_$urlArray\(host\)_ versus _[lindex $urlList 1]_\).

However, it's quite unwieldy and sometimes expensive to use arrays to
access members of many sets of structured data, particularly when that
data contains nested structures.

Consider this structured data:

	  set data {
	       text    {ignored-data}
	       valid-styles {
	               justification {left centered right full}
	               font          {courier helvetica times}

	       }
	  }

Extracting items from structures like this can be accomplished by
multiple _array set_ commands:

	  array set dataArray $data
	  array set validStylesArray $dataArray(valid-styles)
	  puts "Justification: $validStylesArray(justification)"

To modify an item in _struct_, we need some pretty ugly code:

	  array set dataArray $struct
	  array set validStylesArray $dataArray(valid-styles)
	  set validStylesArray(justification) {left}
	  set dataArray(valid-styles) [array get validStylesArray]
	  set data [array get dataArray]

Clearly, all this setting and getting of arrays imposes a rather high
overhead; many variables are created and moved around.  Also, if this
is occurring in a loop, then care must be taken to unset the
_dataArray_ and _validStylesArray_ arrays first.

In contrast, a C programmer may expect that code to look more like
this:

	  data->valid-styles->justification = 'left';

Extending Tcl with a command supporting nested, paired item lists
would permit very efficient and readable handling of these useful data
structures.

# Specification

Under this proposal, a new command named _pair_ \(referring to the
pairs of name/value list items it works with\) would be added to the
Tcl core.

A well-formed paired list is defined as a well-formed Tcl list whose
length is evenly divisible by two.  In each pair of list items, the
first item gives the name of the pair, and the second gives the value.
Paired lists may be nested by placing a valid paired list in the
second \(value\) item of any pair.  Note that the pairs are not grouped
together into a two-item list as in TclX's keyed lists.  Tcl's _array
get_ command returns a well-formed paired list.

The syntax for the new _pair_ command would be:

	  pair option variable node ?newValue?

Valid values for the _options_ argument include _get_, _set_,
_unset_, _exists_, and _append_.  These subcommands are
equivalent to the existing Tcl commands of the same names.

The _variable_ argument is the name of a Tcl variable; it is always
referred to by name, not by its value \(that is, no _$_\).  Generally,
the variable would contain a well-formed, and optionally nested,
paired list.

The _node_ argument is a well-formed Tcl list of zero or more items
specifying the route to the item we're interested in.

For example:

	  set data {
	       text    {ignored-data}
	       valid-styles {
	               justification {left centered right full}
	               font          {courier helvetica times}

	       }
	  }

	  puts "Justification: [pair get data {valid-styles justification}]"

displays "Justification: left centered right full".

If the _data_ argument contains zero items, then the "root" node of
the variable is targeted -- that is, the entire variable:

	  pair set node {} new-value
	  puts $node

displays "new-value".

If a non-existent node is targeted using the _get_ or _unset_
options, an error is returned:

	  unset x
	  pair get x {first second third}
	  -> no such value

If a non-existent node is targeted using the _set_ or _append_
options, the node, and any parent nodes, are created.

	  unset x
	  pair set x {first second third} value
	  puts $x

displays "first \{second \{third value\}\}"

The _exists_ option mimics Tcl's _info exists_ command:

	  set x {name value}
	  pair exists x name
	  -> 1
	  pair exists x name2
	  -> 0

The _set_ and _append_ options return the value of the node that
has just been set, not the value of the variable.  This would seem to
be more in keeping with the intent of Tcl's _set_ and _append_
commands' return values than duplicating the exact behaviour:

	  puts [pair set x {first second third} value]

displays "value".

An error is returned if a variable is passed to the _pair_ command
which doesn't contain a well-formed paired Tcl list at any point on
the way to the node specified by the _node_ argument:

	  set x {name value thirdarg}
	  pair get x name
	  -> list must have an even number of elements

If there are traces registered on the variable passed to the _pair_
command, they are triggered in the same manner as Tcl's _set_ and
_append_ commands.  Note that the _append_ option triggers only
write triggers, not read triggers.

Note that the _set_ and _append_ options both return the value of
the node specified, and the _newValue_ argument is optional in both
cases, making the _get_ option redundant.  The _get_ command is
included to improve readability.

If the variable passed to _pair_ doesn't exist, it will be created
if the option is 'set' or _append_; the _exists_ option will
always return a _0_; the _get_ option will return an error.

If a paired list contains multiple pairs with identical names, the
pair occurring later in the list is targeted.  This is specified to
mimic the behaviour of _array set_:

	  set x "name value1 name value2"
	  pair get x name
	  -> value2

	  array set arrX $x
	  set arrX(name)
	  -> value2

# Reference Implementation

<http://sf.net/tracker/?func=detail&aid=491070&group\_id=10894&atid=310894>

There should be a public C API for working with nested paired lists.
The supplied reference code currently does not provide this.

# Notes

It would be nice to mimic Tcl 8.4's new _unset -nocomplain_
behaviour.

# Side Effects

Whether the result of the pair operation is successful, the underlying
Tcl\_Obj that represents the list argument may have its internal
representation invalidated or changed to that of a list.

# Copyright

This document has been placed in the public domain.

Name change from tip/78.tip to tip/78.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484

TIP:            78
Title:          TEA 2.0 Definitions
Version:        $Revision: 1.4 $
Author:         Andreas Kupries <[email protected]>
Author:         Larry W. Virden <[email protected]>
State:          Draft
Type:           Informative
Vote:           Pending
Created:        15-Dec-2001
Post-History:   

~ Abstract

This document is an informational TIP providing definitions for
commonly used terms (like package, extension, core, distribution,
etc.) to make future communication among people in the community
easier.  It is recommended that future and past documents specifying
details inside of the greater context of TEA refer to this document to
ensure a consistent usage of terms.

~ Background

This document is an adjunct to the [[TIP <<vision>>]].  ''(DKF - Is
this meant to be a reference to [34]?  Edit to such if so...)''

To facilitate the specification and adoption of clearly defined
interfaces to various activities and technologies it specifies a
number of terms used by the community at large to create a greater
unity and coherence in the usage of these terms.  In other words, by
creating generally accepted definitions for important terms the risk
of misunderstanding each other is reduced.

~ Specification of Technical Terms

This section specifies a number of important technical terms in
alphabetical order.  Terms used inside of a specification are
highlighted.

 * Application

 > An entity implementing some functionality required by a ''P/A
   User'' (see section ''Roles'') to perform his work.  Consists of
   one more files.  May make use of ''packages'' to implement its
   functionality.

 * Archive

 > Encapsulation of a ''distribution'' in a single file.  Well-known
   formats for archives are tar, gzipped tar, bzipped tar, and zip.

 * Binary generation

 > The process of wrapping up a built package into a binary
   ''distribution''.  See ''building'' too.

 * Building

 > The process of ''configuring'' and ''transforming'' a source
   ''distribution'' into a set of files which can be either
   immediately installed on the site which built them or wrapped into
   a binary ''distribution'' for shipment to other sites.  This also
   includes the execution of a test-suite coming with the package.

 * Bundle

 > A ''distribution'' encapsulating more than one ''package''.

 * Catalog

 > A catalog is a site providing an index of the packages.

 * Configuration

 > The process of customizing a source distribution for a particular
   architecture and/or site.  A part of ''Building''.

 * Conflict

 > Two or more packages are said to be in conflict if the usage of one of them
   excludes the usage of the others.

 > In a ''strong conflict'' installing one of the packages disallows
   even the installation of the others.

 * Core

 > A shorthand for ''Tcl core''.

 * Core distribution.

 > See ''Tcl core distribution''.

 * Dependency

 > A relationship between packages.  A package X is said dependent on
   another package Y if said package Y is required to allow X to work.

 > There are several types of dependencies:

 > * Build-time dependency

 > > Y is required to allow X to be build.

 > * Run-time dependency

 > > Y is required to allow the usage of an installed X.

 > * Optional dependency

 > > Y can be used by X (usually for improved performance) but is not
     required for building or use.

 * Distribution

 > An encapsulation of one or more ''packages'' for transport between
   places, machines, organizations, and people.  Several types of
   distributions are possible, explained below.

 > 1. binary

 > > A binary distribution is in a state and format allowing the
     installation of the contained packages on a particular platform.

 > > It contains at least all the files

 > > * implementing the functionality of the distributed packages and

 > > * required to allow the package management of the Tcl core to
       handle the packages.

 > > A binary distribution usually contains just the files for a
     single architecture.  This is not a explicit requirement however.
     In other words, a binary distribution is allowed to contain the
     files for several architectures.  Such a binary distribution will
     be called a ''fat binary distribution''.

 > > We distinguish between three sub-types:

 > > * installable

 > > > This is the minimal binary distribution we spoke of before.

 > > * auto-installable

 > > > Like ''installable''', but additionally contains an application
       whose execution will perform all the steps which are necessary
       to install the contained packages at a site.  The installation
       to add the packages to (and thus the location of the installed
       packages) can be freely chosen by the caller of the executable.

 > > * auto-image

 > > > Like ''auto-installable'', but the final location of the
       packages is hard-wired into the distribution.  This means that
       a site using auto-images is restricted to one installation per
       architecture for which files are contained in the binary
       distribution.

 > > > Some of the existing archive formats restrict their contained
       distributions to this type.  Examples of such archive formats
       are

 > > > * RPM (RedHat Package Manager)

 > > > * DEB (DEBian Linux package format)

 > > See [TIP 55] for the specification of a proposed layout for
     binary distributions.

 > 1. bundle

 > > A distribution which either contains the distributions of more
     than one package or a list of references to packages.  The
     references are done by specifying the name and version of the
     packages contained in the bundle.

 > 1. buildable

 > > See ''source''.

 > 1. compileable

 > > See ''source''.

 > 1. raw

 > > A raw distribution is the most fundamental distribution.  Its
     format is nearly completely unspecified.  Its contents are
     straight from a source repository.  The process of converting a
     raw distribution into a source distribution is called
     ''Preparation''.

 > > Because of the unformatted nature of a raw distribution the
     commands for its conversion into a source distribution have to be
     part of it.  This is the only part of a raw distribution which
     can and has to be specified.

 > > Example: The execution of ''autoconf'' to generate a
     ''configure'' script from its ''configure.in'' file can be a
     single step in a complex preparation.

 > 1. source

 > > A source distribution is in a format and state where tools can be
     used to build its contents.

 > > The format needs further specification but for now we can assume
     that it is governed by the current TEA specification.

 > > Alternate name for this type of distribution are ''compileable''
     and ''buildable''.

 * Distribution repository

 > See ''repository''

 * Extension

 > Alternate name for a ''package'', generally used for packages
   requiring compilation to become functional.

 > This term is ''deprecated''.

 * Installation

 > A special type of ''distribution repository, binary'' (see
   ''repository'') containing all ''packages'' which were installed on
   a site.  A site may host several installations differing in version
   and/or configuration of Tcl, and/or the platform Tcl was built for,
   etc.

 > Currently difficult to do but in the future it should made be
   possible for an installation to refer and use another installation,
   provided both are configured identically.  This allows a site to
   build a hierarchy of installations from the most general containing
   the common packages down to installations associated with one or
   more ''P/A Developers''.

 * Installing

 > The process of unpacking a binary distribution and adding the
   contained packages to an ''installation''.  The latter may include
   moving files to their proper places.

 > Also the process of adding a built package to an ''installation''.

 * Manifest

 > A file detailing the files making up a particular package.

 * Package

 > A collection of files providing additional functionality to a user
   of a Tcl interpreter when loaded into said interpreter.

 > Some files in a package implement the provided functionality
   whereas other files contain meta-information required by the
   package management of Tcl to be able to use the package.

 * Preparation

 > The process of converting a raw ''distribution'' into a source
   ''distribution'' suitable as input to ''building''.  This includes
   actions like:

 > * Retrieval of sources from a source repository

 > * Creating a ''configure'' file from its ''configure.in''.

 > * Creating the distributed documentation from internal sources.

 > * Removal of internal files containing notes, scratch info and the like.

 > * Inserting version information into files.

 > * ...

 > The tool ''makedist'' (which I wrote) is in my mind when thinking
   about this step.

 * Raw retrieval

 > The process of retrieving a raw ''distribution'' from a ''source
   repository''.

 * Repository

 > General term with two possible meanings.

 > 1. A collection of ''archives''.  The exact term for this type of
     repository is "distribution repository".

 > > If a distribution repository is restricted to one type of
     distributions this type can be added to the term as further
     specification of the type of repository.  Thus

 > > * ''distribution repository, binary'', or

 > > * ''distribution repository, source''.

 > 2. A collection of directories, developer files and control files
      containing version control information.  The exact term for this
      type of repository is ''source repository''.

 > > A repository can either be internal to an organization or public.

 * Source repository

 > See ''repository''

 * Tcl core

 > The most fundamental part in the Tcl world; the interpreter engine
   and the builtin commands coming with it.  These are all commands
   and procedures reported by ''info commands'' for a ''tclsh'' which
   was started with an empty ''.tclshrc'' file and after sourcing all
   ''.tcl'' files in the directory returned by ''info library''.

 * Tcl core distribution

 > The most fundamental distribution there is.  Contains the ''Tcl
   core'' and a number of packages.

~ Roles

The terms in the preceding section specified both passive data
structures and actions upon them.  This section specifies the possible
actors, i.e. entities which perform one or more of these actions.

To make the specification easier, related actions are grouped into
roles of behavior.  The mapping from roles to actual actors,
i.e. people and organizations is n:m.  On other words, one actor may
have several roles, either at once or changing over time and one role
can be held by several distinct actors.

Examples are given at the end of this section.

 1. Catalog manager

 > A catalog manager handles one or more catalogs.  He is responsible
   for

 > * the final name arbitration for packages with conflicting names

 > * and the categorization of the packages indexed by the catalog.

 1. P/A Builder

 > A package and/or application builder is a person and/or organization

 > * who retrieves raw distributions from source repositories and (in
     a sequence of several steps) generates binary distributions from
     them.

 > * or who retrieves source distributions from distribution
     repositories and (in a sequence of several steps) generates
     binary distributions from them.

 > * uploads the generated binary distributions into one or more
     distribution repositories.

 > * uploads the generated source distributions into one or more
     distribution repositories.

 > The intermediate steps performed by a builder are ''Preparation'',
   ''Building'', and ''Binary generation''.

 > If ''System administrator'' and ''P/A Builder'' coordinate with each other
   it is also possible to install a package directly from the built
   package.

 > ''NOTE:'' Think about splitting P/A Builder into two roles; one for
   the preparation of source ''distributions'' and a second role for
   the generation of binary ''distributions''.

|         TODO Find some nice names for the split roles.

 1. P/A Developer

 > A developer is a ''P/A User'' whose tasks include the creation of
   new packages and/or applications.  A workspace contains the raw
   sources of these new packages and applications.  For posterity and
   version control it is kept synchronized with one or more ''source
   repositories''.

 > During development, at least one installation has to be accessible,
   containing the initial packages, the new packages and the
   applications built upon the packages.

 1. P/A User

 > A person or organization which uses Tcl based tools but does not
   develop new code.  Its workspace contains the files required for
   the tasks at hand.

 > The border between the roles of P/A Developer and P/A User blurs if
   the tool being used allows one to customize/program/extend it in
   Tcl.

 1. Repository manager

 > This role handles the management of all types of ''repositories''.
   This role is not part of the development process ''per se'' and has
   no direct actions for with regards to distributions, packages,
   sources, etc.

 > Repository managers are responsible for keeping the system up and
   running, doing adequate backups, supporting mirrors, providing
   browsing and download capabilities, providing confidence that
   updates to items in the repository are being done by the approved
   person or persons, etc.

 > The role is related to ''System administrator'', but not the same.
   It was split out of that role because a ''System-Administrator'' is
   usually internal to an organization whereas a repository and its
   management can be provided by an entity external to the
   organization.

 1. System administrator

 > A system administrator manages

 > * one or more ''installations'' of the Tcl core and additional
     packages.  Each installation may be configured differently (Tcl
     version, installed packages, platform, ...).

 > * Installed packages are taken either from a ''distribution
     repository'' or directly from a built package.  The latter has to
     be done in coordination with a ''P/A Builder''.

 > Her responsibilities include

 > * the creation of empty installations, 

 > * the destruction of installations,

 > * the addition and removal of packages to/from an existing
     installation.

The three P/A roles are central to the development process and bound
together in a tight loop of information flowing between them.  The
other three roles handle the support structure without which the other
roles would be unable to communicate and collaborate.

#image:78pa_cycle

Examples:

 * Larry Virden is ''System-Administrator'', ''P/A Builder'' and ''P/A
   User'' for his organization.

 * I (the author of this TIP) am all roles, on my system at home.

 * ActiveState is ''Repository Manager'' for the Perl community and
   plans to become one for the Tcl community.

 * SourceForge is a combination of ''Repository Manager'' and
   ''Catalog Manager''.

 * Most people with a windows machine at home are
   ''System-Administration'' and ''P/A User'' for this machine.

~ Visual Representation

The following drawings are a visual adjunct to the terms in the last
sections to aid in the understanding of the terms and their relations.

Legend:

 * Blue rounded boxes - Areas of responsibility for roles.  The
   responsible role is written in gold text inside of the box.

 * White rounded boxes with a black border - Data, like packages,
   distributions, etc.

 * White boxes with a red border - Actions on data.

#image:78tea_terms_relations_1 Roles, Data and Actions

#image:78tea_terms_relations_2 Relationships Between Data Entities

~ Copyright  

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|
|

|

|
|

|

|
|
|

|

|
|

|
|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

|

|
|

|

|

|

|
|

|

|

|
|

|

|

|

|

|
|

|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|
|
|

|
|

|

|

|

|

|

|
|

|
|

|

|

|
|

|

|
|
|

|

|

|
|

|

|

|

|
|

|

|
|

|

|
|
|

|
|
|

|

|

|

|

|

|
|

|

|

|
|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484

# TIP 78: TEA 2.0 Definitions

	Author:         Andreas Kupries <[email protected]>
	Author:         Larry W. Virden <[email protected]>
	State:          Draft
	Type:           Informative
	Vote:           Pending
	Created:        15-Dec-2001
	Post-History:   
-----

# Abstract

This document is an informational TIP providing definitions for
commonly used terms \(like package, extension, core, distribution,
etc.\) to make future communication among people in the community
easier.  It is recommended that future and past documents specifying
details inside of the greater context of TEA refer to this document to
ensure a consistent usage of terms.

# Background

This document is an adjunct to the [TIP <<vision>>].  _\(DKF - Is
this meant to be a reference to [[34]](34.md)?  Edit to such if so...\)_

To facilitate the specification and adoption of clearly defined
interfaces to various activities and technologies it specifies a
number of terms used by the community at large to create a greater
unity and coherence in the usage of these terms.  In other words, by
creating generally accepted definitions for important terms the risk
of misunderstanding each other is reduced.

# Specification of Technical Terms

This section specifies a number of important technical terms in
alphabetical order.  Terms used inside of a specification are
highlighted.

 * Application

	 > An entity implementing some functionality required by a _P/A
   User_ \(see section _Roles_\) to perform his work.  Consists of
   one more files.  May make use of _packages_ to implement its
   functionality.

 * Archive

	 > Encapsulation of a _distribution_ in a single file.  Well-known
   formats for archives are tar, gzipped tar, bzipped tar, and zip.

 * Binary generation

	 > The process of wrapping up a built package into a binary
   _distribution_.  See _building_ too.

 * Building

	 > The process of _configuring_ and _transforming_ a source
   _distribution_ into a set of files which can be either
   immediately installed on the site which built them or wrapped into
   a binary _distribution_ for shipment to other sites.  This also
   includes the execution of a test-suite coming with the package.

 * Bundle

	 > A _distribution_ encapsulating more than one _package_.

 * Catalog

	 > A catalog is a site providing an index of the packages.

 * Configuration

	 > The process of customizing a source distribution for a particular
   architecture and/or site.  A part of _Building_.

 * Conflict

	 > Two or more packages are said to be in conflict if the usage of one of them
   excludes the usage of the others.

	 > In a _strong conflict_ installing one of the packages disallows
   even the installation of the others.

 * Core

	 > A shorthand for _Tcl core_.

 * Core distribution.

	 > See _Tcl core distribution_.

 * Dependency

	 > A relationship between packages.  A package X is said dependent on
   another package Y if said package Y is required to allow X to work.

	 > There are several types of dependencies:

	 > \* Build-time dependency

	 > > Y is required to allow X to be build.

	 > \* Run-time dependency

	 > > Y is required to allow the usage of an installed X.

	 > \* Optional dependency

	 > > Y can be used by X \(usually for improved performance\) but is not
     required for building or use.

 * Distribution

	 > An encapsulation of one or more _packages_ for transport between
   places, machines, organizations, and people.  Several types of
   distributions are possible, explained below.

	 > 1. binary

	 > > A binary distribution is in a state and format allowing the
     installation of the contained packages on a particular platform.

	 > > It contains at least all the files

	 > > \* implementing the functionality of the distributed packages and

	 > > \* required to allow the package management of the Tcl core to
       handle the packages.

	 > > A binary distribution usually contains just the files for a
     single architecture.  This is not a explicit requirement however.
     In other words, a binary distribution is allowed to contain the
     files for several architectures.  Such a binary distribution will
     be called a _fat binary distribution_.

	 > > We distinguish between three sub-types:

	 > > \* installable

	 > > > This is the minimal binary distribution we spoke of before.

	 > > \* auto-installable

	 > > > Like _installable**, but additionally contains an application
       whose execution will perform all the steps which are necessary
       to install the contained packages at a site.  The installation
       to add the packages to \(and thus the location of the installed
       packages\) can be freely chosen by the caller of the executable.

	 > > \* auto-image

	 > > > Like _auto-installable_, but the final location of the
       packages is hard-wired into the distribution.  This means that
       a site using auto-images is restricted to one installation per
       architecture for which files are contained in the binary
       distribution.

	 > > > Some of the existing archive formats restrict their contained
       distributions to this type.  Examples of such archive formats
       are

	 > > > \* RPM \(RedHat Package Manager\)

	 > > > \* DEB \(DEBian Linux package format\)

	 > > See [TIP 55] for the specification of a proposed layout for
     binary distributions.

	 > 1. bundle

	 > > A distribution which either contains the distributions of more
     than one package or a list of references to packages.  The
     references are done by specifying the name and version of the
     packages contained in the bundle.

	 > 1. buildable

	 > > See _source_.

	 > 1. compileable

	 > > See _source_.

	 > 1. raw

	 > > A raw distribution is the most fundamental distribution.  Its
     format is nearly completely unspecified.  Its contents are
     straight from a source repository.  The process of converting a
     raw distribution into a source distribution is called
     _Preparation_.

	 > > Because of the unformatted nature of a raw distribution the
     commands for its conversion into a source distribution have to be
     part of it.  This is the only part of a raw distribution which
     can and has to be specified.

	 > > Example: The execution of _autoconf_ to generate a
     _configure_ script from its _configure.in_ file can be a
     single step in a complex preparation.

	 > 1. source

	 > > A source distribution is in a format and state where tools can be
     used to build its contents.

	 > > The format needs further specification but for now we can assume
     that it is governed by the current TEA specification.

	 > > Alternate name for this type of distribution are _compileable_
     and _buildable_.

 * Distribution repository

	 > See _repository_

 * Extension

	 > Alternate name for a _package_, generally used for packages
   requiring compilation to become functional.

	 > This term is _deprecated_.

 * Installation

	 > A special type of _distribution repository, binary_ \(see
   _repository_\) containing all _packages_ which were installed on
   a site.  A site may host several installations differing in version
   and/or configuration of Tcl, and/or the platform Tcl was built for,
   etc.

	 > Currently difficult to do but in the future it should made be
   possible for an installation to refer and use another installation,
   provided both are configured identically.  This allows a site to
   build a hierarchy of installations from the most general containing
   the common packages down to installations associated with one or
   more _P/A Developers_.

 * Installing

	 > The process of unpacking a binary distribution and adding the
   contained packages to an _installation_.  The latter may include
   moving files to their proper places.

	 > Also the process of adding a built package to an _installation_.

 * Manifest

	 > A file detailing the files making up a particular package.

 * Package

	 > A collection of files providing additional functionality to a user
   of a Tcl interpreter when loaded into said interpreter.

	 > Some files in a package implement the provided functionality
   whereas other files contain meta-information required by the
   package management of Tcl to be able to use the package.

 * Preparation

	 > The process of converting a raw _distribution_ into a source
   _distribution_ suitable as input to _building_.  This includes
   actions like:

	 > \* Retrieval of sources from a source repository

	 > \* Creating a _configure_ file from its _configure.in_.

	 > \* Creating the distributed documentation from internal sources.

	 > \* Removal of internal files containing notes, scratch info and the like.

	 > \* Inserting version information into files.

	 > \* ...

	 > The tool _makedist_ \(which I wrote\) is in my mind when thinking
   about this step.

 * Raw retrieval

	 > The process of retrieving a raw _distribution_ from a _source
   repository_.

 * Repository

	 > General term with two possible meanings.

	 > 1. A collection of _archives_.  The exact term for this type of
     repository is "distribution repository".

	 > > If a distribution repository is restricted to one type of
     distributions this type can be added to the term as further
     specification of the type of repository.  Thus

	 > > \* _distribution repository, binary_, or

	 > > \* _distribution repository, source_.

	 > 2. A collection of directories, developer files and control files
      containing version control information.  The exact term for this
      type of repository is _source repository_.

	 > > A repository can either be internal to an organization or public.

 * Source repository

	 > See _repository_

 * Tcl core

	 > The most fundamental part in the Tcl world; the interpreter engine
   and the builtin commands coming with it.  These are all commands
   and procedures reported by _info commands_ for a _tclsh_ which
   was started with an empty _.tclshrc_ file and after sourcing all
   _.tcl_ files in the directory returned by _info library_.

 * Tcl core distribution

	 > The most fundamental distribution there is.  Contains the _Tcl
   core_ and a number of packages.

# Roles

The terms in the preceding section specified both passive data
structures and actions upon them.  This section specifies the possible
actors, i.e. entities which perform one or more of these actions.

To make the specification easier, related actions are grouped into
roles of behavior.  The mapping from roles to actual actors,
i.e. people and organizations is n:m.  On other words, one actor may
have several roles, either at once or changing over time and one role
can be held by several distinct actors.

Examples are given at the end of this section.

 1. Catalog manager

	 > A catalog manager handles one or more catalogs.  He is responsible
   for

	 > \* the final name arbitration for packages with conflicting names

	 > \* and the categorization of the packages indexed by the catalog.

 1. P/A Builder

	 > A package and/or application builder is a person and/or organization

	 > \* who retrieves raw distributions from source repositories and \(in
     a sequence of several steps\) generates binary distributions from
     them.

	 > \* or who retrieves source distributions from distribution
     repositories and \(in a sequence of several steps\) generates
     binary distributions from them.

	 > \* uploads the generated binary distributions into one or more
     distribution repositories.

	 > \* uploads the generated source distributions into one or more
     distribution repositories.

	 > The intermediate steps performed by a builder are _Preparation_,
   _Building_, and _Binary generation_.

	 > If _System administrator_ and _P/A Builder_ coordinate with each other
   it is also possible to install a package directly from the built
   package.

	 > _NOTE:_ Think about splitting P/A Builder into two roles; one for
   the preparation of source _distributions_ and a second role for
   the generation of binary _distributions_.

		         TODO Find some nice names for the split roles.

 1. P/A Developer

	 > A developer is a _P/A User_ whose tasks include the creation of
   new packages and/or applications.  A workspace contains the raw
   sources of these new packages and applications.  For posterity and
   version control it is kept synchronized with one or more _source
   repositories_.

	 > During development, at least one installation has to be accessible,
   containing the initial packages, the new packages and the
   applications built upon the packages.

 1. P/A User

	 > A person or organization which uses Tcl based tools but does not
   develop new code.  Its workspace contains the files required for
   the tasks at hand.

	 > The border between the roles of P/A Developer and P/A User blurs if
   the tool being used allows one to customize/program/extend it in
   Tcl.

 1. Repository manager

	 > This role handles the management of all types of _repositories_.
   This role is not part of the development process _per se_ and has
   no direct actions for with regards to distributions, packages,
   sources, etc.

	 > Repository managers are responsible for keeping the system up and
   running, doing adequate backups, supporting mirrors, providing
   browsing and download capabilities, providing confidence that
   updates to items in the repository are being done by the approved
   person or persons, etc.

	 > The role is related to _System administrator_, but not the same.
   It was split out of that role because a _System-Administrator_ is
   usually internal to an organization whereas a repository and its
   management can be provided by an entity external to the
   organization.

 1. System administrator

	 > A system administrator manages

	 > \* one or more _installations_ of the Tcl core and additional
     packages.  Each installation may be configured differently \(Tcl
     version, installed packages, platform, ...\).

	 > \* Installed packages are taken either from a _distribution
     repository_ or directly from a built package.  The latter has to
     be done in coordination with a _P/A Builder_.

	 > Her responsibilities include

	 > \* the creation of empty installations, 

	 > \* the destruction of installations,

	 > \* the addition and removal of packages to/from an existing
     installation.

The three P/A roles are central to the development process and bound
together in a tight loop of information flowing between them.  The
other three roles handle the support structure without which the other
roles would be unable to communicate and collaborate.

![](../assets/78pa_cycle.gif)

Examples:

 * Larry Virden is _System-Administrator_, _P/A Builder_ and _P/A
   User_ for his organization.

 * I \(the author of this TIP\) am all roles, on my system at home.

 * ActiveState is _Repository Manager_ for the Perl community and
   plans to become one for the Tcl community.

 * SourceForge is a combination of _Repository Manager_ and
   _Catalog Manager_.

 * Most people with a windows machine at home are
   _System-Administration_ and _P/A User_ for this machine.

# Visual Representation

The following drawings are a visual adjunct to the terms in the last
sections to aid in the understanding of the terms and their relations.

Legend:

 * Blue rounded boxes - Areas of responsibility for roles.  The
   responsible role is written in gold text inside of the box.

 * White rounded boxes with a black border - Data, like packages,
   distributions, etc.

 * White boxes with a red border - Actions on data.

![Roles, Data and Actions](../assets/78tea_terms_relations_1.gif)

![Relationships Between Data Entities](../assets/78tea_terms_relations_2.gif)

# Copyright  

This document has been placed in the public domain.

Name change from tip/79.tip to tip/79.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109

TIP:            79
Title:          Add Deletion Callback to Tcl_CreateObjTrace
Version:        $Revision: 1.7 $
Author:         Kevin Kenny <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        03-Jan-2002
Post-History:   
Discussions-To: news:comp.lang.tcl
Keywords:       trace,Tcl_Obj
Tcl-Version:    8.4

~ Abstract

This document is a correction to the ''Tcl_CreateObjTrace'' API from
[32].  It addresses a deficiency that the API provides no deletion
callback for its client data.

~ Rationale

In developing a reference implementation for the changes described in
[32], the author of this TIP discovered an anomaly in the proposed API
for ''Tcl_CreateObjTrace.''  While the function accepts a
''ClientData'' parameter, it provides no deletion callback for the
client data, making it difficult to clean up the client data if
''Tcl_DeleteTrace'' is called from a point in the code where the
client data is not readily available.  (The usual pattern in the Tcl
library is to provide a deletion callback wherever client data is
passed to the Tcl interpreter; ''Tcl_CreateObjCommand'' is an example.

~ Specification

The ''Tcl_CreateObjTrace'' function proposed in [32] shall be changed
to the following:

| Tcl_Trace Tcl_CreateObjTrace ( Tcl_Interp*                interp,
|                                int                        level, 
|                                int                        flags,
|                                Tcl_CmdObjTraceProc*       objProc,
|                                ClientData                 clientData,
|                                Tcl_CmdObjTraceDeleteProc* deleteProc );

The ''Tcl_CreateObjTrace'' function adds a trace to the Tcl evaluator.
The ''interp'' argument is the Tcl interpreter for which tracing is
being requested.  The ''level'' argument is the maximum depth of
recursive calls; when the execution depth of the interpreter exceeds
this number, the trace callback does not execute.  The ''objProc''
argument is the callback procedure to execute each time a Tcl command
is evaluated; it is expected to have arguments and result type that
match ''Tcl_CmdObjTraceProc'' below.  The ''clientData'' argument is
client data to pass to the ''objProc'' callback.  The ''deleteProc''
argument specifies a function to call when the trace is removed by a
call to ''Tcl_DeleteTrace.''  This parameter may be a null pointer if
no deletion callback is desired.  Finally, the ''flags'' argument
gives flags that control the tracing.  Initially, the only flag
supported will be ''TCL_ALLOW_INLINE_COMPILE''.  If this flag is set,
the bytecode compiler is permitted to compile in-line code for the Tcl
built-in commands; any command that has been compiled in-line will not
be traced.

The trace token returned from ''Tcl_CreateObjTrace'' may be passed as
a parameter to ''Tcl_DeleteTrace'', which arranges to cancel the
tracing.  If a non-empty ''deleteProc'' argument was supplied to
''Tcl_CreateObjTrace'', it is called at this time.  After
''Tcl_DeleteTrace'' returns, no further calls to the trace procedure
will be made, and the trace token must not be used further in the
calling program.

The ''Tcl_CmdObjTraceProc'' will have the following type signature.

|    typedef int Tcl_CmdObjTraceProc( ClientData     clientData,
|                                     Tcl_Interp*    interp,
|                                     int            level,
|                                     CONST char*    command,
|                                     Tcl_Command    commandInfo,
|                                     int            objc,
|                                     Tcl_Obj *CONST objv[] );

The ''clientData'' parameter is the client data that was passed to
''Tcl_CreateObjTrace''.  The ''interp'' parameter designates a Tcl
interpreter.  The ''level'' parameter specifies the execution level.
The ''command'' parameter gives the raw UTF-8 text of the command
being evaluated, before any substitutions have been performed.  The
''commandInfo'' parameter is an opaque ''Tcl_Command'' object that
gives information about the command.  The ''objc'' and ''objv''
parameters are the command name and parameter vector after
substitution.

The trace procedure is expected to return a standard Tcl status
return.  If it returns ''TCL_OK'', the command is evaluated normally.
If it returns ''TCL_ERROR'', evaluation of the command does not take
place.  The interpreter result is expected to contain an error
message.  If it returns any other status, such as ''TCL_BREAK'',
''TCL_CONTINUE'' or ''TCL_RETURN'', it is treated as if the command
had done so.

The ''Tcl_CmdObjTraceDeleteProc'' will have the following type
signature.

|    typedef void Tcl_CmdObjTraceDeleteProc( ClientData clientData );

The ''clientData'' parameter is the client data that was originally
passed into ''Tcl_CreateObjTrace''.

~ Copyright

Copyright � 2002 by Kevin B. Kenny.  Distribution in whole or part,
with or without annotations, is unlimited.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|
|

|

|
|
|

|
|

|

|

|

|
|
|
|
|
|

|
|
|

|

|
|

|
|

|

|
|
|
|
|

|

|
|
|
|
|
|
|

|
|
|
|

|
|

|
|

|
|

|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109

# TIP 79: Add Deletion Callback to Tcl_CreateObjTrace

	Author:         Kevin Kenny <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        03-Jan-2002
	Post-History:   
	Discussions-To: news:comp.lang.tcl
	Keywords:       trace,Tcl_Obj
	Tcl-Version:    8.4
-----

# Abstract

This document is a correction to the _Tcl\_CreateObjTrace_ API from
[[32]](32.md).  It addresses a deficiency that the API provides no deletion
callback for its client data.

# Rationale

In developing a reference implementation for the changes described in
[[32]](32.md), the author of this TIP discovered an anomaly in the proposed API
for _Tcl\_CreateObjTrace._  While the function accepts a
_ClientData_ parameter, it provides no deletion callback for the
client data, making it difficult to clean up the client data if
_Tcl\_DeleteTrace_ is called from a point in the code where the
client data is not readily available.  \(The usual pattern in the Tcl
library is to provide a deletion callback wherever client data is
passed to the Tcl interpreter; _Tcl\_CreateObjCommand_ is an example.

# Specification

The _Tcl\_CreateObjTrace_ function proposed in [[32]](32.md) shall be changed
to the following:

	 Tcl_Trace Tcl_CreateObjTrace ( Tcl_Interp*                interp,
	                                int                        level, 
	                                int                        flags,
	                                Tcl_CmdObjTraceProc*       objProc,
	                                ClientData                 clientData,
	                                Tcl_CmdObjTraceDeleteProc* deleteProc );

The _Tcl\_CreateObjTrace_ function adds a trace to the Tcl evaluator.
The _interp_ argument is the Tcl interpreter for which tracing is
being requested.  The _level_ argument is the maximum depth of
recursive calls; when the execution depth of the interpreter exceeds
this number, the trace callback does not execute.  The _objProc_
argument is the callback procedure to execute each time a Tcl command
is evaluated; it is expected to have arguments and result type that
match _Tcl\_CmdObjTraceProc_ below.  The _clientData_ argument is
client data to pass to the _objProc_ callback.  The _deleteProc_
argument specifies a function to call when the trace is removed by a
call to _Tcl\_DeleteTrace._  This parameter may be a null pointer if
no deletion callback is desired.  Finally, the _flags_ argument
gives flags that control the tracing.  Initially, the only flag
supported will be _TCL\_ALLOW\_INLINE\_COMPILE_.  If this flag is set,
the bytecode compiler is permitted to compile in-line code for the Tcl
built-in commands; any command that has been compiled in-line will not
be traced.

The trace token returned from _Tcl\_CreateObjTrace_ may be passed as
a parameter to _Tcl\_DeleteTrace_, which arranges to cancel the
tracing.  If a non-empty _deleteProc_ argument was supplied to
_Tcl\_CreateObjTrace_, it is called at this time.  After
_Tcl\_DeleteTrace_ returns, no further calls to the trace procedure
will be made, and the trace token must not be used further in the
calling program.

The _Tcl\_CmdObjTraceProc_ will have the following type signature.

	    typedef int Tcl_CmdObjTraceProc( ClientData     clientData,
	                                     Tcl_Interp*    interp,
	                                     int            level,
	                                     CONST char*    command,
	                                     Tcl_Command    commandInfo,
	                                     int            objc,
	                                     Tcl_Obj *CONST objv[] );

The _clientData_ parameter is the client data that was passed to
_Tcl\_CreateObjTrace_.  The _interp_ parameter designates a Tcl
interpreter.  The _level_ parameter specifies the execution level.
The _command_ parameter gives the raw UTF-8 text of the command
being evaluated, before any substitutions have been performed.  The
_commandInfo_ parameter is an opaque _Tcl\_Command_ object that
gives information about the command.  The _objc_ and _objv_
parameters are the command name and parameter vector after
substitution.

The trace procedure is expected to return a standard Tcl status
return.  If it returns _TCL\_OK_, the command is evaluated normally.
If it returns _TCL\_ERROR_, evaluation of the command does not take
place.  The interpreter result is expected to contain an error
message.  If it returns any other status, such as _TCL\_BREAK_,
_TCL\_CONTINUE_ or _TCL\_RETURN_, it is treated as if the command
had done so.

The _Tcl\_CmdObjTraceDeleteProc_ will have the following type
signature.

	    typedef void Tcl_CmdObjTraceDeleteProc( ClientData clientData );

The _clientData_ parameter is the client data that was originally
passed into _Tcl\_CreateObjTrace_.

# Copyright

Copyright © 2002 by Kevin B. Kenny.  Distribution in whole or part,
with or without annotations, is unlimited.

Name change from tip/8.tip to tip/8.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

TIP:           8
Title:         Add Winico support to the wm command on windows
Version:       $Revision: 1.8 $
Author:        Vince Darley <[email protected]>
State:         Final
Type:          Project
Tcl-Version:   8.4.0
Vote:          Done
Created:       06-Nov-2000
Post-History:

~ Abstract

Add to ''wm'' the ability to do the windows-titlebar-icon manipulation
that the Winico extension currently provides, without the bugs noted
in that extension.

~ Proposal

Modify ''wm'' on Windows only to allow an optional ''-default'' argument.

|wm iconbitmap .winpath ?-default? filename

And to allow a file which is of valid windows-icon format to be
interpreted as such.  Any file which is not correctly interpreted
as an icon will be handled as before, by the ''bitmap'' code (which
will generally either do nothing, or throw an error, thus maintaining
backwards compatibility).

The ''-default'' argument, if given, will change not the icon of the
.winpath given, but rather the default icon for all windows in the
current application for which no specific icon as been set.

An implementation already exists, which fixes the basic "wrapper
window" problems and which has the above syntax.  The issues
surrounding reference counting of icons in use has also been addressed
in this patch so that icons no longer in use are released (the Winico
patch required manual deletion of icons).  This reference
implementation is available from
ftp://ftp.ucsd.edu/pub/alpha/tcl/tkWinWm.diff (documentation has been
separately patched, and can also be made available).

~ Rationale

There have been many requests on news:comp.lang.tcl for this ability
in the Tk core, and several bug reports filed against Winico, and this
ability has been placed on the Tk 8.4 roadmap.
http://purl.org/tcl/home/software/tcltk/roadmap.tml

The choice of ''wm iconbitmap'' is suggested, because ''wm
iconbitmap'' currently doesn't appear to do anything on Windows, yet
is the obvious choice for the user trying to set the window's icon
(e.g. many posts on news:comp.lang.tcl are actually asking why ''wm
iconbitmap'' doesn't do anything).

In the future we may wish to extend ''wm iconbitmap'' on all platforms
so that other image types can be accepted (e.g. .gif, .png).  This
proposal extends naturally to allow such future work.  The primary
changes required will be icon<->image conversion routines.

~ Alternatives

Fix the core so that Winico can work properly as an extension.

My implementation as shown that this would require a couple of
patches, and also the exporting of an additional obscure function into
Tk's stub table (a function which would ensure that Tk's window
manager is completely initialised).  It would also not help the users
posting to news:comp.lang.tcl asking "why doesn't wm iconbitmap do
anything?"

~ Objections

''This is platform specific and should go in an extension''

See ''Alternatives'' above, also see the ''future suggestion'' above
in which this kind of code can be usefully extended in a
cross-platform way.

''The -default flag is weird, and it means we ignore the window name''

I agree, but please suggest a better alternative rather than just
moaning.  The command with the -default flag is in my opinion more
useful than the command without (for example it makes sure that Tk's
built-in dialogs have the icon of your application).  An alternative
might be to use ''wm iconbitmap -default filename'', but that involves
more significant modifications of the semantics of ''wm''.  It might,
however, be a good idea.

''wm iconbitmap will still do nothing when given a bitmap''

Yes, but there's that backwards compatibility issue.  This should be
properly documented with pointers to the use of valid icon file
formats.  When or if proper support is added to Tk for .gif, .png or
even Tk images as icons, this bug can be fixed.  The purpose of this
TIP is not to fix that bug, but to provide a better solution.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|
|

|
|

|

|

|
|

|
|

|
|

|

|
|

|

|

|

|

|
|
|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100

# TIP 8: Add Winico support to the wm command on windows

	Author:        Vince Darley <[email protected]>
	State:         Final
	Type:          Project
	Tcl-Version:   8.4.0
	Vote:          Done
	Created:       06-Nov-2000
	Post-History:
-----

# Abstract

Add to _wm_ the ability to do the windows-titlebar-icon manipulation
that the Winico extension currently provides, without the bugs noted
in that extension.

# Proposal

Modify _wm_ on Windows only to allow an optional _-default_ argument.

	wm iconbitmap .winpath ?-default? filename

And to allow a file which is of valid windows-icon format to be
interpreted as such.  Any file which is not correctly interpreted
as an icon will be handled as before, by the _bitmap_ code \(which
will generally either do nothing, or throw an error, thus maintaining
backwards compatibility\).

The _-default_ argument, if given, will change not the icon of the
.winpath given, but rather the default icon for all windows in the
current application for which no specific icon as been set.

An implementation already exists, which fixes the basic "wrapper
window" problems and which has the above syntax.  The issues
surrounding reference counting of icons in use has also been addressed
in this patch so that icons no longer in use are released \(the Winico
patch required manual deletion of icons\).  This reference
implementation is available from
ftp://ftp.ucsd.edu/pub/alpha/tcl/tkWinWm.diff \(documentation has been
separately patched, and can also be made available\).

# Rationale

There have been many requests on news:comp.lang.tcl for this ability
in the Tk core, and several bug reports filed against Winico, and this
ability has been placed on the Tk 8.4 roadmap.
<http://purl.org/tcl/home/software/tcltk/roadmap.tml>

The choice of _wm iconbitmap_ is suggested, because _wm
iconbitmap_ currently doesn't appear to do anything on Windows, yet
is the obvious choice for the user trying to set the window's icon
\(e.g. many posts on news:comp.lang.tcl are actually asking why _wm
iconbitmap_ doesn't do anything\).

In the future we may wish to extend _wm iconbitmap_ on all platforms
so that other image types can be accepted \(e.g. .gif, .png\).  This
proposal extends naturally to allow such future work.  The primary
changes required will be icon<->image conversion routines.

# Alternatives

Fix the core so that Winico can work properly as an extension.

My implementation as shown that this would require a couple of
patches, and also the exporting of an additional obscure function into
Tk's stub table \(a function which would ensure that Tk's window
manager is completely initialised\).  It would also not help the users
posting to news:comp.lang.tcl asking "why doesn't wm iconbitmap do
anything?"

# Objections

_This is platform specific and should go in an extension_

See _Alternatives_ above, also see the _future suggestion_ above
in which this kind of code can be usefully extended in a
cross-platform way.

_The -default flag is weird, and it means we ignore the window name_

I agree, but please suggest a better alternative rather than just
moaning.  The command with the -default flag is in my opinion more
useful than the command without \(for example it makes sure that Tk's
built-in dialogs have the icon of your application\).  An alternative
might be to use _wm iconbitmap -default filename_, but that involves
more significant modifications of the semantics of _wm_.  It might,
however, be a good idea.

_wm iconbitmap will still do nothing when given a bitmap_

Yes, but there's that backwards compatibility issue.  This should be
properly documented with pointers to the use of valid icon file
formats.  When or if proper support is added to Tk for .gif, .png or
even Tk images as icons, this bug can be fixed.  The purpose of this
TIP is not to fix that bug, but to provide a better solution.

# Copyright

This document has been placed in the public domain.

Name change from tip/80.tip to tip/80.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60

61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80

81
82

83
84
85

86
87
88

89
90
91

92
93
94
95
96
97
98
99
100
101

102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136

137
138
139
140
141

142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161

162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191

192
193
194
195
196
197
198
199
200

201
202
203
204
205
206
207
208
209

210
211
212
213
214
215
216
217
218
219
220
221

222
223
224

225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241

242
243
244
245
246
247

248
249

250
251
252

253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276

277
278
279
280
281

282
283
284
285
286

287
288
289
290
291
292
293

294
295

296
297
298
299
300
301

302
303
304
305
306
307
308

309
310
311

312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332

333
334
335
336
337
338
339
340

341
342
343
344
345
346
347
348
349
350
351
352
353

354
355

356
357
358
359
360
361

362
363
364
365
366
367

368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391

392
393
394
395
396

397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412

413
414
415
416
417
418
419
420
421
422

423
424
425
426
427
428
429

430
431
432
433

434
435

436

437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455

456
457
458
459
460
461
462

463
464

465
466
467

468
469
470
471
472
473
474

475
476

477
478
479

480
481
482
483
484
485
486

487
488

489
490

491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526

TIP:            80
Title:          Additional Options for 'lsearch'
Version:        $Revision: 1.10 $
Author:         Tom Wilkason <[email protected]>
Author:         Tom Wilkason <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        02-Jan-2002
Post-History:   
Discussions-To: news:comp.lang.tcl
Tcl-Version:    8.4

~ Abstract

This TIP proposes additional options for the ''lsearch'' command to
return and work with all matching items in the return rather than the
first matching item. Additional options are also added.

~ Rationale

The ''lsearch'' function works well for finding the first item in a
list that matches a pattern.  However it is often useful to find all
of the items in the list that match a pattern.  This TIP proposes
adding options to return the entire list of matches.  With this
capability, additional options are proposed to return the data rather
than the indices (since you often want to work the the data anyway),
and to add an option to return the logical exclusion of the matching
items (i.e. those that don't match the search pattern).

~ Specification

I propose the following options be added to ''lsearch'':

Option: ''-start index''

 > Initiates the list search starting at ''index'', which can be
   any valid list index (such as 0 , end , end-1 ...) 

Option: ''-all''

 > Returns a list of all indices that match the search condition
   (rather than the first one).  The indices are returned low to high
   order.  For a no match condition, a {} (empty list) is returned.
   If the the ''-all'' or ''-inline'' switches are not specified, a
   -1 is returned for a no match condition just as is done now.

Option: ''-inline''

 > Returns a single item (or a list of items for ''-all'') of the data
   that matches the search condition rather than the index (or
   indices).  An empty result or empty list (''-all'') is returned for
   a no match condition.  The data is returned in original list order.
   This option is useful when you want to iterate over the returned
   data anyway.  e.g.

|    foreach item [lsearch -all -inline -glob $someList *stuff] {
|       # deal with item
|    }

Option: ''-not''

 > Negates the sense of the search condition (i.e. what doesn't
   match).  When used with the ''-inline'' or ''-all'' options, the
   return set will be the items that do match.  If all items match
   then a {} is returned.  Without the ''-all'' option, the first item
   in the list that does not match will be returned.

These can be combined as needed and yield some powerful capabilities
when iterating over sub-lists (esp. with the new ''lset'' command).

~ Reference Implementation

Changes to the ''Tcl_LsearchObjCmd'' command in ''generic/tclCmdIL.c''
are needed along with documentation and test code.  The changes to the
8.4 head version of ''tclCmdIL.c'' are available below.

|/*
| *----------------------------------------------------------------------
| *

| * Tcl_LsearchObjCmd --
| *

| *      This procedure is invoked to process the "lsearch" Tcl command.
| *      See the user documentation for details on what it does.
| *

| * Results:
| *      A standard Tcl result.
| *

| * Side effects:
| *      See the user documentation.
| *

| *----------------------------------------------------------------------
| */
|
|int
|Tcl_LsearchObjCmd(clientData, interp, objc, objv)
|    ClientData clientData;      /* Not used. */
|    Tcl_Interp *interp;         /* Current interpreter. */
|    int objc;                   /* Number of arguments. */
|    Tcl_Obj *CONST objv[];      /* Argument values. */
|{

|    char *bytes, *patternBytes;
|    int i, match, mode, index, result, listc, length, elemLen;
|    int useStart=-1, offset, allData=0, returnInline=0;
|    int dataType, isIncreasing, lower, upper, patInt, objInt, notMatch=0;
|    double patDouble, objDouble;
|    Tcl_Obj *patObj, **listv, *listPtr, *startPtr = NULL;
|    static CONST char *options[] = {
|        "-all", "-ascii", "-decreasing", "-dictionary",
|        "-exact", "-glob", "-increasing", "-inline",
|        "-integer", "-not", "-real", "-regexp",
|        "-sorted", "-start", NULL
|    };
|    enum options {
|        LSEARCH_ALL, LSEARCH_ASCII, LSEARCH_DECREASING, LSEARCH_DICTIONARY,
|        LSEARCH_EXACT, LSEARCH_GLOB, LSEARCH_INCREASING, LSEARCH_INLINE,
|        LSEARCH_INTEGER, LSEARCH_NOT, LSEARCH_REAL, LSEARCH_REGEXP,
|        LSEARCH_SORTED, LSEARCH_START
|    };
|
|    enum datatypes {
|        ASCII, DICTIONARY, INTEGER, REAL
|    };
|
|    enum modes {
|        EXACT, GLOB, REGEXP, SORTED
|    };
|
|    mode = GLOB;
|    dataType = ASCII;
|    isIncreasing = 1;
|    /* Note: This counts options as possible list|patterns */
|    if (objc < 3) {
|        Tcl_WrongNumArgs(interp, 1, objv, "?options? list pattern");
|        return TCL_ERROR;
|    }

|    for (i = 1; i < objc-2; i++) {
|        if (Tcl_GetIndexFromObj(interp, objv[i], options, "option", 0, &index)
|                != TCL_OK) {
|            return TCL_ERROR;
|        }

|        switch ((enum options) index) {
|            case LSEARCH_ASCII:         /* -ascii */
|                dataType = ASCII;
|                break;
|            case LSEARCH_NOT:           /* -not */
|                notMatch = 1;
|                break;
|            case LSEARCH_ALL:           /* -all */
|                allData = 1;
|                listPtr = Tcl_NewListObj(0, (Tcl_Obj **) NULL);
|                break;
|            case LSEARCH_INLINE:        /* -inline */
|                returnInline = 1;
|                break;
|            case LSEARCH_START:         /* -start index */
|                useStart = ++i;         /* Use next arg as offset index */
|                if (objc-i < 2) {
|                    Tcl_SetResult(interp,
|                            "missing argument to -start option", TCL_STATIC);
|                }

|                break;
|            case LSEARCH_DECREASING:    /* -decreasing */
|                isIncreasing = 0;
|                break;
|            case LSEARCH_DICTIONARY:    /* -dictionary */
|                dataType = DICTIONARY;
|                break;
|            case LSEARCH_EXACT:         /* -exact */
|                mode = EXACT;
|                break;
|            case LSEARCH_INCREASING:    /* -increasing */
|                isIncreasing = 1;
|                break;
|            case LSEARCH_INTEGER:       /* -integer */
|                dataType = INTEGER;
|                break;
|            case LSEARCH_GLOB:          /* -glob */
|                mode = GLOB;
|                break;
|            case LSEARCH_REAL:          /* -real */
|                dataType = REAL;
|                break;
|            case LSEARCH_REGEXP:        /* -regexp */
|                mode = REGEXP;
|                break;
|            case LSEARCH_SORTED:        /* -sorted */
|                mode = SORTED;
|                break;
|        }
|    }

|     
|
|    /*
|     * -start option processing:
|     * Ensure we get a unique copy of command line arg for start index
|     */
|    if (useStart > 0) {
|        startPtr = Tcl_DuplicateObj(objv[useStart]);
|    }

|
|    /*
|     * Make sure the list argument is a list object and get its length and
|     * a pointer to its array of element pointers.
|     */
|    result = Tcl_ListObjGetInline(interp, objv[objc - 2], &listc, &listv);
|    if (result != TCL_OK) {
|        return result;
|    }

|    /*
|     * Retrieve user specified start offset.
|     */
|    if (useStart > 0) {
|        result = TclGetIntForIndex(interp, startPtr, /*end*/ listc-1, &offset);
|        Tcl_DecrRefCount(startPtr); /* free unneeded obj */
|
|        if (result != TCL_OK) {
|           return result;
|        } else if (offset < 0) {
|           offset = 0;
|        }

|    } else {
|       offset = 0;
|    }

|
|    /*
|     * Process the pattern
|     */
|    patObj = objv[objc - 1];
|    patternBytes = NULL;
|    if ((enum modes) mode == EXACT || (enum modes) mode == SORTED) {
|        switch ((enum datatypes) dataType) {
|            case ASCII:
|            case DICTIONARY:
|                patternBytes = Tcl_GetStringFromObj(patObj, &length);
|                break;
|            case INTEGER:
|                result = Tcl_GetIntFromObj(interp, patObj, &patInt);
|                if (result != TCL_OK) {
|                    return result;
|                }

|                break;
|            case REAL:
|                result = Tcl_GetDoubleFromObj(interp, patObj, &patDouble);
|                if (result != TCL_OK) {
|                    return result;
|                }

|                break;
|        }

|    } else {
|        patternBytes = Tcl_GetStringFromObj(patObj, &length);
|    }

|
|    /*
|     * Set default index value to -1, indicating failure; if we find the
|     * item in the course of our search, index will be set to the correct
|     * value.
|     */
|    index = -1;
|    match = 0;
|    if ((enum modes) mode == SORTED && allData == FALSE) {
|        /*
|         * If the data is sorted, we can do a more intelligent search.
|         * Note that there is no point in being smart when -all was
|         * specified; in that case, we have to look at all items anyway.
|         */
|        lower = offset-1 /*-1*/;
|        upper = listc;
|        while (lower + 1 != upper) {
|            i = (lower + upper)/2;
|            switch ((enum datatypes) dataType) {
|                case ASCII: {
|                    bytes = Tcl_GetString(listv[i]);
|                    match = strcmp(patternBytes, bytes);
|                    break;
|                }

|                case DICTIONARY: {
|                    bytes = Tcl_GetString(listv[i]);
|                    match = DictionaryCompare(patternBytes, bytes);
|                    break;
|                }

|                case INTEGER: {
|                    result = Tcl_GetIntFromObj(interp, listv[i], &objInt);
|                    if (result != TCL_OK) {
|                        return result;
|                    }

|                    if (patInt == objInt) {
|                        match = 0;
|                    } else if (patInt < objInt) {
|                        match = -1;
|                    } else {
|                        match = 1;
|                    }

|                    break;
|                }

|                case REAL: {
|                    result = Tcl_GetDoubleFromObj(interp, listv[i],
|                            &objDouble);
|                    if (result != TCL_OK) {
|                        return result;
|                    }

|                    if (patDouble == objDouble) {
|                        match = 0;
|                    } else if (patDouble < objDouble) {
|                        match = -1;
|                    } else {
|                        match = 1;
|                    }

|                    break;
|                }
|            }

|            if (match == 0) {
|                /*
|                 * Normally, binary search is written to stop when it
|                 * finds a match.  If there are duplicates of an element in
|                 * the list, our first match might not be the first occurance.
|                 * Consider:  0 0 0 1 1 1 2 2 2
|                 * To maintain consistancy with standard lsearch semantics,
|                 * we must find the leftmost occurance of the pattern in the
|                 * list.  Thus we don't just stop searching here.  This
|                 * variation means that a search always makes log n
|                 * comparisons (normal binary search might "get lucky" with
|                 * an early comparison).
|                 */
|                index = i;
|                upper = i;
|            } else if (match > 0) {
|                if (isIncreasing) {
|                    lower = i;
|                } else {
|                    upper = i;
|                }

|            } else {
|                if (isIncreasing) {
|                    upper = i;
|                } else {
|                    lower = i;
|                }
|            }
|        }

|    } else {
|        for (i = offset; i < listc; i++) {
|            match = 0;
|            switch ((enum modes) mode) {
|                case SORTED:
|                case EXACT: {
|                    switch ((enum datatypes) dataType) {
|                        case ASCII: {
|                            bytes = Tcl_GetStringFromObj(listv[i], &elemLen);
|                            if (length == elemLen) {
|                                match = (memcmp(bytes, patternBytes,
|                                        (size_t) length) == 0);
|                            }

|                            break;
|                        }

|                        case DICTIONARY: {
|                            bytes = Tcl_GetString(listv[i]);
|                            match =
|                                (DictionaryCompare(bytes, patternBytes) == 0);
|                            break;
|                        }

|                        case INTEGER: {
|                            result = Tcl_GetIntFromObj(interp, listv[i],
|                                    &objInt);
|                            if (result != TCL_OK) {
|                                return result;
|                            }

|                            match = (objInt == patInt);
|                            break;
|                        }
|                        case REAL: {
|                            result = Tcl_GetDoubleFromObj(interp, listv[i],
|                                    &objDouble);
|                            if (result != TCL_OK) {
|                                return result;
|                            }
|                            match = (objDouble == patDouble);
|                            break;
|                        }
|                    }
|                    break;
|                }
|                case GLOB: {
|                    match = Tcl_StringMatch(Tcl_GetString(listv[i]),
|                            patternBytes);
|                    break;
|                }
|                case REGEXP: {
|                    match = Tcl_RegExpMatchObj(interp, listv[i], patObj);
|                    if (match < 0) {
|                        return TCL_ERROR;

|                    }
|                    break;
|                }
|            }
|            /* Invert match condition for -not */

|            if (notMatch) {
|                match = (match != 0 ? 0 : 1);
|            }
|
|            /* Process the possible match for this element */
|            if (match != 0) {
|                if (allData) {
|                    if (returnInline) {
|                        /* Append data */
|                        Tcl_ListObjAppendElement(interp, listPtr,listv[i]);
|                    } else {
|                        /* Append index */
|                        Tcl_ListObjAppendElement(interp, listPtr,Tcl_NewIntObj(i));
|                    }
|                } else {
|                    index = i;

|                    break;
|                }
|            }
|        }
|    }
|    /*
|     * Return either a list (-all) or a single element
|     */
|    if (allData) {
|        Tcl_SetObjResult(interp,listPtr);

|    } else {
|        if (returnInline) {
|            if (index < 0) { /* Return a null */
|                Tcl_SetObjResult(interp,Tcl_NewObj());
|            } else {         /* Return one datum */
|                Tcl_SetObjResult(interp,listv[index]);
|            }

|        } else {
|            Tcl_SetIntObj(Tcl_GetObjResult(interp), index);
|        }
|    }

|    return TCL_OK;
|}

~ Notes

The changes to ''lsearch'' are entirely backward compatible and do no
change the behaviour or performance of the command for existing
options.  Moreover, these changes should not impact any of the other
list changes in [22], [33] or [45].

~ Copyright

This document has been placed in the public domain.

~ Appendix

The benchmarks below denote the expected speed increase of using the
new options vs. tcl only implementations.  Your mileage may vary.

|##
|# performs a lsearch -all -inline -glob search
|#

|proc lsearch_dataGLOB {listData pattern} {
|    set result [list]
|    foreach item $listData {
|        if {[string match $pattern $item]} {
|            lappend result $item
|        }
|    }

|    return $result
|}

|##
|# performs a lsearch -all -inline -regexp search
|#

|proc lsearch_dataRE {listData pattern} {
|    set result [list]
|    foreach item $listData {
|        if {[regexp $pattern $item]} {
|            lappend result $item
|        }
|    }

|    return $result
|}

|##
|# performs a lsearch -all -glob search
|#

|proc lsearch_allGLOB {listData pattern} {
|    set result [list]
|    set count 0
|    foreach item $listData {
|        if {[string match $pattern $item]} {
|            lappend result $count
|        }

|        incr count
|    }

|    return $result
|}

|
|# Build a 2K list of data
|catch {unset LIST}
|time {lappend LIST someStuff} 1000
|time {lappend LIST otherStuff} 1000
|
|# Case with all data matching in a 2K list 2.8x speedup
|puts "#C implementation [time {lsearch -glob -all -inline $LIST *Stuff} 100]"
|#=> C implementation 3766 microseconds per iteration
|puts "#tcl implementation [time {lsearch_dataGLOB $LIST *Stuff} 100]"
|#=> tcl implementation 10815 microseconds per iteration
|
|# Case with all data matching but returning indicies 3X speed up
|puts "#C implementation [time {lsearch -glob -all $LIST *Stuff} 100]"
|#=> C implementation 4305 microseconds per iteration
|puts "#tcl implementation [time {lsearch_allGLOB $LIST *Stuff} 100]"
|#=> tcl implementation 13277 microseconds per iteration
|
|# Case with no matching data 8X speed up
|puts "#C implementation [time {lsearch -glob -all -inline $LIST none*} 100]"
|#=> C implementation 646 microseconds per iteration
|puts "#tcl implementation [time {lsearch_dataGLOB $LIST none*} 100]"
|#=> tcl implementation 5354 microseconds per iteration
|
|
|# Repeat with RE, note more time spent in RE engine 2X speedup
|puts "#C implementation [time {lsearch -regexp -all -inline $LIST Stuff} 100]"
|#=> C implementation 35260 microseconds per iteration
|puts "#tcl implementation [time {lsearch_dataRE $LIST Stuff} 100]"
|#=> tcl implementation 62292 microseconds per iteration
|
|# Case with no matching data 2X speedup
|puts "#C implementation [time {lsearch -regexp -all -inline $LIST none*} 100]"
|#=> C implementation 14815 microseconds per iteration
|puts "#tcl implementation [time {lsearch_dataRE $LIST none} 100]"
|#=> tcl implementation 30553 microseconds per iteration

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|
|

|

|
|
|

|

|
|
|

|
|
<
|
>
|

|
|

|

|

|

|

|

|
|
<
>
|
<
>
|
|
<
>
|
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
<
>
>
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
<
>
|
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
<
<
>
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
<
<
<
>
>
>
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
<
>
|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
<
<
<
<
<
<
<
<
<
<
<
<
|
<
<
<
<
<
<
<
<
<
<
>
|
|
<
<
<
>
|
<
<
<
<
<
<
|
<
<
<
<
<
<
<
<
>
>
|
<
<
<
<
<
<
<
<
<
>
>
>
>
|
<
<
|
<
<
<
>
>
>
|
|
<
<
>
|
<
>
>
|
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
|

|

|

|

|

|
|
<
>
|
|
|
|
|
<
<
>
>
|
<
>
|
|
<
>
|
|
|
|
|
<
<
>
>
|
<
>
|
|
<
>
|
|
|
|
|
|
<
>
|
<
>
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57

58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78

79
80

81
82
83

84
85
86

87
88
89

90
91
92
93
94
95
96
97
98
99

100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134

135
136
137
138
139

140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159

160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188

189
190
191
192
193
194
195
196
197
198

199
200
201
202
203
204
205
206
207

208
209
210
211
212
213
214
215
216
217
218
219

220
221
222

223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239

240
241
242
243
244
245

246
247

248
249
250

251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274

275
276
277
278
279

280
281
282
283
284

285
286
287
288
289
290
291

292
293

294
295
296
297
298
299

300
301
302
303
304
305
306

307
308

309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330

331
332
333
334
335
336

337
338
339
340
341
342
343
344
345
346
347
348
349
350
351

352
353

354
355
356
357
358
359

360
361
362
363
364
365

366
367

368

369
370
371

372
373

374

375
376
377

378
379
380
381
382

383

384
385
386
387
388

389
390

391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453

454
455
456
457
458
459

460
461
462

463
464
465

466
467
468
469
470
471

472
473
474

475
476
477

478
479
480
481
482
483
484

485
486

487
488

489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526

# TIP 80: Additional Options for 'lsearch'

	Author:         Tom Wilkason <[email protected]>
	Author:         Tom Wilkason <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        02-Jan-2002
	Post-History:   
	Discussions-To: news:comp.lang.tcl
	Tcl-Version:    8.4
-----

# Abstract

This TIP proposes additional options for the _lsearch_ command to
return and work with all matching items in the return rather than the
first matching item. Additional options are also added.

# Rationale

The _lsearch_ function works well for finding the first item in a
list that matches a pattern.  However it is often useful to find all
of the items in the list that match a pattern.  This TIP proposes
adding options to return the entire list of matches.  With this
capability, additional options are proposed to return the data rather
than the indices \(since you often want to work the the data anyway\),
and to add an option to return the logical exclusion of the matching
items \(i.e. those that don't match the search pattern\).

# Specification

I propose the following options be added to _lsearch_:

Option: _-start index_

 > Initiates the list search starting at _index_, which can be
   any valid list index \(such as 0 , end , end-1 ...\) 

Option: _-all_

 > Returns a list of all indices that match the search condition
   \(rather than the first one\).  The indices are returned low to high
   order.  For a no match condition, a \{\} \(empty list\) is returned.
   If the the _-all_ or _-inline_ switches are not specified, a
   -1 is returned for a no match condition just as is done now.

Option: _-inline_

 > Returns a single item \(or a list of items for _-all_\) of the data
   that matches the search condition rather than the index \(or
   indices\).  An empty result or empty list \(_-all_\) is returned for
   a no match condition.  The data is returned in original list order.
   This option is useful when you want to iterate over the returned
   data anyway.  e.g.

	    foreach item [lsearch -all -inline -glob $someList *stuff] {
	       # deal with item

	    }

Option: _-not_

 > Negates the sense of the search condition \(i.e. what doesn't
   match\).  When used with the _-inline_ or _-all_ options, the
   return set will be the items that do match.  If all items match
   then a \{\} is returned.  Without the _-all_ option, the first item
   in the list that does not match will be returned.

These can be combined as needed and yield some powerful capabilities
when iterating over sub-lists \(esp. with the new _lset_ command\).

# Reference Implementation

Changes to the _Tcl\_LsearchObjCmd_ command in _generic/tclCmdIL.c_
are needed along with documentation and test code.  The changes to the
8.4 head version of _tclCmdIL.c_ are available below.

	/*
	 *----------------------------------------------------------------------

	 *
	 * Tcl_LsearchObjCmd --

	 *
	 *      This procedure is invoked to process the "lsearch" Tcl command.
	 *      See the user documentation for details on what it does.

	 *
	 * Results:
	 *      A standard Tcl result.

	 *
	 * Side effects:
	 *      See the user documentation.

	 *
	 *----------------------------------------------------------------------
	 */

	int
	Tcl_LsearchObjCmd(clientData, interp, objc, objv)
	    ClientData clientData;      /* Not used. */
	    Tcl_Interp *interp;         /* Current interpreter. */
	    int objc;                   /* Number of arguments. */
	    Tcl_Obj *CONST objv[];      /* Argument values. */

	{
	    char *bytes, *patternBytes;
	    int i, match, mode, index, result, listc, length, elemLen;
	    int useStart=-1, offset, allData=0, returnInline=0;
	    int dataType, isIncreasing, lower, upper, patInt, objInt, notMatch=0;
	    double patDouble, objDouble;
	    Tcl_Obj *patObj, **listv, *listPtr, *startPtr = NULL;
	    static CONST char *options[] = {
	        "-all", "-ascii", "-decreasing", "-dictionary",
	        "-exact", "-glob", "-increasing", "-inline",
	        "-integer", "-not", "-real", "-regexp",
	        "-sorted", "-start", NULL
	    };
	    enum options {
	        LSEARCH_ALL, LSEARCH_ASCII, LSEARCH_DECREASING, LSEARCH_DICTIONARY,
	        LSEARCH_EXACT, LSEARCH_GLOB, LSEARCH_INCREASING, LSEARCH_INLINE,
	        LSEARCH_INTEGER, LSEARCH_NOT, LSEARCH_REAL, LSEARCH_REGEXP,
	        LSEARCH_SORTED, LSEARCH_START
	    };

	    enum datatypes {
	        ASCII, DICTIONARY, INTEGER, REAL
	    };

	    enum modes {
	        EXACT, GLOB, REGEXP, SORTED
	    };

	    mode = GLOB;
	    dataType = ASCII;
	    isIncreasing = 1;
	    /* Note: This counts options as possible list|patterns */
	    if (objc < 3) {
	        Tcl_WrongNumArgs(interp, 1, objv, "?options? list pattern");
	        return TCL_ERROR;

	    }
	    for (i = 1; i < objc-2; i++) {
	        if (Tcl_GetIndexFromObj(interp, objv[i], options, "option", 0, &index)
	                != TCL_OK) {
	            return TCL_ERROR;

	        }
	        switch ((enum options) index) {
	            case LSEARCH_ASCII:         /* -ascii */
	                dataType = ASCII;
	                break;
	            case LSEARCH_NOT:           /* -not */
	                notMatch = 1;
	                break;
	            case LSEARCH_ALL:           /* -all */
	                allData = 1;
	                listPtr = Tcl_NewListObj(0, (Tcl_Obj **) NULL);
	                break;
	            case LSEARCH_INLINE:        /* -inline */
	                returnInline = 1;
	                break;
	            case LSEARCH_START:         /* -start index */
	                useStart = ++i;         /* Use next arg as offset index */
	                if (objc-i < 2) {
	                    Tcl_SetResult(interp,
	                            "missing argument to -start option", TCL_STATIC);

	                }
	                break;
	            case LSEARCH_DECREASING:    /* -decreasing */
	                isIncreasing = 0;
	                break;
	            case LSEARCH_DICTIONARY:    /* -dictionary */
	                dataType = DICTIONARY;
	                break;
	            case LSEARCH_EXACT:         /* -exact */
	                mode = EXACT;
	                break;
	            case LSEARCH_INCREASING:    /* -increasing */
	                isIncreasing = 1;
	                break;
	            case LSEARCH_INTEGER:       /* -integer */
	                dataType = INTEGER;
	                break;
	            case LSEARCH_GLOB:          /* -glob */
	                mode = GLOB;
	                break;
	            case LSEARCH_REAL:          /* -real */
	                dataType = REAL;
	                break;
	            case LSEARCH_REGEXP:        /* -regexp */
	                mode = REGEXP;
	                break;
	            case LSEARCH_SORTED:        /* -sorted */
	                mode = SORTED;
	                break;

	        }
	    }

	    /*
	     * -start option processing:
	     * Ensure we get a unique copy of command line arg for start index
	     */
	    if (useStart > 0) {
	        startPtr = Tcl_DuplicateObj(objv[useStart]);

	    }

	    /*
	     * Make sure the list argument is a list object and get its length and
	     * a pointer to its array of element pointers.
	     */
	    result = Tcl_ListObjGetInline(interp, objv[objc - 2], &listc, &listv);
	    if (result != TCL_OK) {
	        return result;

	    }
	    /*
	     * Retrieve user specified start offset.
	     */
	    if (useStart > 0) {
	        result = TclGetIntForIndex(interp, startPtr, /*end*/ listc-1, &offset);
	        Tcl_DecrRefCount(startPtr); /* free unneeded obj */

	        if (result != TCL_OK) {
	           return result;
	        } else if (offset < 0) {
	           offset = 0;

	        }
	    } else {
	       offset = 0;

	    }

	    /*
	     * Process the pattern
	     */
	    patObj = objv[objc - 1];
	    patternBytes = NULL;
	    if ((enum modes) mode == EXACT || (enum modes) mode == SORTED) {
	        switch ((enum datatypes) dataType) {
	            case ASCII:
	            case DICTIONARY:
	                patternBytes = Tcl_GetStringFromObj(patObj, &length);
	                break;
	            case INTEGER:
	                result = Tcl_GetIntFromObj(interp, patObj, &patInt);
	                if (result != TCL_OK) {
	                    return result;

	                }
	                break;
	            case REAL:
	                result = Tcl_GetDoubleFromObj(interp, patObj, &patDouble);
	                if (result != TCL_OK) {
	                    return result;

	                }
	                break;

	        }
	    } else {
	        patternBytes = Tcl_GetStringFromObj(patObj, &length);

	    }

	    /*
	     * Set default index value to -1, indicating failure; if we find the
	     * item in the course of our search, index will be set to the correct
	     * value.
	     */
	    index = -1;
	    match = 0;
	    if ((enum modes) mode == SORTED && allData == FALSE) {
	        /*
	         * If the data is sorted, we can do a more intelligent search.
	         * Note that there is no point in being smart when -all was
	         * specified; in that case, we have to look at all items anyway.
	         */
	        lower = offset-1 /*-1*/;
	        upper = listc;
	        while (lower + 1 != upper) {
	            i = (lower + upper)/2;
	            switch ((enum datatypes) dataType) {
	                case ASCII: {
	                    bytes = Tcl_GetString(listv[i]);
	                    match = strcmp(patternBytes, bytes);
	                    break;

	                }
	                case DICTIONARY: {
	                    bytes = Tcl_GetString(listv[i]);
	                    match = DictionaryCompare(patternBytes, bytes);
	                    break;

	                }
	                case INTEGER: {
	                    result = Tcl_GetIntFromObj(interp, listv[i], &objInt);
	                    if (result != TCL_OK) {
	                        return result;

	                    }
	                    if (patInt == objInt) {
	                        match = 0;
	                    } else if (patInt < objInt) {
	                        match = -1;
	                    } else {
	                        match = 1;

	                    }
	                    break;

	                }
	                case REAL: {
	                    result = Tcl_GetDoubleFromObj(interp, listv[i],
	                            &objDouble);
	                    if (result != TCL_OK) {
	                        return result;

	                    }
	                    if (patDouble == objDouble) {
	                        match = 0;
	                    } else if (patDouble < objDouble) {
	                        match = -1;
	                    } else {
	                        match = 1;

	                    }
	                    break;

	                }
	            }
	            if (match == 0) {
	                /*
	                 * Normally, binary search is written to stop when it
	                 * finds a match.  If there are duplicates of an element in
	                 * the list, our first match might not be the first occurance.
	                 * Consider:  0 0 0 1 1 1 2 2 2
	                 * To maintain consistancy with standard lsearch semantics,
	                 * we must find the leftmost occurance of the pattern in the
	                 * list.  Thus we don't just stop searching here.  This
	                 * variation means that a search always makes log n
	                 * comparisons (normal binary search might "get lucky" with
	                 * an early comparison).
	                 */
	                index = i;
	                upper = i;
	            } else if (match > 0) {
	                if (isIncreasing) {
	                    lower = i;
	                } else {
	                    upper = i;

	                }
	            } else {
	                if (isIncreasing) {
	                    upper = i;
	                } else {
	                    lower = i;

	                }
	            }
	        }
	    } else {
	        for (i = offset; i < listc; i++) {
	            match = 0;
	            switch ((enum modes) mode) {
	                case SORTED:
	                case EXACT: {
	                    switch ((enum datatypes) dataType) {
	                        case ASCII: {
	                            bytes = Tcl_GetStringFromObj(listv[i], &elemLen);
	                            if (length == elemLen) {
	                                match = (memcmp(bytes, patternBytes,
	                                        (size_t) length) == 0);

	                            }
	                            break;

	                        }
	                        case DICTIONARY: {
	                            bytes = Tcl_GetString(listv[i]);
	                            match =
	                                (DictionaryCompare(bytes, patternBytes) == 0);
	                            break;

	                        }
	                        case INTEGER: {
	                            result = Tcl_GetIntFromObj(interp, listv[i],
	                                    &objInt);
	                            if (result != TCL_OK) {
	                                return result;

	                            }
	                            match = (objInt == patInt);

	                            break;

	                        }
	                        case REAL: {
	                            result = Tcl_GetDoubleFromObj(interp, listv[i],

	                                    &objDouble);
	                            if (result != TCL_OK) {

	                                return result;

	                            }
	                            match = (objDouble == patDouble);
	                            break;

	                        }
	                    }
	                    break;
	                }
	                case GLOB: {

	                    match = Tcl_StringMatch(Tcl_GetString(listv[i]),

	                            patternBytes);
	                    break;
	                }
	                case REGEXP: {
	                    match = Tcl_RegExpMatchObj(interp, listv[i], patObj);

	                    if (match < 0) {
	                        return TCL_ERROR;

	                    }
	                    break;
	                }
	            }
	            /* Invert match condition for -not */
	            if (notMatch) {
	                match = (match != 0 ? 0 : 1);
	            }

	            /* Process the possible match for this element */
	            if (match != 0) {
	                if (allData) {
	                    if (returnInline) {
	                        /* Append data */
	                        Tcl_ListObjAppendElement(interp, listPtr,listv[i]);
	                    } else {
	                        /* Append index */
	                        Tcl_ListObjAppendElement(interp, listPtr,Tcl_NewIntObj(i));
	                    }
	                } else {
	                    index = i;
	                    break;
	                }
	            }
	        }
	    }
	    /*
	     * Return either a list (-all) or a single element
	     */
	    if (allData) {
	        Tcl_SetObjResult(interp,listPtr);
	    } else {
	        if (returnInline) {
	            if (index < 0) { /* Return a null */
	                Tcl_SetObjResult(interp,Tcl_NewObj());
	            } else {         /* Return one datum */
	                Tcl_SetObjResult(interp,listv[index]);
	            }
	        } else {
	            Tcl_SetIntObj(Tcl_GetObjResult(interp), index);
	        }
	    }
	    return TCL_OK;
	}

# Notes

The changes to _lsearch_ are entirely backward compatible and do no
change the behaviour or performance of the command for existing
options.  Moreover, these changes should not impact any of the other
list changes in [[22]](22.md), [[33]](33.md) or [[45]](45.md).

# Copyright

This document has been placed in the public domain.

# Appendix

The benchmarks below denote the expected speed increase of using the
new options vs. tcl only implementations.  Your mileage may vary.

	##
	# performs a lsearch -all -inline -glob search

	#
	proc lsearch_dataGLOB {listData pattern} {
	    set result [list]
	    foreach item $listData {
	        if {[string match $pattern $item]} {
	            lappend result $item

	        }
	    }
	    return $result

	}
	##
	# performs a lsearch -all -inline -regexp search

	#
	proc lsearch_dataRE {listData pattern} {
	    set result [list]
	    foreach item $listData {
	        if {[regexp $pattern $item]} {
	            lappend result $item

	        }
	    }
	    return $result

	}
	##
	# performs a lsearch -all -glob search

	#
	proc lsearch_allGLOB {listData pattern} {
	    set result [list]
	    set count 0
	    foreach item $listData {
	        if {[string match $pattern $item]} {
	            lappend result $count

	        }
	        incr count

	    }
	    return $result

	}

	# Build a 2K list of data
	catch {unset LIST}
	time {lappend LIST someStuff} 1000
	time {lappend LIST otherStuff} 1000

	# Case with all data matching in a 2K list 2.8x speedup
	puts "#C implementation [time {lsearch -glob -all -inline $LIST *Stuff} 100]"
	#=> C implementation 3766 microseconds per iteration
	puts "#tcl implementation [time {lsearch_dataGLOB $LIST *Stuff} 100]"
	#=> tcl implementation 10815 microseconds per iteration

	# Case with all data matching but returning indicies 3X speed up
	puts "#C implementation [time {lsearch -glob -all $LIST *Stuff} 100]"
	#=> C implementation 4305 microseconds per iteration
	puts "#tcl implementation [time {lsearch_allGLOB $LIST *Stuff} 100]"
	#=> tcl implementation 13277 microseconds per iteration

	# Case with no matching data 8X speed up
	puts "#C implementation [time {lsearch -glob -all -inline $LIST none*} 100]"
	#=> C implementation 646 microseconds per iteration
	puts "#tcl implementation [time {lsearch_dataGLOB $LIST none*} 100]"
	#=> tcl implementation 5354 microseconds per iteration

	# Repeat with RE, note more time spent in RE engine 2X speedup
	puts "#C implementation [time {lsearch -regexp -all -inline $LIST Stuff} 100]"
	#=> C implementation 35260 microseconds per iteration
	puts "#tcl implementation [time {lsearch_dataRE $LIST Stuff} 100]"
	#=> tcl implementation 62292 microseconds per iteration

	# Case with no matching data 2X speedup
	puts "#C implementation [time {lsearch -regexp -all -inline $LIST none*} 100]"
	#=> C implementation 14815 microseconds per iteration
	puts "#tcl implementation [time {lsearch_dataRE $LIST none} 100]"
	#=> tcl implementation 30553 microseconds per iteration

Name change from tip/81.tip to tip/81.md.

1
2
3
4
5
6
7
8
9

10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
TIP:            81
Title:          [incr Tcl] Functional Areas for Maintainer Assignments
Version:        $Revision: 1.7 $
Author:         Donal K. Fellows <[email protected]>
State:          Withdrawn
Type:           Process
Vote:           Pending
Created:        07-Jan-2002
Post-History:   

~ Abstract

This document proposes a division of [[incr Tcl]]'s source code into
functional areas so that each area may be assigned to one or more
maintainers.

~ Background

In order for [[incr Tcl]] to be adopted by the Tcl Core Team (see
[50]), it must be managed by the processes already established for
handling source (see [0] and [16] for details and rationale.)

~ Functional Areas

[[incr Tcl]] shall be divided into the following 8 functional units
(the seventh is the obsolete parts of the source tree, and the eighth
is the shared part of the source tree), each to be assigned one or
more maintainers:

 1. ''Objects'' -
      generic/itcl_bicmds.c,
      generic/itcl_class.c,
      generic/itcl_methods.c,
      generic/itcl_objects.c,
      generic/itcl_parse.c,
      doc/body.n,
      doc/class.n,
      doc/configbody.n,
      tests/basic.test,
      tests/body.test,
      tests/info.test,
      tests/inherit.test,
      tests/interp.test,
      tests/methods.test,
      tests/protection.test,

 2. ''Other Commands'' -
      generic/itcl_cmds.c,
      generic/itcl_ensemble.c,
      doc/code.n,
      doc/delete.n,
      doc/ensemble.n,
      doc/find.n,
      doc/local.n,
      doc/scope.n,
      tests/chain.test,
      tests/delete.test,
      tests/ensemble.test,
      tests/import.test,
      tests/local.test,
      tests/namespace.test,
      tests/scope.test

 3. ''Mac Build+Support'' -
      mac/tclMacAppInit.c,
      mac/MW_ItclHeader.pch,
      mac/pkgIndex.tcl,
      mac/itclMacApplication.r,
      mac/itclMacLibrary.r,
      mac/itclMacResource.r,
      mac/itclMacTclCode.r,
      mac/itclStaticApplication.r

 4. ''Unix Build+Support'' -
      unix/tclAppInit.c,
      configure.in ''(to move to unix/configure.in)'',
      aclocal.m4  ''(to move to unix/aclocal.m4 and gain pieces the
                     poorly-named Itcl tcl.m4)'',
      Makefile.in ''(to move to unix/Makefile.in)'',
      itclConfig.sh.in ''(to move to unix/itclConfig.sh.in)'',
      pkgIndex.tcl.in ''(to move to unix/pkgIndex.tcl.in)''

 5. ''Windows Build+Support'' -
      win/dllEntryPoint.c,
      win/makefile.bc,
      win/makefile.vc,
      win/rc/itcl.rc

 6. ''Other'' -
      generic/itcl_util.c,
      generic/itcl_linkage.c,
      doc/itclsh.1,
      library/itcl.tcl,
      tests/mkindex.itcl,
      tests/mkindex.test,
      tests/tclIndex

~ Obsolete Files

These files are all obsolete in one way or another, and will be removed
as part of the migration process.  They are listed here for completeness
only.

 * generic/itcl_obsolete.c,
   generic/itcl_migrate.c,
   doc/man.macros,
   doc/itcl_class.n,
   doc/itcl_info.n
   tests/defs,
   tests/old/AAA.test,
   tests/old/Bar.tcl,
   tests/old/BarFoo.tcl,
   tests/old/Baz.tcl,
   tests/old/Foo.tcl,
   tests/old/FooBar.tcl,
<
|
<
|
|
|
|
|
|
>

|

|

|

|
|
|

|

|
|
|

|
|
|
|
|
|

|
|
|

|

|

|

|
|
|
|
|
|

|

|
|
|

|

|
|

|
|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114

# TIP 81: [incr Tcl] Functional Areas for Maintainer Assignments

	Author:         Donal K. Fellows <[email protected]>
	State:          Withdrawn
	Type:           Process
	Vote:           Pending
	Created:        07-Jan-2002
	Post-History:   
-----

# Abstract

This document proposes a division of [incr Tcl]'s source code into
functional areas so that each area may be assigned to one or more
maintainers.

# Background

In order for [incr Tcl] to be adopted by the Tcl Core Team \(see
[[50]](50.md)\), it must be managed by the processes already established for
handling source \(see [[0]](0.md) and [[16]](16.md) for details and rationale.\)

# Functional Areas

[incr Tcl] shall be divided into the following 8 functional units
\(the seventh is the obsolete parts of the source tree, and the eighth
is the shared part of the source tree\), each to be assigned one or
more maintainers:

 1. _Objects_ -
      generic/itcl\_bicmds.c,
      generic/itcl\_class.c,
      generic/itcl\_methods.c,
      generic/itcl\_objects.c,
      generic/itcl\_parse.c,
      doc/body.n,
      doc/class.n,
      doc/configbody.n,
      tests/basic.test,
      tests/body.test,
      tests/info.test,
      tests/inherit.test,
      tests/interp.test,
      tests/methods.test,
      tests/protection.test,

 2. _Other Commands_ -
      generic/itcl\_cmds.c,
      generic/itcl\_ensemble.c,
      doc/code.n,
      doc/delete.n,
      doc/ensemble.n,
      doc/find.n,
      doc/local.n,
      doc/scope.n,
      tests/chain.test,
      tests/delete.test,
      tests/ensemble.test,
      tests/import.test,
      tests/local.test,
      tests/namespace.test,
      tests/scope.test

 3. _Mac Build\+Support_ -
      mac/tclMacAppInit.c,
      mac/MW\_ItclHeader.pch,
      mac/pkgIndex.tcl,
      mac/itclMacApplication.r,
      mac/itclMacLibrary.r,
      mac/itclMacResource.r,
      mac/itclMacTclCode.r,
      mac/itclStaticApplication.r

 4. _Unix Build\+Support_ -
      unix/tclAppInit.c,
      configure.in _\(to move to unix/configure.in\)_,
      aclocal.m4  _\(to move to unix/aclocal.m4 and gain pieces the
                     poorly-named Itcl tcl.m4\)_,
      Makefile.in _\(to move to unix/Makefile.in\)_,
      itclConfig.sh.in _\(to move to unix/itclConfig.sh.in\)_,
      pkgIndex.tcl.in _\(to move to unix/pkgIndex.tcl.in\)_

 5. _Windows Build\+Support_ -
      win/dllEntryPoint.c,
      win/makefile.bc,
      win/makefile.vc,
      win/rc/itcl.rc

 6. _Other_ -
      generic/itcl\_util.c,
      generic/itcl\_linkage.c,
      doc/itclsh.1,
      library/itcl.tcl,
      tests/mkindex.itcl,
      tests/mkindex.test,
      tests/tclIndex

# Obsolete Files

These files are all obsolete in one way or another, and will be removed
as part of the migration process.  They are listed here for completeness
only.

 * generic/itcl\_obsolete.c,
   generic/itcl\_migrate.c,
   doc/man.macros,
   doc/itcl\_class.n,
   doc/itcl\_info.n
   tests/defs,
   tests/old/AAA.test,
   tests/old/Bar.tcl,
   tests/old/BarFoo.tcl,
   tests/old/Baz.tcl,
   tests/old/Foo.tcl,
   tests/old/FooBar.tcl,

︙ ︙ 
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164

   tests/old/toasters/Hazard.tcl,
   tests/old/toasters/Outlet.tcl,
   tests/old/toasters/SmartToaster.tcl,
   tests/old/toasters/Toaster.tcl,
   tests/old/toasters/tclIndex,
   tests/old/toasters/usualway.tcl

~ Shared Files

The following files are shared by all of [[incr Tcl]].  Any maintainer
may modify them as necessary to complete changes they are making to
their portion of [[incr Tcl]].  Some of the following files define
[[incr Tcl]]'s API and should be changed only in accordance with TCT
approval.

 * generic/itcl.h,
   generic/itclInt.h,
   generic/itcl.decls,
   generic/itclInt.decls,
   doc/itcl.n,
   doc/itclvars.n,
   tests/all,
   tests/all.tcl

~ Generated Files

The following files are generated, so they don't need maintainers.

 * generic/itclDecls.h,
   generic/itclIntDecls.h,
   generic/itclStubInit.c,
   generic/itclStubLib.c,
   configure ''(moves to unix/configure, trailing configure.in)''

~ Copyright

This document has been placed in the public domain.

|

|

|
|

|

|

|

>
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
   tests/old/toasters/Hazard.tcl,
   tests/old/toasters/Outlet.tcl,
   tests/old/toasters/SmartToaster.tcl,
   tests/old/toasters/Toaster.tcl,
   tests/old/toasters/tclIndex,
   tests/old/toasters/usualway.tcl

# Shared Files

The following files are shared by all of [incr Tcl].  Any maintainer
may modify them as necessary to complete changes they are making to
their portion of [incr Tcl].  Some of the following files define
[incr Tcl]'s API and should be changed only in accordance with TCT
approval.

 * generic/itcl.h,
   generic/itclInt.h,
   generic/itcl.decls,
   generic/itclInt.decls,
   doc/itcl.n,
   doc/itclvars.n,
   tests/all,
   tests/all.tcl

# Generated Files

The following files are generated, so they don't need maintainers.

 * generic/itclDecls.h,
   generic/itclIntDecls.h,
   generic/itclStubInit.c,
   generic/itclStubLib.c,
   configure _\(moves to unix/configure, trailing configure.in\)_

# Copyright

This document has been placed in the public domain.

Name change from tip/82.tip to tip/82.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84

TIP:		82
Title:		Add -offrelief Option to Checkbutton and Radiobutton
Version:	$Revision: 1.4 $
Author:		D. Richard Hipp <[email protected]>
State:		Final
Type:		Project
Vote:		Done
Created:	10-Jan-2002
Post-History:	
Tcl-Version:	8.4

~ Abstract

This TIP proposes adding option ''-offrelief'' to the checkbutton and
radiobutton widgets to specify the relief of the widget when
''-indicatoron'' is off and the state of the button is off.  This
feature is needed to support the use of checkbutton and radiobutton
widgets on toolbars.

~ Rationale

The checkbutton and radiobutton widgets both support the
''-overrelief'' option which is suppose to provide the capability to
change the relief of the widget on mouse-over.  The ''-overrelief''
option is not used by the underlying C code.  The value of
''-overrelief'' is used only by the script bindings to change the
''-relief'' option in response to ''<Enter>'' and ''<Leave>'' events.
But with the checkbutton and radiobutton widgets, the value of
''-relief'' is ignored when ''-indicatoron'' is turned off.  Hence,
''-overrelief'' has no effect when ''-indicatoron'' is off.

An example of the effect we would like to achieve is the
Bold/Italic/Underline and text justification toolbar buttons on word
processors.  The Bold/Italic/Underline toolbar buttons are most
naturally implemented using Tk checkbuttons and the text justification
toolbar buttons are most naturally implemented using Tk radiobuttons.
The buttons are configured to be flat most of the time (''-relief''
flat) but raise up on mouseover (''-overrelief'' raised).  Toolbar
buttons do not show indicators (''-indicatoron'' off).  This last
configuration option is the crux of the problem since when
''-indicatoron'' is off, the relief of the button is hard-coded to be
raised when the button is on and sunken when the button is off.  In
the current implementation, there is no way to get the off-relief to
be flat, and hence there is no way to achieve the customary look for
these common toolbar buttons.

~ Proposed Enhancement

This TIP proposes to modify the checkbutton and radiobutton widgets to
support a ''-offrelief'' option.  ''-offrelief'' will take any of the
usual relief values.  The default value will be ''raised''.  The
''-offrelief'' option determines the relief of the widget when
''-indicatoron'' option is off and the button itself is off.

The default bindings for checkbuttons and radiobuttons will also need
to be changed so that they copy the value of ''-overrelief'' into
''-offrelief'' instead of into ''-relief'' when the value of
''-indicatoron'' is false.

When ''-indicatoron'' is off and the button itself is on, the relief
continues to be hard-coded to sunken.  For symmetry, we might consider
adding another ''-onrelief'' option to cover this case.  But it is
difficult to imagine ever wanting to change the value of ''-onrelief''
so it has been omitted from this TIP.  If there as strong desire to
have ''-onrelief'', it can be added later.

~ Alternative Proposals

A simpler solution would be to change the ''-indicatoron'' option so
that it causes the off-relief to come from the ''-relief'' option
instead of using a hard-coded ''raised'' relief.  That approach is
conceptually simpler, but it breaks backwards compatibility and so
must be rejected.

Another possibility is to modify ''-indicatoron'' so that it takes a
third value (other than ''on'' or ''off'') where the third value works
like ''off'' but takes the off-relief from the ''-relief'' option
instead of always using ''raised''.  But this second idea seems more
contrived and makes it more difficult to define an alternative
on-relief value with a later modification.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|
|

|
|

|
|
|

|

|

|
|
|
|

|
|
|

|

|
|

|

|

|
|
|

|
|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84

# TIP 82: Add -offrelief Option to Checkbutton and Radiobutton

	Author:		D. Richard Hipp <[email protected]>
	State:		Final
	Type:		Project
	Vote:		Done
	Created:	10-Jan-2002
	Post-History:	
	Tcl-Version:	8.4
-----

# Abstract

This TIP proposes adding option _-offrelief_ to the checkbutton and
radiobutton widgets to specify the relief of the widget when
_-indicatoron_ is off and the state of the button is off.  This
feature is needed to support the use of checkbutton and radiobutton
widgets on toolbars.

# Rationale

The checkbutton and radiobutton widgets both support the
_-overrelief_ option which is suppose to provide the capability to
change the relief of the widget on mouse-over.  The _-overrelief_
option is not used by the underlying C code.  The value of
_-overrelief_ is used only by the script bindings to change the
_-relief_ option in response to _<Enter>_ and _<Leave>_ events.
But with the checkbutton and radiobutton widgets, the value of
_-relief_ is ignored when _-indicatoron_ is turned off.  Hence,
_-overrelief_ has no effect when _-indicatoron_ is off.

An example of the effect we would like to achieve is the
Bold/Italic/Underline and text justification toolbar buttons on word
processors.  The Bold/Italic/Underline toolbar buttons are most
naturally implemented using Tk checkbuttons and the text justification
toolbar buttons are most naturally implemented using Tk radiobuttons.
The buttons are configured to be flat most of the time \(_-relief_
flat\) but raise up on mouseover \(_-overrelief_ raised\).  Toolbar
buttons do not show indicators \(_-indicatoron_ off\).  This last
configuration option is the crux of the problem since when
_-indicatoron_ is off, the relief of the button is hard-coded to be
raised when the button is on and sunken when the button is off.  In
the current implementation, there is no way to get the off-relief to
be flat, and hence there is no way to achieve the customary look for
these common toolbar buttons.

# Proposed Enhancement

This TIP proposes to modify the checkbutton and radiobutton widgets to
support a _-offrelief_ option.  _-offrelief_ will take any of the
usual relief values.  The default value will be _raised_.  The
_-offrelief_ option determines the relief of the widget when
_-indicatoron_ option is off and the button itself is off.

The default bindings for checkbuttons and radiobuttons will also need
to be changed so that they copy the value of _-overrelief_ into
_-offrelief_ instead of into _-relief_ when the value of
_-indicatoron_ is false.

When _-indicatoron_ is off and the button itself is on, the relief
continues to be hard-coded to sunken.  For symmetry, we might consider
adding another _-onrelief_ option to cover this case.  But it is
difficult to imagine ever wanting to change the value of _-onrelief_
so it has been omitted from this TIP.  If there as strong desire to
have _-onrelief_, it can be added later.

# Alternative Proposals

A simpler solution would be to change the _-indicatoron_ option so
that it causes the off-relief to come from the _-relief_ option
instead of using a hard-coded _raised_ relief.  That approach is
conceptually simpler, but it breaks backwards compatibility and so
must be rejected.

Another possibility is to modify _-indicatoron_ so that it takes a
third value \(other than _on_ or _off_\) where the third value works
like _off_ but takes the off-relief from the _-relief_ option
instead of always using _raised_.  But this second idea seems more
contrived and makes it more difficult to define an alternative
on-relief value with a later modification.

# Copyright

This document has been placed in the public domain.

Name change from tip/83.tip to tip/83.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69

70
71
72
73
74
75
76

77
78
79
80
81
82
83

84

85

86

87

88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106

107
108
109
110
111
112
113
114
115

116
117
118
119
120
121
122
123

124
125
126
127
128
129
130
131

132
133
134
135
136
137
138

139
140
141

142
143
144
145
146
147
148

149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164

165
166
167

168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188

189
190
191
192
193
194

195
196
197
198
199
200
201
202
203

TIP:            83
Title:          Augment Tcl_EvalFile with Tcl_EvalChannel and Tcl_EvalUrl
Version:        $Revision: 1.6 $
Author:         Marian Szczepkowski <[email protected]>
Author:         <[email protected]>
State:          Withdrawn
Type:           Project
Vote:           Pending
Created:        24-Jan-2002
Post-History:   
Tcl-Version:    8.5

~ Abstract

This TIP adds the ability to load Tcl files directly from URLs to the
core, together with a basic mechanism to simply evaluate a stream of
characters from a channel.

~ Proposal

I propose to split the ''Tcl_EvalFile'' function into two components
to enable the [[source]] command to use URL's to obtain source
material.

This will mean splitting ''Tcl_EvalFile'' into ''Tcl_EvalFile'' and
''Tcl_EvalChannel'' which are the two logical entities.  Maintaining
''Tcl_EvalFile'' will preserve backward compatability.

Creating ''Tcl_EvalChannel'' will provide generic functionality for
future use.

Adding ''Tcl_EvalUrl'' will enable handling standard URL format
strings.

This would enable this [[source http://anywhere.com/file.tcl]] to be
used.

Code will also need to be added to ''Tcl_SourceObjCmd'' to select
functionality requested.

~ Pro.

In a corporate environment where scripts are subject to change but the
interface is not, this allows scripts to be stored remotely on a
central server.

This also allows Tcl to interwork in a networked environment.

~ Con.

Security!!!!

This may mean in the long run adding a signing layer, but don't use it
if you don't want to.

~ Sample

I figure it looking something like this.
Snipped from 8.3 source.

~ Tcl_SourceObjCmd

|int
|Tcl_SourceObjCmd(dummy, interp, objc, objv)
|    ClientData dummy;		/* Not used. */
|    Tcl_Interp *interp;		/* Current interpreter. */
|    int objc;			/* Number of arguments. */
|    Tcl_Obj *CONST objv[];	/* Argument objects. */
|{

|    char *bytes;
|    int result;
|
|    if (objc != 2) {
|        Tcl_WrongNumArgs(interp, 1, objv, "fileName");
|        return TCL_ERROR;
|    }

|
|    bytes = Tcl_GetString(objv[1]);
|    if (strstr(ptr,"://")) {
|        result = Tcl_EvalFile(interp, bytes);
|    } else {
|        result = Tcl_EvalUrl(interp, bytes);
|    }

|    return result;

|}

~ Tcl_EvalFile

|int
|Tcl_EvalFile(interp, fileName)
|    Tcl_Interp *interp;         /* Interpreter in which to process file. */
|    char *fileName;             /* Name of file to process.  Tilde-substitution
|                                 * will be performed on this name. */
|{
|    int result, length;
|    struct stat statBuf;
|    Interp *iPtr;
|    Tcl_DString nameString;
|    char *name, *string;
|    Tcl_Channel chan;
|    Tcl_Obj *objPtr;
|
|    name = Tcl_TranslateFileName(interp, fileName, &nameString);
|    if (name == NULL) {
|        return TCL_ERROR;
|    }

|
|    result = TCL_ERROR;
|
|    if (TclStat(name, &statBuf) == -1) {
|        Tcl_SetErrno(errno);
|        Tcl_AppendResult(interp, "couldn't read file \"", fileName,
|                "\": ", Tcl_PosixError(interp), (char *) NULL);
|        goto end;
|    }

|
|    chan = Tcl_OpenFileChannel(interp, name, "r", 0644);
|    if (chan == (Tcl_Channel) NULL) {
|        Tcl_ResetResult(interp);
|        Tcl_AppendResult(interp, "couldn't read file \"", fileName,
|                "\": ", Tcl_PosixError(interp), (char *) NULL);
|        goto end;
|    }

|
|    result = Tcl_EvalChannel(interp, chan);
|
|  end:
|    Tcl_DStringFree(&nameString);
|    return result;
|}

~ Tcl_EvalUrl

|int
|Tcl_EvalUrl(interp, fileName)
|    Tcl_Interp *interp;	/* Interpreter in which to process file. */
|    char *fileName;	/* Name of URL to process. */
|{

|    return TCL_ERROR;
|}

~ Tcl_EvalChannel

|int
|Tcl_EvalChannel(interp, chan)
|    Tcl_Interp *interp; /* Interpreter in which to process file. */
|    Tcl_Channel chan;   /* Name of file to process. */
|{

|    int result, length;
|    struct stat statBuf;
|    char *oldScriptFile;
|    Interp *iPtr;
|    char *name, *string;
|    Tcl_Obj *objPtr;
|
|    result = TCL_ERROR;
|    objPtr = Tcl_NewObj();
|
|    if (Tcl_ReadChars(chan, objPtr, -1, 0) < 0) {
|        Tcl_Close(interp, chan);
|        Tcl_AppendResult(interp, "couldn't read file \"", fileName,
|                "\": ", Tcl_PosixError(interp), (char *) NULL);
|        goto end;
|    }

|    if (Tcl_Close(interp, chan) != TCL_OK) {
|        goto end;
|    }

|
|    iPtr = (Interp *) interp;
|    oldScriptFile = iPtr->scriptFile;
|    iPtr->scriptFile = fileName;
|    string = Tcl_GetStringFromObj(objPtr, &length);
|    result = Tcl_EvalEx(interp, string, length, 0);
|    iPtr->scriptFile = oldScriptFile;
|
|    if (result == TCL_RETURN) {
|        result = TclUpdateReturnInfo(iPtr);
|    } else if (result == TCL_ERROR) {
|        char msg[200 + TCL_INTEGER_SPACE];
|
|        /*
|         * Record information telling where the error occurred.
|         */
|
|        sprintf(msg, "\n    (file \"%.150s\" line %d)", fileName,
|                interp->errorLine);
|        Tcl_AddErrorInfo(interp, msg);
|    }

|
|  end:
|    Tcl_DecrRefCount(objPtr);
|    return result;
|}

~ Comments

The VFS extension interface of Tcl 8.4 plus the tclvfs and
vfs::http packages provide the ability to [[source]] an URL.
I believe that makes this proposal out of date.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|
|

|
|
|

|

|

|

|

|

|

|

|

|
|
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
>
|
>

>
|
>
>
>
|
<
<
<
<
<
<
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
<
|
>
|

|
|
|
|
<
>
|
<
|
>
|

|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
|
>
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67

68
69
70
71
72
73
74

75
76
77
78
79
80
81

82
83
84
85
86
87
88
89
90
91
92
93

94
95
96
97
98
99
100
101
102
103
104

105
106
107
108
109
110
111
112
113

114
115
116
117
118
119
120
121

122
123
124
125
126
127
128

129
130
131
132
133
134
135
136

137
138

139
140
141
142
143
144
145
146

147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162

163
164
165

166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186

187
188
189
190
191

192
193
194
195
196
197
198
199
200
201
202
203

# TIP 83: Augment Tcl_EvalFile with Tcl_EvalChannel and Tcl_EvalUrl

	Author:         Marian Szczepkowski <[email protected]>
	Author:         <[email protected]>
	State:          Withdrawn
	Type:           Project
	Vote:           Pending
	Created:        24-Jan-2002
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

This TIP adds the ability to load Tcl files directly from URLs to the
core, together with a basic mechanism to simply evaluate a stream of
characters from a channel.

# Proposal

I propose to split the _Tcl\_EvalFile_ function into two components
to enable the [source] command to use URL's to obtain source
material.

This will mean splitting _Tcl\_EvalFile_ into _Tcl\_EvalFile_ and
_Tcl\_EvalChannel_ which are the two logical entities.  Maintaining
_Tcl\_EvalFile_ will preserve backward compatability.

Creating _Tcl\_EvalChannel_ will provide generic functionality for
future use.

Adding _Tcl\_EvalUrl_ will enable handling standard URL format
strings.

This would enable this [source <http://anywhere.com/file.tcl]> to be
used.

Code will also need to be added to _Tcl\_SourceObjCmd_ to select
functionality requested.

# Pro.

In a corporate environment where scripts are subject to change but the
interface is not, this allows scripts to be stored remotely on a
central server.

This also allows Tcl to interwork in a networked environment.

# Con.

Security!!!!

This may mean in the long run adding a signing layer, but don't use it
if you don't want to.

# Sample

I figure it looking something like this.
Snipped from 8.3 source.

# Tcl\_SourceObjCmd

	int
	Tcl_SourceObjCmd(dummy, interp, objc, objv)
	    ClientData dummy;		/* Not used. */
	    Tcl_Interp *interp;		/* Current interpreter. */
	    int objc;			/* Number of arguments. */
	    Tcl_Obj *CONST objv[];	/* Argument objects. */

	{
	    char *bytes;
	    int result;

	    if (objc != 2) {
	        Tcl_WrongNumArgs(interp, 1, objv, "fileName");
	        return TCL_ERROR;

	    }

	    bytes = Tcl_GetString(objv[1]);
	    if (strstr(ptr,"://")) {
	        result = Tcl_EvalFile(interp, bytes);
	    } else {
	        result = Tcl_EvalUrl(interp, bytes);

	    }
	    return result;
	}

# Tcl\_EvalFile

	int
	Tcl_EvalFile(interp, fileName)
	    Tcl_Interp *interp;         /* Interpreter in which to process file. */
	    char *fileName;             /* Name of file to process.  Tilde-substitution
	                                 * will be performed on this name. */
	{

	    int result, length;
	    struct stat statBuf;
	    Interp *iPtr;
	    Tcl_DString nameString;
	    char *name, *string;
	    Tcl_Channel chan;
	    Tcl_Obj *objPtr;

	    name = Tcl_TranslateFileName(interp, fileName, &nameString);
	    if (name == NULL) {
	        return TCL_ERROR;

	    }

	    result = TCL_ERROR;

	    if (TclStat(name, &statBuf) == -1) {
	        Tcl_SetErrno(errno);
	        Tcl_AppendResult(interp, "couldn't read file \"", fileName,
	                "\": ", Tcl_PosixError(interp), (char *) NULL);
	        goto end;

	    }

	    chan = Tcl_OpenFileChannel(interp, name, "r", 0644);
	    if (chan == (Tcl_Channel) NULL) {
	        Tcl_ResetResult(interp);
	        Tcl_AppendResult(interp, "couldn't read file \"", fileName,
	                "\": ", Tcl_PosixError(interp), (char *) NULL);
	        goto end;

	    }

	    result = Tcl_EvalChannel(interp, chan);

	  end:
	    Tcl_DStringFree(&nameString);
	    return result;

	}

# Tcl\_EvalUrl

	int
	Tcl_EvalUrl(interp, fileName)
	    Tcl_Interp *interp;	/* Interpreter in which to process file. */
	    char *fileName;	/* Name of URL to process. */

	{
	    return TCL_ERROR;

	}

# Tcl\_EvalChannel

	int
	Tcl_EvalChannel(interp, chan)
	    Tcl_Interp *interp; /* Interpreter in which to process file. */
	    Tcl_Channel chan;   /* Name of file to process. */

	{
	    int result, length;
	    struct stat statBuf;
	    char *oldScriptFile;
	    Interp *iPtr;
	    char *name, *string;
	    Tcl_Obj *objPtr;

	    result = TCL_ERROR;
	    objPtr = Tcl_NewObj();

	    if (Tcl_ReadChars(chan, objPtr, -1, 0) < 0) {
	        Tcl_Close(interp, chan);
	        Tcl_AppendResult(interp, "couldn't read file \"", fileName,
	                "\": ", Tcl_PosixError(interp), (char *) NULL);
	        goto end;

	    }
	    if (Tcl_Close(interp, chan) != TCL_OK) {
	        goto end;

	    }

	    iPtr = (Interp *) interp;
	    oldScriptFile = iPtr->scriptFile;
	    iPtr->scriptFile = fileName;
	    string = Tcl_GetStringFromObj(objPtr, &length);
	    result = Tcl_EvalEx(interp, string, length, 0);
	    iPtr->scriptFile = oldScriptFile;

	    if (result == TCL_RETURN) {
	        result = TclUpdateReturnInfo(iPtr);
	    } else if (result == TCL_ERROR) {
	        char msg[200 + TCL_INTEGER_SPACE];

	        /*
	         * Record information telling where the error occurred.
	         */

	        sprintf(msg, "\n    (file \"%.150s\" line %d)", fileName,
	                interp->errorLine);
	        Tcl_AddErrorInfo(interp, msg);

	    }

	  end:
	    Tcl_DecrRefCount(objPtr);
	    return result;

	}

# Comments

The VFS extension interface of Tcl 8.4 plus the tclvfs and
vfs::http packages provide the ability to [source] an URL.
I believe that makes this proposal out of date.

# Copyright

This document has been placed in the public domain.

Name change from tip/84.tip to tip/84.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43

TIP:            84
Title:          Add control for mouse movement filtering
Version:        $Revision: 1.5 $
Author:         Jyrki Alakuijala <[email protected]>
Author:         Jeff Hobbs <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        26-Feb-2002
Post-History:   
Tcl-Version:    8.4

~ Abstract

When the mouse is moved, the Tcl/Tk system eats most of the mouse
movement events and only the last movement event when Tcl/Tk is not
busy is stored in the event queue.  I would like to obtain all the
movement events from the X-server or the Windows UI.

~ Rationale

I have an artistic drawing program where I need to track mouse as
accurately as possible.  At the moment I poll the ''XQueryPointer()''
in the busy loops to create (pseudo)events in the C-side of the code
to compensate for the missing events, but (of course) this does not
work in Windows.

I would like to have an option for the widget system or for the window
control so that a window (or, alternatively all the windows) could
receive all the movement events instead of only the last buffered one.

This has been a problem for me since 1995 and has - at many times -
caused me to consider changing the widget system.

~ Implementation

|    int Tk_CollapseMotionEvents(Display *display, int collapse)

A reference implementation is SF Tk patch 564642, which adds a flag to the TkDisplay that specifies whether motions events should be collapsed or not.  The default is the current behavior of collapsing these events.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|
|
|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43

# TIP 84: Add control for mouse movement filtering

	Author:         Jyrki Alakuijala <[email protected]>
	Author:         Jeff Hobbs <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        26-Feb-2002
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

When the mouse is moved, the Tcl/Tk system eats most of the mouse
movement events and only the last movement event when Tcl/Tk is not
busy is stored in the event queue.  I would like to obtain all the
movement events from the X-server or the Windows UI.

# Rationale

I have an artistic drawing program where I need to track mouse as
accurately as possible.  At the moment I poll the _XQueryPointer\(\)_
in the busy loops to create \(pseudo\)events in the C-side of the code
to compensate for the missing events, but \(of course\) this does not
work in Windows.

I would like to have an option for the widget system or for the window
control so that a window \(or, alternatively all the windows\) could
receive all the movement events instead of only the last buffered one.

This has been a problem for me since 1995 and has - at many times -
caused me to consider changing the widget system.

# Implementation

	    int Tk_CollapseMotionEvents(Display *display, int collapse)

A reference implementation is SF Tk patch 564642, which adds a flag to the TkDisplay that specifies whether motions events should be collapsed or not.  The default is the current behavior of collapsing these events.

# Copyright

This document has been placed in the public domain.

Name change from tip/85.tip to tip/85.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59

60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107

108
109
110

111
112
113
114
115

116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137

138
139

140
141
142
143

144
145
146
147

148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169

170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211

TIP:            85
Title:          Custom Comparisons in Tcltest
Version:        $Revision: 1.14 $
Author:         Arjen Markus <[email protected]>
Author:         Don Porter <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        31-Jan-2002
Post-History:   
Keywords:       test,string comparison,floating-point
Tcl-Version:    8.4

~ Abstract

This TIP proposes a simple mechanism to make the ''tcltest'' package
an even more flexible package than it already is by allowing the
programmer to define his or her own comparison procedures.  Such
procedures can deal with issues like allowing a (small) tolerance in
floating-point results.

~ Rationale

The ''test'' command of the package ''tcltest 2.0'' supports the
comparison of the actual result with the expected result by a number
of methods: exact matching, glob-style matching and matching via a
regular expression, according to the ''-match'' option.  The flexibility
is indeed enhanced over the package ''tcltest 1.0,'' as it is now much
easier to allow for small variations in ''string'' results.  But
it is nearly impossible to define an accurate test that checks if
floating-point results are the "same" - exact matching will seldom
suffice due to platform-specific round-off errors or differences in
formatting a floating-point number (''0.12'' versus ''.12'' for
instance).

It is also impossible to compare results that are not easily expressed
as strings, for instance an application that produces binary files
that need to be compared or simply very long strings - these could
easily be stored in an external file, but would be awkward in a file
with a large number of such tests.

~ Proposal

The package ''tcltest 2.0.2'' defines an internal comparison procedure,
''CompareStrings'' that performs matching according to the three built-in
''-match'' options of ''test''.  This
procedure can easily be replaced by one that invokes registered 
commands or procedures. Such a command or procedure takes two 
arguments and returns 1 for a match and a 0 for failure, 
just as ''CompareStrings'' does in the current implementation:

| proc myMatchProc { expected actual } { 
|   if { $expected (is somehow equal) $actual } {
|      return 1
|   } else
|      return 0
|   }
| }

A new public command ''customMatch'' is proposed for the purpose
of registering these matching commands.  It can register a procedure,
such as ''myMatchProc'' defined above:

| ::tcltest::customMatch mytype myMatchProc

or, as in the sample implementation, an incomplete command:

| ::tcltest::customMatch exact [list ::string equal]

When the ''test'' command is called with the ''-match mytype'' option,
the command ''myMatchProc'' will be completed with two arguments,
the expected and actual results, and will be evaluated in the global
namespace to determine whether the test result matches the expected
result.  Likewise, the ''test'' option ''-match exact'' will
cause matching to be tested by the command ''::string equal''.
The default value of the ''-match'' option will continue to be ''exact''.

Allowing procedures to be invoked by their type names gives us the 
flexibility to register as many such procedures or commands as required.

Because this proposal adds a new public command to the ''tcltest''
package, the version will be incremented to 2.1.

A patch to the current HEAD that implements this proposal is
available as Tcl Patch 521362 at the Tcl project at SourceForge.
http://sf.net/tracker/?func=detail&aid=521362&group_id=10894&atid=310894

~ Two Examples

To show how this works, we include two simple examples:

 * Testing a package for calculating mathematical functions like
   Bessel functions.

 * Testing for negative results, as when providing an alternative, but
   incompatible implementation of a feature.

First, suppose you have defined a package for calculating the value of
a general Bessel function, just the sort of function that returns
floating-point numbers.  Then the results may be imprecise due to
rounding-off errors, different values of ''tcl_precision'' or, even
more banally, differences in the formatting of floating-point numbers
(''0.12'' versus ''.12'' for instance). 

The following shows how to do this:

| #

| # Test implementation of Bessel functions
| # (Table only provides 4 decimals)
| #

| customMatch 4decimals matchFloat4Decimals
|
| proc matchFloat4Decimals { expected actual } {
|    return [expr {abs($expected-$actual) <= 0.5e-4}]
| }

|
| test "J0-1.1" "J0 for x=1.0" -match 4decimals -body {
|    J0 1.0
| } -result 0.7652
|
| test "J1-1.1" "J0 for x=1.0" -match 4decimals -body {
|    J1 1.0
| } -result 0.4401

The second example occurs for instance when testing alternative
implementations: you want to check that the original standard feature
is failing whereas the new but incompatible alternative gets it right.
Then:

| proc matchNegative { expected actual } {
|    set match 0
|    foreach a $actual e $expected {
|       if { $a != $e } {
|          set match 1
|          break
|       }
|    }

|    return $match
| }

|
| customMatch negative matchNegative
|
| #

| # Floating-point comparisons are imprecise. The following
| # test returns typically such a list as {643 1357 1921 79 781 1219}
| # so nothing even close to the expected values.
| # 

| test "ManyCompares-1.2" "Compare fails - naive comparison" \
|    -match negative -body {
|    set naiv_eq 0
|    set naiv_ne 0
|    set naiv_ge 0
|    set naiv_gt 0
|    set naiv_le 0
|    set naiv_lt 0
|
|    for { set i -1000 } { $i <= 1000 } { incr i } {
|       if { $i == 0 } continue
|
|       set x [expr {1.01/double($i)}]
|       set y [expr {(2.1*$x)*(double($i)/2.1)}]
|
|       if { $y == 1.01 } { incr naiv_eq }
|       if { $y != 1.01 } { incr naiv_ne }
|       if { $y >= 1.01 } { incr naiv_ge }
|       if { $y >  1.01 } { incr naiv_gt }
|       if { $y <= 1.01 } { incr naiv_le }
|       if { $y <  1.01 } { incr naiv_lt }
|    }

|    set result [list $naiv_eq $naiv_ne $naiv_ge $naiv_gt $naiv_le $naiv_lt]
| } -result {2000 0 2000 0 2000 0}

makes sure that a mismatch is treated as the expected outcome.

~ Alternatives and objections

Of course, it is possible to achieve these effects within the current
framework of ''tcltest'', by putting these match procedures inside the
body of the test case. No extra user command would be necessary then.

There are at least two drawbacks to this approach:

 * The result against which we want to match is hidden in the code

 * If the test fails, the actual result is not printed (at least not
   by the ''tcltest'' framework).

As a matter of fact, the proposed mechanism actually simplifies the 
current implementation of the three match types to a certain degree by 
turning a switch between the three types into an array index.

~ See Also

Tcl Feature Request 490298.
http://sf.net/tracker/?func=detail&aid=490298&group_id=10894&atid=360894

~ History

''Cameron Laird'' was quite enthousiastic about the idea of providing 
custom match procedures.

''Mo DeJong'' requested the explicit examples (the second is actually 
the situation that triggered this TIP in the first place).

''Don Porter <[email protected]>'' revised the registration mechanism 
such that an arbitrary set of matching commands or procedures can be supported. His suggestions led to a revision of the TIP. He also 
revised the draft implementation.

~ Copyright

This document is placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|
|
|

|
|

|

|
|
|

|

|
|
|
|
|
<
<
|
>
>
|

|

|

|

|
|

|
|
|

|

|

|

|

|

<
>
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
|
|

|
|
|
|
|
|
<
<
>
>
|
<
>
|
|
|
<
>
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|

|

|

|
|

|

|

|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55

56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105

106
107
108

109
110
111
112
113

114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134

135
136
137

138
139
140
141

142
143
144
145

146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167

168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211

# TIP 85: Custom Comparisons in Tcltest

	Author:         Arjen Markus <[email protected]>
	Author:         Don Porter <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        31-Jan-2002
	Post-History:   
	Keywords:       test,string comparison,floating-point
	Tcl-Version:    8.4
-----

# Abstract

This TIP proposes a simple mechanism to make the _tcltest_ package
an even more flexible package than it already is by allowing the
programmer to define his or her own comparison procedures.  Such
procedures can deal with issues like allowing a \(small\) tolerance in
floating-point results.

# Rationale

The _test_ command of the package _tcltest 2.0_ supports the
comparison of the actual result with the expected result by a number
of methods: exact matching, glob-style matching and matching via a
regular expression, according to the _-match_ option.  The flexibility
is indeed enhanced over the package _tcltest 1.0,_ as it is now much
easier to allow for small variations in _string_ results.  But
it is nearly impossible to define an accurate test that checks if
floating-point results are the "same" - exact matching will seldom
suffice due to platform-specific round-off errors or differences in
formatting a floating-point number \(_0.12_ versus _.12_ for
instance\).

It is also impossible to compare results that are not easily expressed
as strings, for instance an application that produces binary files
that need to be compared or simply very long strings - these could
easily be stored in an external file, but would be awkward in a file
with a large number of such tests.

# Proposal

The package _tcltest 2.0.2_ defines an internal comparison procedure,
_CompareStrings_ that performs matching according to the three built-in
_-match_ options of _test_.  This
procedure can easily be replaced by one that invokes registered 
commands or procedures. Such a command or procedure takes two 
arguments and returns 1 for a match and a 0 for failure, 
just as _CompareStrings_ does in the current implementation:

	 proc myMatchProc { expected actual } { 
	   if { $expected (is somehow equal) $actual } {
	      return 1
	   } else
	      return 0

	   }
	 }

A new public command _customMatch_ is proposed for the purpose
of registering these matching commands.  It can register a procedure,
such as _myMatchProc_ defined above:

	 ::tcltest::customMatch mytype myMatchProc

or, as in the sample implementation, an incomplete command:

	 ::tcltest::customMatch exact [list ::string equal]

When the _test_ command is called with the _-match mytype_ option,
the command _myMatchProc_ will be completed with two arguments,
the expected and actual results, and will be evaluated in the global
namespace to determine whether the test result matches the expected
result.  Likewise, the _test_ option _-match exact_ will
cause matching to be tested by the command _::string equal_.
The default value of the _-match_ option will continue to be _exact_.

Allowing procedures to be invoked by their type names gives us the 
flexibility to register as many such procedures or commands as required.

Because this proposal adds a new public command to the _tcltest_
package, the version will be incremented to 2.1.

A patch to the current HEAD that implements this proposal is
available as Tcl Patch 521362 at the Tcl project at SourceForge.
<http://sf.net/tracker/?func=detail&aid=521362&group\_id=10894&atid=310894>

# Two Examples

To show how this works, we include two simple examples:

 * Testing a package for calculating mathematical functions like
   Bessel functions.

 * Testing for negative results, as when providing an alternative, but
   incompatible implementation of a feature.

First, suppose you have defined a package for calculating the value of
a general Bessel function, just the sort of function that returns
floating-point numbers.  Then the results may be imprecise due to
rounding-off errors, different values of _tcl\_precision_ or, even
more banally, differences in the formatting of floating-point numbers
\(_0.12_ versus _.12_ for instance\). 

The following shows how to do this:

	 #
	 # Test implementation of Bessel functions
	 # (Table only provides 4 decimals)

	 #
	 customMatch 4decimals matchFloat4Decimals

	 proc matchFloat4Decimals { expected actual } {
	    return [expr {abs($expected-$actual) <= 0.5e-4}]

	 }

	 test "J0-1.1" "J0 for x=1.0" -match 4decimals -body {
	    J0 1.0
	 } -result 0.7652

	 test "J1-1.1" "J0 for x=1.0" -match 4decimals -body {
	    J1 1.0
	 } -result 0.4401

The second example occurs for instance when testing alternative
implementations: you want to check that the original standard feature
is failing whereas the new but incompatible alternative gets it right.
Then:

	 proc matchNegative { expected actual } {
	    set match 0
	    foreach a $actual e $expected {
	       if { $a != $e } {
	          set match 1
	          break

	       }
	    }
	    return $match

	 }

	 customMatch negative matchNegative

	 #
	 # Floating-point comparisons are imprecise. The following
	 # test returns typically such a list as {643 1357 1921 79 781 1219}
	 # so nothing even close to the expected values.

	 # 
	 test "ManyCompares-1.2" "Compare fails - naive comparison" \
	    -match negative -body {
	    set naiv_eq 0
	    set naiv_ne 0
	    set naiv_ge 0
	    set naiv_gt 0
	    set naiv_le 0
	    set naiv_lt 0

	    for { set i -1000 } { $i <= 1000 } { incr i } {
	       if { $i == 0 } continue

	       set x [expr {1.01/double($i)}]
	       set y [expr {(2.1*$x)*(double($i)/2.1)}]

	       if { $y == 1.01 } { incr naiv_eq }
	       if { $y != 1.01 } { incr naiv_ne }
	       if { $y >= 1.01 } { incr naiv_ge }
	       if { $y >  1.01 } { incr naiv_gt }
	       if { $y <= 1.01 } { incr naiv_le }
	       if { $y <  1.01 } { incr naiv_lt }

	    }
	    set result [list $naiv_eq $naiv_ne $naiv_ge $naiv_gt $naiv_le $naiv_lt]
	 } -result {2000 0 2000 0 2000 0}

makes sure that a mismatch is treated as the expected outcome.

# Alternatives and objections

Of course, it is possible to achieve these effects within the current
framework of _tcltest_, by putting these match procedures inside the
body of the test case. No extra user command would be necessary then.

There are at least two drawbacks to this approach:

 * The result against which we want to match is hidden in the code

 * If the test fails, the actual result is not printed \(at least not
   by the _tcltest_ framework\).

As a matter of fact, the proposed mechanism actually simplifies the 
current implementation of the three match types to a certain degree by 
turning a switch between the three types into an array index.

# See Also

Tcl Feature Request 490298.
<http://sf.net/tracker/?func=detail&aid=490298&group\_id=10894&atid=360894>

# History

_Cameron Laird_ was quite enthousiastic about the idea of providing 
custom match procedures.

_Mo DeJong_ requested the explicit examples \(the second is actually 
the situation that triggered this TIP in the first place\).

_Don Porter <[email protected]>_ revised the registration mechanism 
such that an arbitrary set of matching commands or procedures can be supported. His suggestions led to a revision of the TIP. He also 
revised the draft implementation.

# Copyright

This document is placed in the public domain.

Name change from tip/86.tip to tip/86.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346

TIP:            86
Title:          Improved Debugger Support
Version:        $Revision: 1.26 $
Author:         Peter MacDonald <[email protected]>
Author:         Peter MacDonald <[email protected]>
State:          Draft
Type:           Project
Vote:           Pending
Created:        08-Feb-2002
Post-History:   
Tcl-Version:    8.7

~ Abstract

This TIP proposes the storage by Tcl of source code file-name and
line-numbering information, making it available at script execution
time. It also adds additional '''trace''' and '''info''' subcommands
to make it easier for a debugger to control a Tcl script much as
''gdb'' can control a C program.

~ Rationale

Currently, although Tcl provides quite reasonable information to users
in error traces, the line numbers within those traces are always
relative to the evaluation context containing them (often the
procedure, but not always) and not to the script file containing the
procedure.  This is substantially different to virtually every other
computer language and makes correlating errors with the source line
that caused them much more difficult.  This also makes coupling a Tcl
interpreter to an external debugging tool more difficult.  This TIP
proposes adding new interfaces to the Tcl core to make such debugging
activity easier.

A new '''trace execution''' option enables Tcl to track line number
and source file information associated with statements being executed
and call a single callback.
A new '''info line'' option provides access to line number information.
As a result, it becomes a simple matter to implement a debugger for
Tcl, in Tcl.  Furthermore, the implementation also serves as example usage of
the C interface, enabling similar capabilities at the lower level.

A simple Tcl debugger, ''tgdb'', written in Tcl and emulating ''gdb'',
is included with this TIP to demonstrate the
use of this interface.  ''tgdb'' runs and controls a Tcl application
in a sub-interp using '''trace execution'''  and '''interp alias'''.
It supports breakpoints on lines/procs/vars/etc,
single-stepping, up/down stack and evals.  It is designed to work both as a
commandline and a slave process (see ''Reference Implementation'').

Finally, upon error within a procedure, the file path and
absolute (as opposed to relative) line number are printed out when
available, even in the case where called from an after or callback
invocation.  Aside from aiding the user in more easily locating and
dealing with errors, the message is machine parseable  For example:
automatically bring the user into an editor at the offending line.

~ Specification

A new '''execution''' subcommand to the '''trace''' command.

 > '''trace execution''' ''target'' ?''level''?

This arranges for an execution trace to be setup for commands at nesting
''level'' or above, thereby providing a simple Tcl interface for tracing
commands to say, implement a  debugger.  With no arguments,  the current
target is returned.  If target is the empty string, the execution trace
is removed.  The ''target'' argument is assumed to be a command string to be
executed.  When level is
not specified, it defaults to 0, meaning trace all  commands.  For  each
traced command, the following data will be produced:

 * linenumber

 >  The  line number the instruction begins on.

 * filename

 > The fully normalized file name.

 * nestlevel

 > The nesting level of the command.

 * stacklevel

 > The stack call level as per '''info level'''.

 * curnsproc

 > The current fully qualified namespace/function.

 * cmdname

 > The fully qualified command name of the command to be invoked.

 * command

 > The command line to be executed including arguments.

 * flags

 > Integer bit flags, currently bit 1 is set for breakpoint.

The target is presumed
to be a valid Tcl command onto which is appended the above arguments
before evaluation. Any return
from the command other than a normal return results in the command not
being executed.  As with all traces, execution tracing is disabled
within a trace handler.

Second, a new '''line''' subcommand to '''info''' gives access to the file
path and line number information.  It takes
subcommands of its own in turn:

 * '''info line current'''

 > Returns a list with two items: the line number and file name of
   the current statement.

 * '''info line number''' ''proc'' ?''line''?

 > Get/set the line number for the start of ''proc''.  In get mode,
   returns the definition line number in the message body.

 * '''info line level''' ''number''

 > Like the info level command, but returns the line number and
  file name from which the call at level number originates.
  For use with ''trace execution''.

 * '''info line file''' ?''proc'' ?''file''??

 > Get/set the sourced file path for ''proc''.  If ''proc'' is not specified,
   dump all known sourced file paths.

 * '''info line find''' ''file line''

 > Given a file path and line number, return the ''proc'' name
   containing line number ''line''.  A new nonstatic procedure
   ''TclFindProcByLine()'' provides this function.

 * '''info line relativeerror''' ?''bool''?

 > Set to 1 to disable absolute line number and file path on a
   procedure error.  This demotes procedure traceback errors to the
   same format as all other traceback errors, that is, using the
   relative the line number and file name.

These exhibit the following behavior:

 * ''What does this do when you redefine a proc?''

 > You get the values from the latest definition.

 * ''What about when you use interp aliases?''

 > You get an error, as it is not considered a proc.

 * ''And if proc itself gets redefined by someone's special
   debugger?''

 > If the definition is not the result of a source, the file/line come
   back as an empty string.

Third, a new ''info'' subcommand ''return''.

 * '''info return'''

 > When in ''trace execution'' mode, returns the saved last result of
  the previously executed command. Otherwise returns an empty string.
  Commands executed as part of a trace handler do not affect or change
  the saved last result. 

Forth, an additional flag option ''debug'' to ''trace add variable''

  * '''trace add variable {read write unset debug} command'''

  > Used in conjuction with ''read'', ''write'', and ''unset'',
   this option allows a debugger to set read/write traces that will
   not trigger the execution trace. In other words, it specifies
   that the command is debugger code that is not to be traced.
   Normally, debugger code is entered via the 
   '''trace execution''' handler and so has tracing disabled.
   This just provides a similar feature for '''trace variable'''

Fifth,  a new '''breakpoint''' subcommand to the '''trace''' command.

 > '''trace breakpoint''' ??''line file'' ?''level'' ...??

The ''trace breakpoint'' manages a list of breakpoints that cause an
''execution trace'' to trigger, even when the nestlevel is exceeded.
With no arguments it returns a ternery list of all breakpoints in sets
of the triples: line, file, and state.  With two arguments, the
current state for the breakpoint is returned.  With three or more arguments,
new breakpoints are created.  If created with a state of zero,
the breakpoint is considered inactive.  Setting the state of a
breakpoint to the empty string effectively deletes the breakpoint.
A state set to an N greater than zero triggers every Nth time.

~ Changes

Sourced file paths are stored per interp in a hash table.  File/line
numbering information is also stored in the ''Interp'', ''Proc'', ''After'',
and ''CallFrame'' structures.  Newline counting/shifting code was
added to ''proc'', ''while'', ''for'', ''foreach'', and ''if''.
All but the non-trivial code is active
only when the new TRACE_LINE_NUMBERS interp flag is active,
which is the case when using ''trace execution''.

Most new variables within Interp are in the struct subfield sourceInfo
of type ''Tcl_SourceInfo'', which can be retrieved via the new
''Tcl_GetSourceInfo(interp)'' stubbed/public call.

~ Overhead/Impact

The runtime impact to Tcl should be modest: a few 10's of
kilobytes of memory, even for moderately large programs.  Most of the space
impact occurs in storing the file paths.
A typical example from a large system:

|  100 sourced files * 100 bytes = 10K.

The other space overhead adds up to several words (8 bytes on a 32-bit
platform) per defined procedure, plus an additional words in
the ''Interp'' structure.

Runtime processing overhead should be negligible.

However, there have been no benchmarks done to validate these
assertions.

~ Reference Implementation

This patch is against Tcl 8.4.9 and represents a complete rework
of the approach.

http://pdqi.com/download/tclline-8.4.9.diff.gz

There is a simple demonstration debugger script: ''tgdb.tcl''.

http://pdqi.com/download/tgdb.tcl

~ Previous/Old Reference Implementation

http://pdqi.com/download/tclline-cvs.diff.gz - Patch against CVS head.

http://pdqi.com/download/tclline-8.4.6.diff.gz - Patch against Tcl 8.4.6

The CVS patch was against the CVS head is as of June 13/2004.
These have been lightly tested against numerous small Tcl programs.

There is also an initial version of a debugger: ''tgdb''.

http://pdqi.com/download/tgdb-2.0.tar.gz

''tgdb'' emulates the basic commands of ''gdb'' (''s'', ''n'',
''c'', ''f'', ''bt'', ''break'', ''info locals'', etc).  This newest
version also supports watchpoints and display variables.
With ''load'' and ''run'' commands added, ''tgdb'' should probably
work even with ''emacs'' and ''ddd''.

An additional package ''pdqi'' provides ''tdb'', a GUI front-end to
''gdb'', modified to also work with ''tgdb''.

~ Possible Future Enhancements

Build and store a line number table internally during parse?

Line number lookup via the source string.
A simple way to implement this might be to lookup string against the
''codePtr->source+bestSrcOffset'' as returned by ''GetSrcInfoForPc()''.

Add special handling for eval.  Cases like ''eval $str'' should eventually
be changed to report a line number of 0 (or more likely the line number
of the original statement) for all statements with any argument involving
a sub-eval. 

Possibly implement character offsets within a line.

~ Notes

A test has been added to the tests/trace.test.
A utility ''trcline.tcl'' is provided that the test uses to
provide some measure of the accuracy of the line number tracing.

~ Comments and Feedback

Jeff Hobbs asked what about '''interp alias''', etc.

   * Updated TIP to document cases

Jeff Hobbs notes filename storage is inefficient and finalization

   * Code changed to just increment ref count

   * '''TODO:''' What needs to be done for ''Tcl_Finalize''?

Neil Madden/Stephen Trier comment on info subcommand names ''line'',
''file'' and ''proc'' and possible future uses for ''line''

   * Changed to a single subcommand ''line'' and use sub-sub commands.

   * Additional subsubcommands can easily be added.

Donal Fellows writes: Is there a way to do an equivalent of ''#line''
directives in C

   * we can now set line number etc of a proc.  Is that enough?

Donald Porter notes that changing Tcl_Parse breaks binary compatibility

   * Move all parse variables to Interp and save/restore values
     on entry/exit to Tcl_EvalEx and TclCompileScript.

Donald Porter notes that the hash table should be per Interp

   * Code changed to move hash table to Interp.

Mo DeJong notes: file path should be used in place of file name

   * TIP updated to use path where appropriate

Mo DeJong suggests to maybe use ''TclpObjNormalizePath(fileName)''

   * No action yet

Donal Fellows objects to no support for '''proc'''s in subevals and
Andreas Kupries suggests defining a line number ''Tcl_Token'' type.

   * Add support for '''proc''' in subeval by addition to
     ''ResolvedCmdName''

   * This is now fixed.

Donal Fellows asks if trace is disabled in the execution handler,
how tracing to a sub-interp would work, and clarification on the
purpose and use of trace variable {debug}.

    * The documentation was updated to clarify these points.

~ Copyright

This document has been placed in the public domain.

''tgdb'' and ''pdqi'' have a BSD copyright by Peter MacDonald and
 PDQ Interfaces Inc.

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|
|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|
|

|

|

|
|

|

|
|
|

|
|

|
|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|
|

|
|

|
|

|

|

|
|
|

|

|

|

|

|

|
|

|

|

|

|

|

|
|

|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346

# TIP 86: Improved Debugger Support

	Author:         Peter MacDonald <[email protected]>
	Author:         Peter MacDonald <[email protected]>
	State:          Draft
	Type:           Project
	Vote:           Pending
	Created:        08-Feb-2002
	Post-History:   
	Tcl-Version:    8.7
-----

# Abstract

This TIP proposes the storage by Tcl of source code file-name and
line-numbering information, making it available at script execution
time. It also adds additional **trace** and **info** subcommands
to make it easier for a debugger to control a Tcl script much as
_gdb_ can control a C program.

# Rationale

Currently, although Tcl provides quite reasonable information to users
in error traces, the line numbers within those traces are always
relative to the evaluation context containing them \(often the
procedure, but not always\) and not to the script file containing the
procedure.  This is substantially different to virtually every other
computer language and makes correlating errors with the source line
that caused them much more difficult.  This also makes coupling a Tcl
interpreter to an external debugging tool more difficult.  This TIP
proposes adding new interfaces to the Tcl core to make such debugging
activity easier.

A new **trace execution** option enables Tcl to track line number
and source file information associated with statements being executed
and call a single callback.
A new **info line_ option provides access to line number information.
As a result, it becomes a simple matter to implement a debugger for
Tcl, in Tcl.  Furthermore, the implementation also serves as example usage of
the C interface, enabling similar capabilities at the lower level.

A simple Tcl debugger, _tgdb_, written in Tcl and emulating _gdb_,
is included with this TIP to demonstrate the
use of this interface.  _tgdb_ runs and controls a Tcl application
in a sub-interp using **trace execution**  and **interp alias**.
It supports breakpoints on lines/procs/vars/etc,
single-stepping, up/down stack and evals.  It is designed to work both as a
commandline and a slave process \(see _Reference Implementation_\).

Finally, upon error within a procedure, the file path and
absolute \(as opposed to relative\) line number are printed out when
available, even in the case where called from an after or callback
invocation.  Aside from aiding the user in more easily locating and
dealing with errors, the message is machine parseable  For example:
automatically bring the user into an editor at the offending line.

# Specification

A new **execution** subcommand to the **trace** command.

 > **trace execution** _target_ ?_level_?

This arranges for an execution trace to be setup for commands at nesting
_level_ or above, thereby providing a simple Tcl interface for tracing
commands to say, implement a  debugger.  With no arguments,  the current
target is returned.  If target is the empty string, the execution trace
is removed.  The _target_ argument is assumed to be a command string to be
executed.  When level is
not specified, it defaults to 0, meaning trace all  commands.  For  each
traced command, the following data will be produced:

 * linenumber

	 >  The  line number the instruction begins on.

 * filename

	 > The fully normalized file name.

 * nestlevel

	 > The nesting level of the command.

 * stacklevel

	 > The stack call level as per **info level**.

 * curnsproc

	 > The current fully qualified namespace/function.

 * cmdname

	 > The fully qualified command name of the command to be invoked.

 * command

	 > The command line to be executed including arguments.

 * flags

	 > Integer bit flags, currently bit 1 is set for breakpoint.

The target is presumed
to be a valid Tcl command onto which is appended the above arguments
before evaluation. Any return
from the command other than a normal return results in the command not
being executed.  As with all traces, execution tracing is disabled
within a trace handler.

Second, a new **line** subcommand to **info** gives access to the file
path and line number information.  It takes
subcommands of its own in turn:

 * **info line current**

	 > Returns a list with two items: the line number and file name of
   the current statement.

 * **info line number** _proc_ ?_line_?

	 > Get/set the line number for the start of _proc_.  In get mode,
   returns the definition line number in the message body.

 * **info line level** _number_

	 > Like the info level command, but returns the line number and
  file name from which the call at level number originates.
  For use with _trace execution_.

 * **info line file** ?_proc_ ?_file_??

	 > Get/set the sourced file path for _proc_.  If _proc_ is not specified,
   dump all known sourced file paths.

 * **info line find** _file line_

	 > Given a file path and line number, return the _proc_ name
   containing line number _line_.  A new nonstatic procedure
   _TclFindProcByLine\(\)_ provides this function.

 * **info line relativeerror** ?_bool_?

	 > Set to 1 to disable absolute line number and file path on a
   procedure error.  This demotes procedure traceback errors to the
   same format as all other traceback errors, that is, using the
   relative the line number and file name.

These exhibit the following behavior:

 * _What does this do when you redefine a proc?_

	 > You get the values from the latest definition.

 * _What about when you use interp aliases?_

	 > You get an error, as it is not considered a proc.

 * _And if proc itself gets redefined by someone's special
   debugger?_

	 > If the definition is not the result of a source, the file/line come
   back as an empty string.

Third, a new _info_ subcommand _return_.

 * **info return**

	 > When in _trace execution_ mode, returns the saved last result of
  the previously executed command. Otherwise returns an empty string.
  Commands executed as part of a trace handler do not affect or change
  the saved last result. 

Forth, an additional flag option _debug_ to _trace add variable_

  * **trace add variable \{read write unset debug\} command**

	  > Used in conjuction with _read_, _write_, and _unset_,
   this option allows a debugger to set read/write traces that will
   not trigger the execution trace. In other words, it specifies
   that the command is debugger code that is not to be traced.
   Normally, debugger code is entered via the 
   **trace execution** handler and so has tracing disabled.
   This just provides a similar feature for **trace variable**

Fifth,  a new **breakpoint** subcommand to the **trace** command.

 > **trace breakpoint** ??_line file_ ?_level_ ...??

The _trace breakpoint_ manages a list of breakpoints that cause an
_execution trace_ to trigger, even when the nestlevel is exceeded.
With no arguments it returns a ternery list of all breakpoints in sets
of the triples: line, file, and state.  With two arguments, the
current state for the breakpoint is returned.  With three or more arguments,
new breakpoints are created.  If created with a state of zero,
the breakpoint is considered inactive.  Setting the state of a
breakpoint to the empty string effectively deletes the breakpoint.
A state set to an N greater than zero triggers every Nth time.

# Changes

Sourced file paths are stored per interp in a hash table.  File/line
numbering information is also stored in the _Interp_, _Proc_, _After_,
and _CallFrame_ structures.  Newline counting/shifting code was
added to _proc_, _while_, _for_, _foreach_, and _if_.
All but the non-trivial code is active
only when the new TRACE\_LINE\_NUMBERS interp flag is active,
which is the case when using _trace execution_.

Most new variables within Interp are in the struct subfield sourceInfo
of type _Tcl\_SourceInfo_, which can be retrieved via the new
_Tcl\_GetSourceInfo\(interp\)_ stubbed/public call.

# Overhead/Impact

The runtime impact to Tcl should be modest: a few 10's of
kilobytes of memory, even for moderately large programs.  Most of the space
impact occurs in storing the file paths.
A typical example from a large system:

	  100 sourced files * 100 bytes = 10K.

The other space overhead adds up to several words \(8 bytes on a 32-bit
platform\) per defined procedure, plus an additional words in
the _Interp_ structure.

Runtime processing overhead should be negligible.

However, there have been no benchmarks done to validate these
assertions.

# Reference Implementation

This patch is against Tcl 8.4.9 and represents a complete rework
of the approach.

<http://pdqi.com/download/tclline-8.4.9.diff.gz>

There is a simple demonstration debugger script: _tgdb.tcl_.

<http://pdqi.com/download/tgdb.tcl>

# Previous/Old Reference Implementation

<http://pdqi.com/download/tclline-cvs.diff.gz> - Patch against CVS head.

<http://pdqi.com/download/tclline-8.4.6.diff.gz> - Patch against Tcl 8.4.6

The CVS patch was against the CVS head is as of June 13/2004.
These have been lightly tested against numerous small Tcl programs.

There is also an initial version of a debugger: _tgdb_.

<http://pdqi.com/download/tgdb-2.0.tar.gz>

_tgdb_ emulates the basic commands of _gdb_ \(_s_, _n_,
_c_, _f_, _bt_, _break_, _info locals_, etc\).  This newest
version also supports watchpoints and display variables.
With _load_ and _run_ commands added, _tgdb_ should probably
work even with _emacs_ and _ddd_.

An additional package _pdqi_ provides _tdb_, a GUI front-end to
_gdb_, modified to also work with _tgdb_.

# Possible Future Enhancements

Build and store a line number table internally during parse?

Line number lookup via the source string.
A simple way to implement this might be to lookup string against the
_codePtr->source\+bestSrcOffset_ as returned by _GetSrcInfoForPc\(\)_.

Add special handling for eval.  Cases like _eval $str_ should eventually
be changed to report a line number of 0 \(or more likely the line number
of the original statement\) for all statements with any argument involving
a sub-eval. 

Possibly implement character offsets within a line.

# Notes

A test has been added to the tests/trace.test.
A utility _trcline.tcl_ is provided that the test uses to
provide some measure of the accuracy of the line number tracing.

# Comments and Feedback

Jeff Hobbs asked what about **interp alias**, etc.

   * Updated TIP to document cases

Jeff Hobbs notes filename storage is inefficient and finalization

   * Code changed to just increment ref count

   * **TODO:** What needs to be done for _Tcl\_Finalize_?

Neil Madden/Stephen Trier comment on info subcommand names _line_,
_file_ and _proc_ and possible future uses for _line_

   * Changed to a single subcommand _line_ and use sub-sub commands.

   * Additional subsubcommands can easily be added.

Donal Fellows writes: Is there a way to do an equivalent of _\#line_
directives in C

   * we can now set line number etc of a proc.  Is that enough?

Donald Porter notes that changing Tcl\_Parse breaks binary compatibility

   * Move all parse variables to Interp and save/restore values
     on entry/exit to Tcl\_EvalEx and TclCompileScript.

Donald Porter notes that the hash table should be per Interp

   * Code changed to move hash table to Interp.

Mo DeJong notes: file path should be used in place of file name

   * TIP updated to use path where appropriate

Mo DeJong suggests to maybe use _TclpObjNormalizePath\(fileName\)_

   * No action yet

Donal Fellows objects to no support for **proc**s in subevals and
Andreas Kupries suggests defining a line number _Tcl\_Token_ type.

   * Add support for **proc** in subeval by addition to
     _ResolvedCmdName_

   * This is now fixed.

Donal Fellows asks if trace is disabled in the execution handler,
how tracing to a sub-interp would work, and clarification on the
purpose and use of trace variable \{debug\}.

    * The documentation was updated to clarify these points.

# Copyright

This document has been placed in the public domain.

_tgdb_ and _pdqi_ have a BSD copyright by Peter MacDonald and
 PDQ Interfaces Inc.

Name change from tip/87.tip to tip/87.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104

TIP:            87
Title:          Allow Tcl Access to the Recursion Limit
Version:        $Revision: 1.11 $
Author:         Stephen Trier <[email protected]>
Author:         Richard Suchenwirth <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        19-Feb-2002
Post-History:   
Discussions-To: news:comp.lang.tcl
Keywords:       Tcl_SetRecusionLimit,recursion limit
Tcl-Version:    8.4

~ Abstract

An extension to the [[interp]] command, [[interp recursionlimit]],
will permit Tcl scripts to control their own recursion limits.  Until
now, this limit has been changeable from a C API, but not from within
Tcl.

~ Rationale

As of Tcl 8.4a3, Tcl scripts must live with the default recursion
depth of 1000 nested calls to the ''Tcl_Eval'' family of functions or
resort to C code to change the limit.  Nevertheless, Tcl programmers
may find it useful to reduce the limit when debugging or to increase
it for scripts that include deeply recursive functions.  The changes
proposed in this TIP will make this possible in pure Tcl code.

~ Specification

 generic/tclInterp.c: Add subcommands to [interp] and to the slave
   interpreter object command with the following syntax:

 > interp recursionlimit ''path'' ''?newlimit?''

 > ''slave'' recursionlimit ''?newlimit?''

 > The parameter ''newlimit'' must be a positive integer.  When it is
   present, the limit is changed to ''newlimit'' and the command
   returns the new recursion limit.  If the ''newlimit'' parameter is
   absent, the command returns the current recursion limit.

 > No maximum value is enforced.  It is the programmer's
   responsibility to ensure the recursion limit will not overflow the
   process stack.

 > A safe interpreter is not allowed to change the recursion limit for
   itself nor for any other interpreter.  Attempting to do so will
   generate an error.  Safe interpreters are allowed to query
   recursion limits.

 > If an interpreter changes its own recursion limit to a value lower
   than the current Tcl_Eval nesting level, the limit will be
   changed, then an error message appropriate to this particular
   situation will be issued by the recursionlimit command.
   (Error text: "falling back due to new recursion limit")

 > If an interpreter changes a sub-interpreter's recursion limit to
   less than the sub-interpreter's current Tcl_Eval nesting level,
   no immediate error is issued.  The sub-interpreter will throw a
   "too many nested calls to Tcl_Eval (infinite loop?)" error if
   its nesting is still deeper than its recursion limit when next
   a command is executed in its context.

 generic/tclTest.c: Remove the now-unnecessary testsetrecursionlimit
   command.

 doc/interp.n: Add documentation for the new subcommands, including a
   warning about stack overflow, much like the warning in the
   documentation for ''Tcl_SetRecursionLimit()''.

 test/interp.test: Add tests for the new subcommands.

~ Comments Received

Discussion of this TIP took place in the following threads:

http://groups.google.com/groups?hl=en&threadm=3C6D0A88.5DC9D8B4%40utdt.edu

http://groups.google.com/groups?hl=en&threadm=3C73E98A.8ED9DDE6%40cisco.com

http://www.geocrawler.com/mail/thread.php3?subject=%5BTCLCORE%5D+TIP+%2387%3A+Allow+Tcl+Access+to+the+Recursion+Limit&list=7375

Using a command or variable ''::tcl::recursionLimit'' to manipulate
the limit was initially considered, but Miguel Sofer suggested making
the function a subcommand of [[interp]] because the recursion limit is
logically an attribute of each interpreter.  Miguel also pointed out that implementing ''TclpCheckStackSpace()'' for Unix would mitigate
the dangers of setting the recursion limit too high.

comp.lang.tcl saw some discussion of whether it would be appropriate to have a way to completely remove the recursion limit. The consensus was to not add such a feature.

The initial version of this TIP did not provide for a diagnostic error message for the case where the nesting is already deeper than the new recursion level. Ken Fitch, Don Porter, Miguel Sofer, and Donal Fellows discussed whether this was important. This version of the TIP uses Donal Fellows's suggestion of changing the recursion limit as requested, but providing a meaningful error message if the nesting is too deep for the new limit.

Donal Fellows suggested that slave interpreters should inherit their recursion limit from their parent. As it turns out, this behavior was already present but was not documented. The reference implementation documents it.

~ Reference Implementation

An implementation of this TIP, with tests and documentation, is patch number 522849 on SourceForge.

~ Copyright

This document is in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

|
|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104

# TIP 87: Allow Tcl Access to the Recursion Limit

	Author:         Stephen Trier <[email protected]>
	Author:         Richard Suchenwirth <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        19-Feb-2002
	Post-History:   
	Discussions-To: news:comp.lang.tcl
	Keywords:       Tcl_SetRecusionLimit,recursion limit
	Tcl-Version:    8.4
-----

# Abstract

An extension to the [interp] command, [interp recursionlimit],
will permit Tcl scripts to control their own recursion limits.  Until
now, this limit has been changeable from a C API, but not from within
Tcl.

# Rationale

As of Tcl 8.4a3, Tcl scripts must live with the default recursion
depth of 1000 nested calls to the _Tcl\_Eval_ family of functions or
resort to C code to change the limit.  Nevertheless, Tcl programmers
may find it useful to reduce the limit when debugging or to increase
it for scripts that include deeply recursive functions.  The changes
proposed in this TIP will make this possible in pure Tcl code.

# Specification

 generic/tclInterp.c: Add subcommands to [interp] and to the slave
   interpreter object command with the following syntax:

 > interp recursionlimit _path_ _?newlimit?_

 > _slave_ recursionlimit _?newlimit?_

 > The parameter _newlimit_ must be a positive integer.  When it is
   present, the limit is changed to _newlimit_ and the command
   returns the new recursion limit.  If the _newlimit_ parameter is
   absent, the command returns the current recursion limit.

 > No maximum value is enforced.  It is the programmer's
   responsibility to ensure the recursion limit will not overflow the
   process stack.

 > A safe interpreter is not allowed to change the recursion limit for
   itself nor for any other interpreter.  Attempting to do so will
   generate an error.  Safe interpreters are allowed to query
   recursion limits.

 > If an interpreter changes its own recursion limit to a value lower
   than the current Tcl\_Eval nesting level, the limit will be
   changed, then an error message appropriate to this particular
   situation will be issued by the recursionlimit command.
   \(Error text: "falling back due to new recursion limit"\)

 > If an interpreter changes a sub-interpreter's recursion limit to
   less than the sub-interpreter's current Tcl\_Eval nesting level,
   no immediate error is issued.  The sub-interpreter will throw a
   "too many nested calls to Tcl\_Eval \(infinite loop?\)" error if
   its nesting is still deeper than its recursion limit when next
   a command is executed in its context.

 generic/tclTest.c: Remove the now-unnecessary testsetrecursionlimit
   command.

 doc/interp.n: Add documentation for the new subcommands, including a
   warning about stack overflow, much like the warning in the
   documentation for _Tcl\_SetRecursionLimit\(\)_.

 test/interp.test: Add tests for the new subcommands.

# Comments Received

Discussion of this TIP took place in the following threads:

<http://groups.google.com/groups?hl=en&threadm=3C6D0A88.5DC9D8B4%40utdt.edu>

<http://groups.google.com/groups?hl=en&threadm=3C73E98A.8ED9DDE6%40cisco.com>

<http://www.geocrawler.com/mail/thread.php3?subject=%5BTCLCORE%5D\+TIP\+%2387%3A\+Allow\+Tcl\+Access\+to\+the\+Recursion\+Limit&list=7375>

Using a command or variable _::tcl::recursionLimit_ to manipulate
the limit was initially considered, but Miguel Sofer suggested making
the function a subcommand of [interp] because the recursion limit is
logically an attribute of each interpreter.  Miguel also pointed out that implementing _TclpCheckStackSpace\(\)_ for Unix would mitigate
the dangers of setting the recursion limit too high.

comp.lang.tcl saw some discussion of whether it would be appropriate to have a way to completely remove the recursion limit. The consensus was to not add such a feature.

The initial version of this TIP did not provide for a diagnostic error message for the case where the nesting is already deeper than the new recursion level. Ken Fitch, Don Porter, Miguel Sofer, and Donal Fellows discussed whether this was important. This version of the TIP uses Donal Fellows's suggestion of changing the recursion limit as requested, but providing a meaningful error message if the nesting is too deep for the new limit.

Donal Fellows suggested that slave interpreters should inherit their recursion limit from their parent. As it turns out, this behavior was already present but was not documented. The reference implementation documents it.

# Reference Implementation

An implementation of this TIP, with tests and documentation, is patch number 522849 on SourceForge.

# Copyright

This document is in the public domain.

Name change from tip/88.tip to tip/88.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119

TIP:            88
Title:          Extend Tcl Process Id Control via 'pid'
Version:        $Revision: 1.9 $
Author:         Jeff Hobbs <[email protected]>
Author:         Vince Darley <[email protected]>
State:          Rejected
Type:           Project
Vote:           Done
Created:        11-Mar-2002
Post-History:   
Tcl-Version:    8.4
Obsoleted-By:	240

~ Abstract

This TIP proposes extended the [[pid]] command to provide more
control over native processes in Tcl.

~ Rationale

Certain process control functions have shown themselves to be portable
and of high usefulness to Tcl programmers.  Most of these already
exist in TclX, but simply requiring that extension isn't always
acceptable.  The [[pid]] command in Tcl is a command that is often
overlooked, and so simple that it lends itself easily to being
enhanced with new syntax.  This TIP proposes adding subcommands to
[[pid]] that extend the process control functionality of pure Tcl.

~ Specification

|   pid ?fileId?
|   pid terminate ?-force? ?--? pid ?pid ...?

The first line is the current definition for [[pid]], which is to
return the name of the current process id, or that attached to a
''fileId'' (as returned by [[open]] or [[socket]]).  I propose to add
only ''terminate'' initially.  This command is adapted almost directly
from TclX's signal handling, but changed to work as a subcommand of
[[pid]].  This is to satisfy one of the most common requests from
users regarding process management - killing a known process.  It also
establishes the framework of extending [[pid]] for future
modifications.  The ?-force? argument causes a forceful termination
(the usage of SIGKILL on Unix, for Windows and Mac termination is
already forceful).

~ Reference Implementation

Although TclX's current documentation denies it, the ''send'' is
already implemented for Windows (as ''kill'' under TclX) as well as
Unix.  Macintosh implementations for OS 9 and below would need to be
created, or the documentation would need to stress that these are not
available there (OS X is Unix based).  Jim Ingham notes that a variant
of ''kill'' could be created for OS 9.  These functions are really
meant to round out the process control functionality in Tcl (started
with ''exec'' and ''open|''), which are already of limited portability
to Mac OS 9 (but undeniable usefulness elsewhere).

File: ''tcl/mac/tclMacChan.c''

File: ''tcl/unix/tclUnixPipe.c''

File: ''tcl/win/tclWinPipe.c''

Function: ''Tcl_PidObjCmd''

~ Future Potential

What this also provides is a blueprint for future process management
functions like these:

|   pid id ?-user ?userIdOrName?? ?-group ?groupIdOrName?? ?-parent parentId?
|   pid wait ?-nohang bool? ?-untraced bool? ?-group bool? ?fileIdOrPid?
|   pid nice ?-level niceLevel? fileIdOrPid
|   pid list ?pattern?
|   pid id ?-session id? ?-processgroup id?
|   pid handle action signal ...
|   pid send signalType fileIdOrPid ?fileIdOrPid ...?

[[pid wait]] was in the initial tip, but it was recommended to rework
it with callback to make it much more useful to the user.  The [[pid
id]] command was intended for Unix only, operating on the current
process id, and would function similar to the [[file attributes]]
command, but Windows NT does have similar functionality.  The
''-user'' and ''-group'' options will return the name if possible,
otherwise the id. The ''-parent'' option would be read-only (like
''-longname'' for [[file attributes]]).  [[pid send]] suffers from
cross-platform portability as well.  On Windows, you can only
''raise'' signals inside of your own process.

[[pid nice]] is easy to implement, while [[pid list]] is very much
platform sensitive.  [[pid handle]] is for signal handling, another
oft-requested feature for the core, and would be based on the TclX
[[signal]] command (perhaps named ''trap'' as in Expect?).  It could
be massaged to various forms.  These aren't to be addressed in this
TIP, but are just ideas for the future.

~ Comments

|   pid kill ?-group bool? ?-signal signalType? fileIdOrPid ?fileIdOrPid ...?

This was the original form for [[pid send]], but it was noted that we
are really sending signals.  While I prefer the specificity of users
recognizing ''kill'' as a command, what this really does is send
specific signals (ANSI C specifies SIGABRT, SIGINT and SIGTERM, and
for Unix we would handle the other POSIX names too).  Thus I changed
it to the ''send'' command documented above.

[[process]] rather than [[pid]] seems a more logical name for this
command, but we are working within the constraints of the existing
commands in order to prevent command bloat.  There is still logic in
the naming, as we are dealing with process ids.

[[pid terminate]] was also recommended to have the ability to
terminate a process ''and'' all its children.  This would be useful,
but is not in the scope of the current tip.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|
|

|

|
|

|

|

|
|

|

|
|

|
|
|
|
|

|

|

|

|

|

|
|
|
|
|
|
|

|
|
|
|

|
|
|

|

|
|

|

|

|

|

|
|
|
|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119

# TIP 88: Extend Tcl Process Id Control via 'pid'

	Author:         Jeff Hobbs <[email protected]>
	Author:         Vince Darley <[email protected]>
	State:          Rejected
	Type:           Project
	Vote:           Done
	Created:        11-Mar-2002
	Post-History:   
	Tcl-Version:    8.4
	Obsoleted-By:	240
-----

# Abstract

This TIP proposes extended the [pid] command to provide more
control over native processes in Tcl.

# Rationale

Certain process control functions have shown themselves to be portable
and of high usefulness to Tcl programmers.  Most of these already
exist in TclX, but simply requiring that extension isn't always
acceptable.  The [pid] command in Tcl is a command that is often
overlooked, and so simple that it lends itself easily to being
enhanced with new syntax.  This TIP proposes adding subcommands to
[pid] that extend the process control functionality of pure Tcl.

# Specification

	   pid ?fileId?
	   pid terminate ?-force? ?--? pid ?pid ...?

The first line is the current definition for [pid], which is to
return the name of the current process id, or that attached to a
_fileId_ \(as returned by [open] or [socket]\).  I propose to add
only _terminate_ initially.  This command is adapted almost directly
from TclX's signal handling, but changed to work as a subcommand of
[pid].  This is to satisfy one of the most common requests from
users regarding process management - killing a known process.  It also
establishes the framework of extending [pid] for future
modifications.  The ?-force? argument causes a forceful termination
\(the usage of SIGKILL on Unix, for Windows and Mac termination is
already forceful\).

# Reference Implementation

Although TclX's current documentation denies it, the _send_ is
already implemented for Windows \(as _kill_ under TclX\) as well as
Unix.  Macintosh implementations for OS 9 and below would need to be
created, or the documentation would need to stress that these are not
available there \(OS X is Unix based\).  Jim Ingham notes that a variant
of _kill_ could be created for OS 9.  These functions are really
meant to round out the process control functionality in Tcl \(started
with _exec_ and _open\|_\), which are already of limited portability
to Mac OS 9 \(but undeniable usefulness elsewhere\).

File: _tcl/mac/tclMacChan.c_

File: _tcl/unix/tclUnixPipe.c_

File: _tcl/win/tclWinPipe.c_

Function: _Tcl\_PidObjCmd_

# Future Potential

What this also provides is a blueprint for future process management
functions like these:

	   pid id ?-user ?userIdOrName?? ?-group ?groupIdOrName?? ?-parent parentId?
	   pid wait ?-nohang bool? ?-untraced bool? ?-group bool? ?fileIdOrPid?
	   pid nice ?-level niceLevel? fileIdOrPid
	   pid list ?pattern?
	   pid id ?-session id? ?-processgroup id?
	   pid handle action signal ...
	   pid send signalType fileIdOrPid ?fileIdOrPid ...?

[pid wait] was in the initial tip, but it was recommended to rework
it with callback to make it much more useful to the user.  The [pid
id] command was intended for Unix only, operating on the current
process id, and would function similar to the [file attributes]
command, but Windows NT does have similar functionality.  The
_-user_ and _-group_ options will return the name if possible,
otherwise the id. The _-parent_ option would be read-only \(like
_-longname_ for [file attributes]\).  [pid send] suffers from
cross-platform portability as well.  On Windows, you can only
_raise_ signals inside of your own process.

[pid nice] is easy to implement, while [pid list] is very much
platform sensitive.  [pid handle] is for signal handling, another
oft-requested feature for the core, and would be based on the TclX
[signal] command \(perhaps named _trap_ as in Expect?\).  It could
be massaged to various forms.  These aren't to be addressed in this
TIP, but are just ideas for the future.

# Comments

	   pid kill ?-group bool? ?-signal signalType? fileIdOrPid ?fileIdOrPid ...?

This was the original form for [pid send], but it was noted that we
are really sending signals.  While I prefer the specificity of users
recognizing _kill_ as a command, what this really does is send
specific signals \(ANSI C specifies SIGABRT, SIGINT and SIGTERM, and
for Unix we would handle the other POSIX names too\).  Thus I changed
it to the _send_ command documented above.

[process] rather than [pid] seems a more logical name for this
command, but we are working within the constraints of the existing
commands in order to prevent command bloat.  There is still logic in
the naming, as we are dealing with process ids.

[pid terminate] was also recommended to have the ability to
terminate a process _and_ all its children.  This would be useful,
but is not in the scope of the current tip.

# Copyright

This document has been placed in the public domain.

Name change from tip/89.tip to tip/89.md.

1
2
3
4
5
6
7
8
9
10
11
12
13

14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136

137
138
139
140
141
142
143
144
145

146
147
148
149
150
151
152
153
154

155
156
157
158
159
160
161
162
163
164
165
166

167
168
169
170
171
172
173
174
175
176
177
178

179
180
181
182
183
184
185
186
187

188
189
190

191
192
193
194
195

196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223

224
225
226
227
228
229
230
231
232
233
234
235
236
237
238

239
240

241
242
243

244
245
246
247
248
249
250
251
252

253
254
255
256
257
258
259
260

261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277

278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299

300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317

318
319
320
321
322
323
324
325
326
327
328
329

330
331
332
333
334
335
336

337
338
339
340

341
342
343
344
345
346
347
348

349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366

367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392

393
394
395

396
397
398
399
400
401

402
403
404

405
406
407
408
409
410
411
412
413

414

415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446

447
448
449
450
451
452
453

454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482

483
484
485
486
487
488
489
490
491
492
493
494
495
496
497

498
499
500
501
502

503
504
505
506

507
508
509
510
511
512

513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530

531
532
533
534
535

536
537
538
539
540

541
542
543
544
545
546
547

548
549
550
551
552
553
554
555
556
557
558
559
560
561

562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583

584
585
586
587
588
589
590
591
592
593
594

595
596
597
598
599
600
601
602
603
604
605
606
607
608

609
610
611
612
613
614
615
616

617
618
619
620
621
622
623

624
625
626
627

628
629

630
631
632
633
634
635
636
637

638
639
640
641
642
643

644
645
646
647

648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677

678
679
680

TIP:            89
Title:          Try/Catch Exception Handling in the Core
Version:        $Revision: 1.10 $
Author:         Tom Wilkason <[email protected]>
Author:         Frank Pilhofer <[email protected]>
State:          Withdrawn
Type:           Project
Vote:           Pending
Created:        11-Mar-2002
Post-History:   
Discussions-To: news:comp.lang.tcl
Tcl-Version:    8.6
Obsoleted-By:	329

~ Abstract

This TIP proposes the addition of a
'''try'''...'''catch'''...'''finally''' command to provide a more
robust and powerful exception handling mechanism.

~ Rationale

Exceptions are currently supported very well in Tcl, in fact they are
a major advantage over many other languages.  However the mechanism to
'''catch''' and handle the errors is someone limited and does not
promote the full use of existing error codes.  Wrapper procedures can
be written to improve on this, however both a performance and
compatibility penalty is incurred.

This TIP proposes adding a '''try/catch''' command to the Tcl core (or
C based Tcl library).  This implementation is not unlike those found
in C++, C#, Java and Python (to name a few languages).

An argument to add this to the core is that it modernizes the Tcl
exception handling without impacting performance in any other way.
'''try/catch''' are isolated commands that can easily be added, and do
not interact with other commands or require other changes.
'''try/catch''' is not an isolated extension that is useful for
special purposes only.  These commands, if implemented into the core,
will be useful for any script currently using the catch construct.

~ Specification

I propose the following two commands be added to Tcl:

 * '''throw''' command.

 > '''throw''' ?''type''? ?''message''? ?''info''?

 > A '''throw''' command with ''type'' throws an error exception with
   the errorCode ''type''. The '''throw''' command works as the
   '''error''' command, but the arguments are reordered to encourage
   the use of error-codes. The optional ''message'' and ''info''
   parameters work as they do in the '''error''' command.

 > The throw ''type'' can be any user defined or built in type,
   built-in types include POSIX, ARITH, CORE, REGEXP, WINDOWS, NONE,
   ...  The ''message'' is optional, and is the same as that issued by
   the '''catch''' command, '''error -code error''' "''message''"

 > An instance of '''throw''' with no arguments can be used within a
   '''catch''' block to immediately re-throw the current exception
   that is being handled by the '''catch''' block.  When an error is
   re-thrown in the catch block, the current error is propagated up
   one level following the evaluation of the '''finally''' block (if
   on exists).  Enclosing error handlers can then deal with the error.

 > Note that

|    throw type message info

 > is the same as

|    error message info type

 * '''try''' command.

 > '''try''' ''body'' ?'''catch''' {{''type_list''} ?''ecvar''? ?''msgvar''? ?''infovar''?} ''body ...''? ?'''finally''' ''body''?

 > If one or more '''catch''' blocks are specified, each corresponding
   ''body'' represents a required block of code that is evaluated if
   the resulting errorCode matches the ''type'' condition.  The
   required body of the '''finally''' block is evaluated following the
   '''try''' block and '''catch''' block (if any matches).

 > ''type_list'' represents a list of glob style patterns used to
   match eache of the error-code list conditions.  A match is declared
   if the ''type_list'' patterns or errorCode elements are exhausted
   (whichever comes first) and a mismatch has not occurred.  If a
   match occurs, and ''ecvar'' is specified, the errorCode list will
   be stored in ''ecvar'' within the local scope prior to executing
   the ''body''.  Moreover, if a ''msgvar'' or ''infovar'' are
   specified, the error message and errorInfo contents will be stored
   in the local context.

 > If an error occurs during the '''try''', and no ''catch'' blocks
   are specified, the offending error is rethrown following execution
   of the ''finally'' block (if specified).

 > If an error occurs during execution of a '''catch''' or
   '''finally''' block, this error will take precedence and will
   propagate upwards with a new stack trace.  If an error is rethrown
   within a catch block, the existing stack trace will be preserved
   with the rethrown error.  This allows later discrimination of the
   two different error conditions (rethrown vs. unintended).

 > Note, '''catch''' {''*''}, if specified, will catch all remaining
   errors.  If used, it should be placed last since each of the catch
   blocks are evaluated in the order specified.  ''type'' is that set
   in errorCode, and can be any user defined type, or built-in types
   including POSIX *, ARITH *, CHILD *, CORE, REGEXP, WINDOWS, or
   NONE.

 > If one or more '''catch''' blocks are specified, and no '''catch'''
   block matches the errorCode condition, the error will be propagated
   up to the next level following evaluation of the '''finally'''
   clause (if specified).  An enclosing '''try''' block (or
   '''catch''' command) can then be used to handle the error.

 > The '''finally''' block is used to perform all the clean up code.
   The '''finally''' body is evaluated whether the error occurs or
   not, or whether a '''catch''' block matched the errorCode.  It is
   also evaluated if a ''throw'' statement occurs within the
   '''catch''' clause.

~ Examples

'''throw'''

|    throw DEVICE "Could not write to device"

'''try''' only (no practical use)

|    try {
|       incr i
|    }

'''try - catch'''

|    try {
|       incr i
|    } catch * {
|       set i 0
|    }

'''try - finally'''

|    try {
|       . config -cursor watch
|       #do some busy stuff here, don't care about errors
|    } finally {
|       . config -cursor arrow
|    }

'''try - catch - catch'''

|    try {
|       ;# Some code that will cause an error
|    } catch {{POSIX *} eCode eMessage} {
|       ;# Statements to handle POSIX type errors
|    } catch {NULL eCode eMessage} {
|       ;# Statements to handle NULL (a user created) type errors
|    } catch {* eMessage} {
|       ;# Statements to handle all other errors
|    }

'''try - catch - catch - finally'''

|    try {
|       ;# Some code that will cause an error
|    } catch {POSIX eCode eMessage} {
|       ;# Statements to handle POSIX type errors
|    } catch {* eCode eMessage} {
|       ;# Statements to handle all other errors
|    } finally {
|       ;# Statements to execute whether an error occurred or not
|    }

Re-throw '''try - catch - finally'''

|    try {
|       try {
|          set b [expr {$a/0}]
|       } catch {ARITH} {
|          if {$a == 0} {
|             throw   ;# re-throw to outer try
|          }

|       } finally {
|          set b 1    ;# will execute before throw above
|       }

|    } catch {ARITH eCode eMessage} {
|       ;# This will catch the inner throw
|       puts "$res"
|    }

~ Revisions: Tom Wilkason March 26, 2002

  * Added additional ''ecvar'' and ''infovar'' optional arguments to
    the '''catch''' clause.

  * All uncaught errors are propagated up after execution of the
    finally block (if specified).

  * Unanticipated errors within a '''catch''' or '''finally''' block
    start a new stack trace and are propagated up.

  * Additional ''info'' optional argument added to '''throw''' for
    completeness.

~ Reference Implementation

| /*
|  * Implementation of try/catch and throw commands according to TIP 89
|  */
|
| #include <tcl.h>
|
| /*
|  * We keep a stack of contexts; whenever we have to handle an error,
|  * i.e. are executing a catch {} clause, we store the current error
|  * (errorCode, errorInfo and message), so that a throw with no arguments
|  * can re-throw it.
|  *

|  * This is interpreter-specific data. Each element is a list, with the
|  * last element being the most current one.
|  */
|
| typedef struct {
|   Tcl_Obj * errorCodeStack;
|   Tcl_Obj * errorInfoStack;
|   Tcl_Obj * errorMsgStack;
|   Tcl_Obj * errorCodeName;
|   Tcl_Obj * errorInfoName;
| } TryCatchTsd;
|
| /*
|  * Throw an Exception
|  *

|  * throw ?<type> ?<message>? ?<info>??
|  *

|  * Throws an exception with the errorCode <type>, the message <message>
|  * and the errorInfo <info>.
|  *

|  * An instance of throw with no arguments can be used within a catch or
|  * finally block to immediately re-throw the current exception that is
|  * being handled by the catch block.
|  */
|
| static int
| Tcl_ThrowObjCmd (ClientData clientData, Tcl_Interp *interp,
|                  int objc, Tcl_Obj *CONST objv[])
| {

|   TryCatchTsd * myTsd = (TryCatchTsd *) clientData;
|
|   if (objc < 1 || objc > 4) {
|     Tcl_AppendResult (interp, "wrong # args: should be \"",
|                       Tcl_GetStringFromObj (objv[0], NULL),
|                       " ?<type> ?<message>? ?<info>??\"", NULL);
|     return TCL_ERROR;
|   }

|
|   /*
|    * Re-throw an error
|    */
|
|   if (objc < 2) {
|     Tcl_Obj *errorCode, *errorInfo, *errorMsg;
|     int lastelement;
|
|     Tcl_ListObjLength (interp, myTsd->errorMsgStack, &lastelement);
|
|     if (lastelement < 1) {
|       Tcl_AppendResult (interp, "error: throw with no parameters ",
|                         "outside of a catch",
|                         NULL);
|       return TCL_ERROR;
|     }

|
|     lastelement--;
|     Tcl_ListObjIndex (interp, myTsd->errorMsgStack,
|                       lastelement, &errorMsg);
|     Tcl_ListObjIndex (interp, myTsd->errorCodeStack,
|                       lastelement, &errorCode);
|     Tcl_ListObjIndex (interp, myTsd->errorInfoStack,
|                       lastelement, &errorInfo);
|
|     Tcl_ResetResult (interp);
|     Tcl_SetObjResult (interp, errorMsg);
|     Tcl_SetObjErrorCode (interp, errorCode);
|
| #ifdef _TCLINT
|     Tcl_ObjSetVar2 (interp, myTsd->errorInfoName, NULL, errorInfo,
|                     TCL_GLOBAL_ONLY);
|     interp->flags = ERR_IN_PROGRESS;
| #else
|     Tcl_AddErrorInfo (interp, Tcl_GetStringFromObj (errorInfo, NULL));
| #endif
|     return TCL_ERROR;
|   }

|
|   /*
|    * throw with parameters
|    */
|
|   Tcl_ResetResult (interp);
|
|   if (objc >= 3) {
|     Tcl_SetObjResult (interp, objv[2]);
|   } else {
|     /*
|      * fabricate some error message for human consumption
|      */
|
|     Tcl_AppendResult (interp, "error: ",
|                       Tcl_GetStringFromObj (objv[1], NULL),
|                       NULL);
|   }

|
|   Tcl_SetObjErrorCode (interp, objv[1]);
|
|   if (objc >= 4) {
| #ifdef _TCLINT
|     Tcl_ObjSetVar2 (interp, myTsd->errorInfoName, NULL, objv[3],
|                     TCL_GLOBAL_ONLY);
|     interp->flags = ERR_IN_PROGRESS;
| #else
|     Tcl_AddErrorInfo (interp, Tcl_GetStringFromObj (objv[3], NULL));
| #endif
|   }

|
|   /*
|    * throw error
|    */
|
|   return TCL_ERROR;
| }

|
| /*
|  * exception handling
|  *

|  * try body ?catch {type-list ?ecvar? ?msgvar? ?infovar?} body ...?
|  *          ?finally body?
|  */
|
| static int
| Tcl_TryObjCmd (ClientData clientData, Tcl_Interp *interp,
|                int objc, Tcl_Obj *CONST objv[])
| {

|   TryCatchTsd * myTsd = (TryCatchTsd *) clientData;
|   int currentIndex, finallyIndex, catchInfoLength, hasCatch;
|   char * blockType;
|   int res;
|
|   /*
|    * first check for syntactic correctness before doing anything
|    */
|
|   if (objc < 2) {
|     Tcl_AppendResult (interp, "wrong # args: should be \"",
|                       Tcl_GetStringFromObj (objv[0], NULL),
|                       " body ",
|                       "?catch {type-list ?ecvar? ?msgvar? ?infovar?} ",
|                       "body ...? ",
|                       "?finally body?\"", NULL);
|     return TCL_ERROR;
|   }

|
|   currentIndex = 2;
|   finallyIndex = -1;
|   hasCatch = 0;
|
|   while (currentIndex < objc) {
|     blockType = Tcl_GetStringFromObj (objv[currentIndex], NULL);
|
|     if (strcmp (blockType, "catch") == 0) {
|       Tcl_Obj * typeList;
|       int typeListLength;
|
|       if (currentIndex+2 >= objc ||
|           Tcl_ListObjLength (interp, objv[currentIndex+1],
|                              &catchInfoLength) != TCL_OK ||
|           (catchInfoLength < 1 && catchInfoLength > 4) ||
|           Tcl_ListObjIndex (interp, objv[currentIndex+1],
|                             0, &typeList) != TCL_OK ||
|           Tcl_ListObjLength (interp, typeList,
|                              &typeListLength) != TCL_OK) {
|         Tcl_AppendResult (interp, "invalid syntax in catch clause: ",
|                           "should be \"",
|                           "catch {type-list ?ecvar? ?msgvar? ?infovar?} ",
|                           "body\"", NULL);
|         return TCL_ERROR;
|       }

|       hasCatch = 1;
|       currentIndex += 3;
|     }

|     else if (strcmp (blockType, "finally") == 0) {
|       if (currentIndex+2 != objc) {
|         Tcl_AppendResult (interp, "trailing args after finally clause",
|                           NULL);
|         return TCL_ERROR;
|       }

|       finallyIndex = currentIndex;
|       currentIndex += 2;
|     }

|     else {
|       Tcl_AppendResult (interp, "invalid syntax: should be \"",
|                         Tcl_GetStringFromObj (objv[0], NULL),
|                         " body ",
|                         "?catch {type-list ?ecvar? ?msgvar? ?infovar?} ",
|                         "body ...? ",
|                         "?finally body?\"", NULL);
|       return TCL_ERROR;
|     }

|   }

|
|   /*
|    * Eval main body
|    */
|
|   res = Tcl_EvalObjEx (interp, objv[1], 0);
|
|   /*
|    * In case of error, check the catch clauses
|    */
|
|   if (res == TCL_ERROR) {
|     Tcl_Obj *errorCode, *errorInfo, *errorMsg;
|     int errorCodeLength, stackLength;
|
|     errorMsg = Tcl_GetObjResult (interp);
|     errorCode = Tcl_ObjGetVar2 (interp, myTsd->errorCodeName, NULL,
|                                 TCL_GLOBAL_ONLY);
|     errorInfo = Tcl_ObjGetVar2 (interp, myTsd->errorInfoName, NULL,
|                                 TCL_GLOBAL_ONLY);
|
|     /*
|      * After an error has happened, errorCode and errorInfo should
|      * exist.
|      */
|
|     if (errorCode == NULL || errorInfo == NULL) {
|       Tcl_AppendResult (interp, "assertion error in try: ",
|                         "no errorCode or no errorInfo",
|                         NULL);
|       return TCL_ERROR;
|     }

|
|     if (Tcl_ListObjLength (interp, errorCode, &errorCodeLength) != TCL_OK) {
|       Tcl_AppendResult (interp, "assertion error in try: "
|                         "errorCode is not a list",
|                         NULL);
|       return TCL_ERROR;
|     }

|
|     /*
|      * push error data on stack, so that throw can rethrow the error
|      */
|
|     Tcl_ListObjAppendElement (interp, myTsd->errorMsgStack, errorMsg);
|     Tcl_ListObjAppendElement (interp, myTsd->errorCodeStack, errorCode);
|     Tcl_ListObjAppendElement (interp, myTsd->errorInfoStack, errorInfo);
|
|     /*
|      * Look for a matching clause
|      */
|
|     currentIndex = 2;
|
|     while (currentIndex < objc) {
|       blockType = Tcl_GetStringFromObj (objv[currentIndex], NULL);
|
|       if (strcmp (blockType, "catch") == 0) {
|         int typeListLength, matchIndex;
|         Tcl_Obj *typeList;
|
|         Tcl_ListObjIndex  (interp, objv[currentIndex+1], 0, &typeList);
|         Tcl_ListObjLength (interp, typeList, &typeListLength);
|
|         if (typeListLength > errorCodeLength) {
|           currentIndex += 3;
|           continue;
|         }

|
|         for (matchIndex=0; matchIndex<typeListLength; matchIndex++) {
|           Tcl_Obj *errorCodeItem, *typeListItem;
|           const char *errorCodeItemStr, *typeListItemStr;
|
|           Tcl_ListObjIndex (interp, errorCode, matchIndex, &errorCodeItem);
|           Tcl_ListObjIndex (interp, typeList, matchIndex, &typeListItem);
|
|           errorCodeItemStr = Tcl_GetStringFromObj (errorCodeItem, NULL);
|           typeListItemStr = Tcl_GetStringFromObj (typeListItem, NULL);
|
|           if (!Tcl_StringMatch (errorCodeItemStr, typeListItemStr)) {
|             break;
|           }
|         }

|
|         if (matchIndex >= typeListLength) {
|           /* matching catch clause found */
|           break;
|         }

|
|         /* continue looking */
|         currentIndex += 3;
|       }

|       else {
|         /* not a catch clause - there are no matching catch clauses */
|         currentIndex = objc;
|         break;
|       }
|     }

|
|     /*
|      * Did we find a matching catch clause?
|      */
|
|     if (currentIndex < objc) {
|       Tcl_Obj *ecvar, *msgvar, *infovar;
|
|       Tcl_ListObjLength (interp, objv[currentIndex+1], &catchInfoLength);
|
|       /*
|        * set variables with error data
|        */
|
|       if (catchInfoLength >= 2) {
|         Tcl_ListObjIndex (interp, objv[currentIndex+1], 1, &ecvar);
|         Tcl_ObjSetVar2 (interp, ecvar, NULL, errorCode, 0);
|       }

|
|       if (catchInfoLength >= 3) {
|         Tcl_ListObjIndex (interp, objv[currentIndex+1], 2, &msgvar);
|         Tcl_ObjSetVar2 (interp, msgvar, NULL, errorMsg, 0);
|       }

|
|       if (catchInfoLength >= 4) {
|         Tcl_ListObjIndex (interp, objv[currentIndex+1], 3, &infovar);
|         Tcl_ObjSetVar2 (interp, infovar, NULL, errorInfo, 0);
|       }

|
|       /*
|        * call body; the error code of this body takes precedence
|        */
|
|       res = Tcl_EvalObjEx (interp, objv[currentIndex+2], 0);
|     }

|
|     /*
|      * pop error data from stack
|      */
|
|     Tcl_ListObjLength (interp, myTsd->errorMsgStack, &stackLength);
|     stackLength--;
|     Tcl_ListObjReplace (interp, myTsd->errorMsgStack,
|                         stackLength, 1, 0, NULL);
|     Tcl_ListObjReplace (interp, myTsd->errorCodeStack,
|                         stackLength, 1, 0, NULL);
|     Tcl_ListObjReplace (interp, myTsd->errorInfoStack,
|                         stackLength, 1, 0, NULL);
|   }

|
|   /*
|    * Execute finally body. Preserve errorCode and friends; they might
|    * be corrupted by the code in the body - e.g. by a try in the code,
|    * or in a proc called by the code.
|    */
|
|   if (finallyIndex != -1) {
|     Tcl_Obj *errorCode, *errorInfo, *errorMsg;
|     int finallyres, origres=res;
|
|     errorMsg = Tcl_GetObjResult (interp);
|     Tcl_IncrRefCount (errorMsg);
|
|     if (origres == TCL_ERROR) {
|       errorCode = Tcl_ObjGetVar2 (interp, myTsd->errorCodeName, NULL,
|                                   TCL_GLOBAL_ONLY);
|       errorInfo = Tcl_ObjGetVar2 (interp, myTsd->errorInfoName, NULL,
|                                   TCL_GLOBAL_ONLY);
|       Tcl_IncrRefCount (errorCode);
|       Tcl_IncrRefCount (errorInfo);
|     }

|
|     finallyres = Tcl_EvalObjEx (interp, objv[finallyIndex+1], 0);
|
|     /*
|      * An Error in the finally clause takes precedence, else restore
|      * previous error data
|      */
|
|     if (finallyres != TCL_OK) {
|       res = finallyres;
|     }

|     else {
|       Tcl_SetObjResult (interp, errorMsg);
|
|       if (origres == TCL_ERROR) {
|         Tcl_SetObjErrorCode (interp, errorCode);
| #ifdef _TCLINT
|         Tcl_ObjSetVar2 (interp, myTsd->errorInfoName, NULL, errorInfo,
|                         TCL_GLOBAL_ONLY);
|         interp->flags = ERR_IN_PROGRESS;
| #else
|         Tcl_AddErrorInfo (interp, Tcl_GetStringFromObj (errorInfo, NULL));
| #endif
|       }
|     }

|
|     Tcl_DecrRefCount (errorMsg);
|
|     if (origres == TCL_ERROR) {
|       Tcl_DecrRefCount (errorCode);
|       Tcl_DecrRefCount (errorInfo);
|     }
|   }

|
|   /*
|    * Pass along return code
|    */
|
|   return res;
| }

|
| /*
|  * ----------------------------------------------------------------------
|  *

|  * "Main" function, install our commands in the Tcl interpreter
|  *

|  * ----------------------------------------------------------------------
|  */
|
| #undef TCL_STORAGE_CLASS
| #define TCL_STORAGE_CLASS DLLEXPORT
| EXTERN int
| Trycatch_Init (Tcl_Interp *interp)
| {

|   TryCatchTsd * myTsd;
|
| #ifdef USE_TCL_STUBS
|   if (Tcl_InitStubs (interp, TCL_VERSION, 0) == NULL) {
|     return TCL_ERROR;
|   }

| #else
|   if (Tcl_PkgRequire (interp, "Tcl", TCL_VERSION, 1) == NULL) {
|     return TCL_ERROR;
|   }

| #endif
|
|   /*
|    * Allocate Tsd
|    */
|
|   myTsd = (TryCatchTsd *) Tcl_Alloc (sizeof (TryCatchTsd));
|   myTsd->errorCodeStack = Tcl_NewObj ();
|   myTsd->errorInfoStack = Tcl_NewObj ();
|   myTsd->errorMsgStack  = Tcl_NewObj ();
|   myTsd->errorCodeName  = Tcl_NewStringObj ("errorCode", -1);
|   myTsd->errorInfoName  = Tcl_NewStringObj ("errorInfo", -1);
|
|   /*
|    * add commands
|    */
|
|   Tcl_CreateObjCommand (interp, "throw", Tcl_ThrowObjCmd,
|                         (ClientData) myTsd, NULL);
|   Tcl_CreateObjCommand (interp, "try", Tcl_TryObjCmd,
|                         (ClientData) myTsd, NULL);
|
|   /*
|    * Ready
|    */
|
|   Tcl_PkgProvide (interp, "trycatch", "0.1");
|   return TCL_OK;
| }

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
|
>

|

|

|

|

|
|
|

|

|

|

|

|

|
|
|
|
|

|

|
|

|
|
|

|
|

|

|

|

|

|

|

|
|
|
|
|

|

|
|
|
|
|

|

|

|
|

|

|

|

|

|

|
|
|

|
|
|
|
|

|

|

|

|

|
|
<
>

|

|
|
|
|
<
|
>
|

|
|
|
|
|
<
|
>
|

|
|
|
|
|
|
|
|
<
|
>
|

|
|
|
|
|
|
|
|
<
|
>
|

|
|
|
|
|
|
<
>
|
|
<
>
|
|
|
<
|
>
|

|
|

|

|

|

|

|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
<
>
|
|
|
|
|
<
>
|
|
<
>
|
|
|
|
|
|
|
|
<
>
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
<
<
>
>
|
|
|
|
<
>
|
|
|
<
>
|
|
|
|
<
<
>
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
<
<
>
>
|
|
|
|
|
|
<
<
>
>
|
|
|
|
|
|
<
>
|
|
|
<
>
|
<
>
|
|
|
|
|
|
|
<
>
|
|
|
|
|
<
>
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
|
>
|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134

135
136
137
138
139
140
141
142

143
144
145
146
147
148
149
150
151

152
153
154
155
156
157
158
159
160
161
162
163

164
165
166
167
168
169
170
171
172
173
174
175

176
177
178
179
180
181
182
183
184
185

186
187
188

189
190
191
192

193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221

222
223
224
225
226
227
228
229
230
231
232
233
234
235
236

237
238

239
240
241

242
243
244
245
246
247
248
249
250

251
252
253
254
255
256
257
258

259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275

276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297

298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315

316
317
318
319
320
321
322
323
324
325
326
327

328
329
330
331
332
333
334

335
336
337
338

339
340
341
342
343
344
345
346

347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364

365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390

391
392
393

394
395
396
397
398
399

400
401
402

403
404
405
406
407
408
409
410
411

412

413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444

445
446
447
448
449
450
451

452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480

481
482
483
484
485
486
487
488
489
490
491
492
493
494

495
496
497
498
499
500

501
502
503
504

505
506
507
508
509

510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528

529
530
531
532
533

534
535
536
537
538

539
540
541
542
543
544
545

546
547
548
549
550
551
552
553
554
555
556
557
558
559

560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581

582
583
584
585
586
587
588
589
590
591
592

593
594
595
596
597
598
599
600
601
602
603
604
605

606
607
608
609
610
611
612
613

614
615
616
617
618
619
620
621

622
623
624
625

626
627

628
629
630
631
632
633
634
635

636
637
638
639
640
641

642
643
644
645

646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674

675
676
677
678
679
680

# TIP 89: Try/Catch Exception Handling in the Core

	Author:         Tom Wilkason <[email protected]>
	Author:         Frank Pilhofer <[email protected]>
	State:          Withdrawn
	Type:           Project
	Vote:           Pending
	Created:        11-Mar-2002
	Post-History:   
	Discussions-To: news:comp.lang.tcl
	Tcl-Version:    8.6
	Obsoleted-By:	329
-----

# Abstract

This TIP proposes the addition of a
**try**...**catch**...**finally** command to provide a more
robust and powerful exception handling mechanism.

# Rationale

Exceptions are currently supported very well in Tcl, in fact they are
a major advantage over many other languages.  However the mechanism to
**catch** and handle the errors is someone limited and does not
promote the full use of existing error codes.  Wrapper procedures can
be written to improve on this, however both a performance and
compatibility penalty is incurred.

This TIP proposes adding a **try/catch** command to the Tcl core \(or
C based Tcl library\).  This implementation is not unlike those found
in C\+\+, C\#, Java and Python \(to name a few languages\).

An argument to add this to the core is that it modernizes the Tcl
exception handling without impacting performance in any other way.
**try/catch** are isolated commands that can easily be added, and do
not interact with other commands or require other changes.
**try/catch** is not an isolated extension that is useful for
special purposes only.  These commands, if implemented into the core,
will be useful for any script currently using the catch construct.

# Specification

I propose the following two commands be added to Tcl:

 * **throw** command.

	 > **throw** ?_type_? ?_message_? ?_info_?

	 > A **throw** command with _type_ throws an error exception with
   the errorCode _type_. The **throw** command works as the
   **error** command, but the arguments are reordered to encourage
   the use of error-codes. The optional _message_ and _info_
   parameters work as they do in the **error** command.

	 > The throw _type_ can be any user defined or built in type,
   built-in types include POSIX, ARITH, CORE, REGEXP, WINDOWS, NONE,
   ...  The _message_ is optional, and is the same as that issued by
   the **catch** command, **error -code error** "_message_"

	 > An instance of **throw** with no arguments can be used within a
   **catch** block to immediately re-throw the current exception
   that is being handled by the **catch** block.  When an error is
   re-thrown in the catch block, the current error is propagated up
   one level following the evaluation of the **finally** block \(if
   on exists\).  Enclosing error handlers can then deal with the error.

	 > Note that

		    throw type message info

	 > is the same as

		    error message info type

 * **try** command.

	 > **try** _body_ ?**catch** \{\{_type\_list_\} ?_ecvar_? ?_msgvar_? ?_infovar_?\} _body ..._? ?**finally** _body_?

	 > If one or more **catch** blocks are specified, each corresponding
   _body_ represents a required block of code that is evaluated if
   the resulting errorCode matches the _type_ condition.  The
   required body of the **finally** block is evaluated following the
   **try** block and **catch** block \(if any matches\).

	 > _type\_list_ represents a list of glob style patterns used to
   match eache of the error-code list conditions.  A match is declared
   if the _type\_list_ patterns or errorCode elements are exhausted
   \(whichever comes first\) and a mismatch has not occurred.  If a
   match occurs, and _ecvar_ is specified, the errorCode list will
   be stored in _ecvar_ within the local scope prior to executing
   the _body_.  Moreover, if a _msgvar_ or _infovar_ are
   specified, the error message and errorInfo contents will be stored
   in the local context.

	 > If an error occurs during the **try**, and no _catch_ blocks
   are specified, the offending error is rethrown following execution
   of the _finally_ block \(if specified\).

	 > If an error occurs during execution of a **catch** or
   **finally** block, this error will take precedence and will
   propagate upwards with a new stack trace.  If an error is rethrown
   within a catch block, the existing stack trace will be preserved
   with the rethrown error.  This allows later discrimination of the
   two different error conditions \(rethrown vs. unintended\).

	 > Note, **catch** \{_\*_\}, if specified, will catch all remaining
   errors.  If used, it should be placed last since each of the catch
   blocks are evaluated in the order specified.  _type_ is that set
   in errorCode, and can be any user defined type, or built-in types
   including POSIX \*, ARITH \*, CHILD \*, CORE, REGEXP, WINDOWS, or
   NONE.

	 > If one or more **catch** blocks are specified, and no **catch**
   block matches the errorCode condition, the error will be propagated
   up to the next level following evaluation of the **finally**
   clause \(if specified\).  An enclosing **try** block \(or
   **catch** command\) can then be used to handle the error.

	 > The **finally** block is used to perform all the clean up code.
   The **finally** body is evaluated whether the error occurs or
   not, or whether a **catch** block matched the errorCode.  It is
   also evaluated if a _throw_ statement occurs within the
   **catch** clause.

# Examples

**throw**

	    throw DEVICE "Could not write to device"

**try** only \(no practical use\)

	    try {
	       incr i

	    }

**try - catch**

	    try {
	       incr i
	    } catch * {
	       set i 0

	    }

**try - finally**

	    try {
	       . config -cursor watch
	       #do some busy stuff here, don't care about errors
	    } finally {
	       . config -cursor arrow

	    }

**try - catch - catch**

	    try {
	       ;# Some code that will cause an error
	    } catch {{POSIX *} eCode eMessage} {
	       ;# Statements to handle POSIX type errors
	    } catch {NULL eCode eMessage} {
	       ;# Statements to handle NULL (a user created) type errors
	    } catch {* eMessage} {
	       ;# Statements to handle all other errors

	    }

**try - catch - catch - finally**

	    try {
	       ;# Some code that will cause an error
	    } catch {POSIX eCode eMessage} {
	       ;# Statements to handle POSIX type errors
	    } catch {* eCode eMessage} {
	       ;# Statements to handle all other errors
	    } finally {
	       ;# Statements to execute whether an error occurred or not

	    }

Re-throw **try - catch - finally**

	    try {
	       try {
	          set b [expr {$a/0}]
	       } catch {ARITH} {
	          if {$a == 0} {
	             throw   ;# re-throw to outer try

	          }
	       } finally {
	          set b 1    ;# will execute before throw above

	       }
	    } catch {ARITH eCode eMessage} {
	       ;# This will catch the inner throw
	       puts "$res"

	    }

# Revisions: Tom Wilkason March 26, 2002

  * Added additional _ecvar_ and _infovar_ optional arguments to
    the **catch** clause.

  * All uncaught errors are propagated up after execution of the
    finally block \(if specified\).

  * Unanticipated errors within a **catch** or **finally** block
    start a new stack trace and are propagated up.

  * Additional _info_ optional argument added to **throw** for
    completeness.

# Reference Implementation

	 /*
	  * Implementation of try/catch and throw commands according to TIP 89
	  */

	 #include <tcl.h>

	 /*
	  * We keep a stack of contexts; whenever we have to handle an error,
	  * i.e. are executing a catch {} clause, we store the current error
	  * (errorCode, errorInfo and message), so that a throw with no arguments
	  * can re-throw it.

	  *
	  * This is interpreter-specific data. Each element is a list, with the
	  * last element being the most current one.
	  */

	 typedef struct {
	   Tcl_Obj * errorCodeStack;
	   Tcl_Obj * errorInfoStack;
	   Tcl_Obj * errorMsgStack;
	   Tcl_Obj * errorCodeName;
	   Tcl_Obj * errorInfoName;
	 } TryCatchTsd;

	 /*
	  * Throw an Exception

	  *
	  * throw ?<type> ?<message>? ?<info>??

	  *
	  * Throws an exception with the errorCode <type>, the message <message>
	  * and the errorInfo <info>.

	  *
	  * An instance of throw with no arguments can be used within a catch or
	  * finally block to immediately re-throw the current exception that is
	  * being handled by the catch block.
	  */

	 static int
	 Tcl_ThrowObjCmd (ClientData clientData, Tcl_Interp *interp,
	                  int objc, Tcl_Obj *CONST objv[])

	 {
	   TryCatchTsd * myTsd = (TryCatchTsd *) clientData;

	   if (objc < 1 || objc > 4) {
	     Tcl_AppendResult (interp, "wrong # args: should be \"",
	                       Tcl_GetStringFromObj (objv[0], NULL),
	                       " ?<type> ?<message>? ?<info>??\"", NULL);
	     return TCL_ERROR;

	   }

	   /*
	    * Re-throw an error
	    */

	   if (objc < 2) {
	     Tcl_Obj *errorCode, *errorInfo, *errorMsg;
	     int lastelement;

	     Tcl_ListObjLength (interp, myTsd->errorMsgStack, &lastelement);

	     if (lastelement < 1) {
	       Tcl_AppendResult (interp, "error: throw with no parameters ",
	                         "outside of a catch",
	                         NULL);
	       return TCL_ERROR;

	     }

	     lastelement--;
	     Tcl_ListObjIndex (interp, myTsd->errorMsgStack,
	                       lastelement, &errorMsg);
	     Tcl_ListObjIndex (interp, myTsd->errorCodeStack,
	                       lastelement, &errorCode);
	     Tcl_ListObjIndex (interp, myTsd->errorInfoStack,
	                       lastelement, &errorInfo);

	     Tcl_ResetResult (interp);
	     Tcl_SetObjResult (interp, errorMsg);
	     Tcl_SetObjErrorCode (interp, errorCode);

	 #ifdef _TCLINT
	     Tcl_ObjSetVar2 (interp, myTsd->errorInfoName, NULL, errorInfo,
	                     TCL_GLOBAL_ONLY);
	     interp->flags = ERR_IN_PROGRESS;
	 #else
	     Tcl_AddErrorInfo (interp, Tcl_GetStringFromObj (errorInfo, NULL));
	 #endif
	     return TCL_ERROR;

	   }

	   /*
	    * throw with parameters
	    */

	   Tcl_ResetResult (interp);

	   if (objc >= 3) {
	     Tcl_SetObjResult (interp, objv[2]);
	   } else {
	     /*
	      * fabricate some error message for human consumption
	      */

	     Tcl_AppendResult (interp, "error: ",
	                       Tcl_GetStringFromObj (objv[1], NULL),
	                       NULL);

	   }

	   Tcl_SetObjErrorCode (interp, objv[1]);

	   if (objc >= 4) {
	 #ifdef _TCLINT
	     Tcl_ObjSetVar2 (interp, myTsd->errorInfoName, NULL, objv[3],
	                     TCL_GLOBAL_ONLY);
	     interp->flags = ERR_IN_PROGRESS;
	 #else
	     Tcl_AddErrorInfo (interp, Tcl_GetStringFromObj (objv[3], NULL));
	 #endif

	   }

	   /*
	    * throw error
	    */

	   return TCL_ERROR;

	 }

	 /*
	  * exception handling

	  *
	  * try body ?catch {type-list ?ecvar? ?msgvar? ?infovar?} body ...?
	  *          ?finally body?
	  */

	 static int
	 Tcl_TryObjCmd (ClientData clientData, Tcl_Interp *interp,
	                int objc, Tcl_Obj *CONST objv[])

	 {
	   TryCatchTsd * myTsd = (TryCatchTsd *) clientData;
	   int currentIndex, finallyIndex, catchInfoLength, hasCatch;
	   char * blockType;
	   int res;

	   /*
	    * first check for syntactic correctness before doing anything
	    */

	   if (objc < 2) {
	     Tcl_AppendResult (interp, "wrong # args: should be \"",
	                       Tcl_GetStringFromObj (objv[0], NULL),
	                       " body ",
	                       "?catch {type-list ?ecvar? ?msgvar? ?infovar?} ",
	                       "body ...? ",
	                       "?finally body?\"", NULL);
	     return TCL_ERROR;

	   }

	   currentIndex = 2;
	   finallyIndex = -1;
	   hasCatch = 0;

	   while (currentIndex < objc) {
	     blockType = Tcl_GetStringFromObj (objv[currentIndex], NULL);

	     if (strcmp (blockType, "catch") == 0) {
	       Tcl_Obj * typeList;
	       int typeListLength;

	       if (currentIndex+2 >= objc ||
	           Tcl_ListObjLength (interp, objv[currentIndex+1],
	                              &catchInfoLength) != TCL_OK ||
	           (catchInfoLength < 1 && catchInfoLength > 4) ||
	           Tcl_ListObjIndex (interp, objv[currentIndex+1],
	                             0, &typeList) != TCL_OK ||
	           Tcl_ListObjLength (interp, typeList,
	                              &typeListLength) != TCL_OK) {
	         Tcl_AppendResult (interp, "invalid syntax in catch clause: ",
	                           "should be \"",
	                           "catch {type-list ?ecvar? ?msgvar? ?infovar?} ",
	                           "body\"", NULL);
	         return TCL_ERROR;

	       }
	       hasCatch = 1;
	       currentIndex += 3;

	     }
	     else if (strcmp (blockType, "finally") == 0) {
	       if (currentIndex+2 != objc) {
	         Tcl_AppendResult (interp, "trailing args after finally clause",
	                           NULL);
	         return TCL_ERROR;

	       }
	       finallyIndex = currentIndex;
	       currentIndex += 2;

	     }
	     else {
	       Tcl_AppendResult (interp, "invalid syntax: should be \"",
	                         Tcl_GetStringFromObj (objv[0], NULL),
	                         " body ",
	                         "?catch {type-list ?ecvar? ?msgvar? ?infovar?} ",
	                         "body ...? ",
	                         "?finally body?\"", NULL);
	       return TCL_ERROR;

	     }

	   }

	   /*
	    * Eval main body
	    */

	   res = Tcl_EvalObjEx (interp, objv[1], 0);

	   /*
	    * In case of error, check the catch clauses
	    */

	   if (res == TCL_ERROR) {
	     Tcl_Obj *errorCode, *errorInfo, *errorMsg;
	     int errorCodeLength, stackLength;

	     errorMsg = Tcl_GetObjResult (interp);
	     errorCode = Tcl_ObjGetVar2 (interp, myTsd->errorCodeName, NULL,
	                                 TCL_GLOBAL_ONLY);
	     errorInfo = Tcl_ObjGetVar2 (interp, myTsd->errorInfoName, NULL,
	                                 TCL_GLOBAL_ONLY);

	     /*
	      * After an error has happened, errorCode and errorInfo should
	      * exist.
	      */

	     if (errorCode == NULL || errorInfo == NULL) {
	       Tcl_AppendResult (interp, "assertion error in try: ",
	                         "no errorCode or no errorInfo",
	                         NULL);
	       return TCL_ERROR;

	     }

	     if (Tcl_ListObjLength (interp, errorCode, &errorCodeLength) != TCL_OK) {
	       Tcl_AppendResult (interp, "assertion error in try: "
	                         "errorCode is not a list",
	                         NULL);
	       return TCL_ERROR;

	     }

	     /*
	      * push error data on stack, so that throw can rethrow the error
	      */

	     Tcl_ListObjAppendElement (interp, myTsd->errorMsgStack, errorMsg);
	     Tcl_ListObjAppendElement (interp, myTsd->errorCodeStack, errorCode);
	     Tcl_ListObjAppendElement (interp, myTsd->errorInfoStack, errorInfo);

	     /*
	      * Look for a matching clause
	      */

	     currentIndex = 2;

	     while (currentIndex < objc) {
	       blockType = Tcl_GetStringFromObj (objv[currentIndex], NULL);

	       if (strcmp (blockType, "catch") == 0) {
	         int typeListLength, matchIndex;
	         Tcl_Obj *typeList;

	         Tcl_ListObjIndex  (interp, objv[currentIndex+1], 0, &typeList);
	         Tcl_ListObjLength (interp, typeList, &typeListLength);

	         if (typeListLength > errorCodeLength) {
	           currentIndex += 3;
	           continue;

	         }

	         for (matchIndex=0; matchIndex<typeListLength; matchIndex++) {
	           Tcl_Obj *errorCodeItem, *typeListItem;
	           const char *errorCodeItemStr, *typeListItemStr;

	           Tcl_ListObjIndex (interp, errorCode, matchIndex, &errorCodeItem);
	           Tcl_ListObjIndex (interp, typeList, matchIndex, &typeListItem);

	           errorCodeItemStr = Tcl_GetStringFromObj (errorCodeItem, NULL);
	           typeListItemStr = Tcl_GetStringFromObj (typeListItem, NULL);

	           if (!Tcl_StringMatch (errorCodeItemStr, typeListItemStr)) {
	             break;

	           }
	         }

	         if (matchIndex >= typeListLength) {
	           /* matching catch clause found */
	           break;

	         }

	         /* continue looking */
	         currentIndex += 3;

	       }
	       else {
	         /* not a catch clause - there are no matching catch clauses */
	         currentIndex = objc;
	         break;

	       }
	     }

	     /*
	      * Did we find a matching catch clause?
	      */

	     if (currentIndex < objc) {
	       Tcl_Obj *ecvar, *msgvar, *infovar;

	       Tcl_ListObjLength (interp, objv[currentIndex+1], &catchInfoLength);

	       /*
	        * set variables with error data
	        */

	       if (catchInfoLength >= 2) {
	         Tcl_ListObjIndex (interp, objv[currentIndex+1], 1, &ecvar);
	         Tcl_ObjSetVar2 (interp, ecvar, NULL, errorCode, 0);

	       }

	       if (catchInfoLength >= 3) {
	         Tcl_ListObjIndex (interp, objv[currentIndex+1], 2, &msgvar);
	         Tcl_ObjSetVar2 (interp, msgvar, NULL, errorMsg, 0);

	       }

	       if (catchInfoLength >= 4) {
	         Tcl_ListObjIndex (interp, objv[currentIndex+1], 3, &infovar);
	         Tcl_ObjSetVar2 (interp, infovar, NULL, errorInfo, 0);

	       }

	       /*
	        * call body; the error code of this body takes precedence
	        */

	       res = Tcl_EvalObjEx (interp, objv[currentIndex+2], 0);

	     }

	     /*
	      * pop error data from stack
	      */

	     Tcl_ListObjLength (interp, myTsd->errorMsgStack, &stackLength);
	     stackLength--;
	     Tcl_ListObjReplace (interp, myTsd->errorMsgStack,
	                         stackLength, 1, 0, NULL);
	     Tcl_ListObjReplace (interp, myTsd->errorCodeStack,
	                         stackLength, 1, 0, NULL);
	     Tcl_ListObjReplace (interp, myTsd->errorInfoStack,
	                         stackLength, 1, 0, NULL);

	   }

	   /*
	    * Execute finally body. Preserve errorCode and friends; they might
	    * be corrupted by the code in the body - e.g. by a try in the code,
	    * or in a proc called by the code.
	    */

	   if (finallyIndex != -1) {
	     Tcl_Obj *errorCode, *errorInfo, *errorMsg;
	     int finallyres, origres=res;

	     errorMsg = Tcl_GetObjResult (interp);
	     Tcl_IncrRefCount (errorMsg);

	     if (origres == TCL_ERROR) {
	       errorCode = Tcl_ObjGetVar2 (interp, myTsd->errorCodeName, NULL,
	                                   TCL_GLOBAL_ONLY);
	       errorInfo = Tcl_ObjGetVar2 (interp, myTsd->errorInfoName, NULL,
	                                   TCL_GLOBAL_ONLY);
	       Tcl_IncrRefCount (errorCode);
	       Tcl_IncrRefCount (errorInfo);

	     }

	     finallyres = Tcl_EvalObjEx (interp, objv[finallyIndex+1], 0);

	     /*
	      * An Error in the finally clause takes precedence, else restore
	      * previous error data
	      */

	     if (finallyres != TCL_OK) {
	       res = finallyres;

	     }
	     else {
	       Tcl_SetObjResult (interp, errorMsg);

	       if (origres == TCL_ERROR) {
	         Tcl_SetObjErrorCode (interp, errorCode);
	 #ifdef _TCLINT
	         Tcl_ObjSetVar2 (interp, myTsd->errorInfoName, NULL, errorInfo,
	                         TCL_GLOBAL_ONLY);
	         interp->flags = ERR_IN_PROGRESS;
	 #else
	         Tcl_AddErrorInfo (interp, Tcl_GetStringFromObj (errorInfo, NULL));
	 #endif

	       }
	     }

	     Tcl_DecrRefCount (errorMsg);

	     if (origres == TCL_ERROR) {
	       Tcl_DecrRefCount (errorCode);
	       Tcl_DecrRefCount (errorInfo);

	     }
	   }

	   /*
	    * Pass along return code
	    */

	   return res;

	 }

	 /*
	  * ----------------------------------------------------------------------

	  *
	  * "Main" function, install our commands in the Tcl interpreter

	  *
	  * ----------------------------------------------------------------------
	  */

	 #undef TCL_STORAGE_CLASS
	 #define TCL_STORAGE_CLASS DLLEXPORT
	 EXTERN int
	 Trycatch_Init (Tcl_Interp *interp)

	 {
	   TryCatchTsd * myTsd;

	 #ifdef USE_TCL_STUBS
	   if (Tcl_InitStubs (interp, TCL_VERSION, 0) == NULL) {
	     return TCL_ERROR;

	   }
	 #else
	   if (Tcl_PkgRequire (interp, "Tcl", TCL_VERSION, 1) == NULL) {
	     return TCL_ERROR;

	   }
	 #endif

	   /*
	    * Allocate Tsd
	    */

	   myTsd = (TryCatchTsd *) Tcl_Alloc (sizeof (TryCatchTsd));
	   myTsd->errorCodeStack = Tcl_NewObj ();
	   myTsd->errorInfoStack = Tcl_NewObj ();
	   myTsd->errorMsgStack  = Tcl_NewObj ();
	   myTsd->errorCodeName  = Tcl_NewStringObj ("errorCode", -1);
	   myTsd->errorInfoName  = Tcl_NewStringObj ("errorInfo", -1);

	   /*
	    * add commands
	    */

	   Tcl_CreateObjCommand (interp, "throw", Tcl_ThrowObjCmd,
	                         (ClientData) myTsd, NULL);
	   Tcl_CreateObjCommand (interp, "try", Tcl_TryObjCmd,
	                         (ClientData) myTsd, NULL);

	   /*
	    * Ready
	    */

	   Tcl_PkgProvide (interp, "trycatch", "0.1");
	   return TCL_OK;

	 }

# Copyright

This document has been placed in the public domain.

Name change from tip/9.tip to tip/9.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
TIP:            9
Title:          Tk Standard Library
Version:        $Revision: 1.9 $
Author:         Marty Backe <[email protected]>
Author:         Larry W. Virden <[email protected]>
Author:         Jeff Hobbs <[email protected]>
State:          Withdrawn
Type:           Project
Vote:           Pending
Created:        07-Nov-2000
Post-History:   
Tcl-Version:    8.4

~ Abstract

A Tk standard library shall be bundled with the core Tcl/Tk
distribution.  The library will consist of general purpose widgets and
composite widgets for use in constructing Tcl/Tk applications.  The
library of Tk components will be written in Tcl/Tk.

~ Rationale

Although Tcl "ships" with a comprehensive set of native (compiled)
base Tk widgets, it lacks a library of composite widgets, from which
sophisticated applications can readily be built with minimal
reinvention.

Although the Tcl community has created a wealth of general purpose Tk
widgets, generally they are not centrally located or distributed,
making their use problematic. This requires that Tcl programs which
make use of such widgets must either distribute them or direct the end
user on their acquisition and installation. Arguably, the success and
higher visibility of other "competing" scripting languages can be
attributed in some part to their extensive libraries. Tcl/Tk should
continue this trend.

Tcl is perhaps unique in that it is considered both a graphical (Tk)
and non-graphical (Tcl) programming language. Work has begun in
implementing a standard library for Tcl. It could be argued that
Tcl/Tk's largest base, and its largest growth area, is with regards to
graphical applications. To this end, Tcl needs a comprehensive, and
well maintained Tk standard library.

Finally, to lower the barrier of using the Tk libraries, they should
be Tcl/Tk based.  This helps to assure cross platform independence
without requiring the user to compile code against a source
distribution.

~ Specification

 * The standard Tk library will be called "tklibX.Y", where "X.Y" will
   follow the version number of the Tcl/Tk distribution that it's
   compatible with.

 * Major/minor releases of the tklib shall coincide with the
   major/minor releases of Tcl/Tk. That is, if Tcl/Tk version 8.5 is
<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|
|

|

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55

# TIP 9: Tk Standard Library

	Author:         Marty Backe <[email protected]>
	Author:         Larry W. Virden <[email protected]>
	Author:         Jeff Hobbs <[email protected]>
	State:          Withdrawn
	Type:           Project
	Vote:           Pending
	Created:        07-Nov-2000
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

A Tk standard library shall be bundled with the core Tcl/Tk
distribution.  The library will consist of general purpose widgets and
composite widgets for use in constructing Tcl/Tk applications.  The
library of Tk components will be written in Tcl/Tk.

# Rationale

Although Tcl "ships" with a comprehensive set of native \(compiled\)
base Tk widgets, it lacks a library of composite widgets, from which
sophisticated applications can readily be built with minimal
reinvention.

Although the Tcl community has created a wealth of general purpose Tk
widgets, generally they are not centrally located or distributed,
making their use problematic. This requires that Tcl programs which
make use of such widgets must either distribute them or direct the end
user on their acquisition and installation. Arguably, the success and
higher visibility of other "competing" scripting languages can be
attributed in some part to their extensive libraries. Tcl/Tk should
continue this trend.

Tcl is perhaps unique in that it is considered both a graphical \(Tk\)
and non-graphical \(Tcl\) programming language. Work has begun in
implementing a standard library for Tcl. It could be argued that
Tcl/Tk's largest base, and its largest growth area, is with regards to
graphical applications. To this end, Tcl needs a comprehensive, and
well maintained Tk standard library.

Finally, to lower the barrier of using the Tk libraries, they should
be Tcl/Tk based.  This helps to assure cross platform independence
without requiring the user to compile code against a source
distribution.

# Specification

 * The standard Tk library will be called "tklibX.Y", where "X.Y" will
   follow the version number of the Tcl/Tk distribution that it's
   compatible with.

 * Major/minor releases of the tklib shall coincide with the
   major/minor releases of Tcl/Tk. That is, if Tcl/Tk version 8.5 is

︙ ︙ 
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130

   by the component. A picture is worth a thousand words! The Tk,
   BWidgets, and Iwidgets demos are prime examples to be emulated.

 * Tklib components can be dependent on other tklib components.  If
   tklib and tcllib become coordinated efforts, the tklib components
   can be dependent on tcllib components.

 * The tklib can (and hopefully will) include megawidgets.

 * Tklib components shall be written in Tcl/Tk.

 * Tklib components shall be implemented in their own namespace and
   distributed in package form.

 * Tklib components do not have to be unique with regards to other
   tklib components, although there shall be differentiating
   characteristics between them. There is more then one way to skin a
   cat.

 * The tklib shall not contain applications, IDEs, or development
   tools.

~ Notes

A tklib module has been created next to the aforementioned tcllib at
http://tcllib.sf.net/  This creates the basic infrastructure for
people to work in, but does not set any status related to the core as
yet.

----

''Larry W. Virden writes'':

 > It appears to me that tklib isn't going to be bundled with the tk
   source code distribution any more than tcllib getting distributed
   with the tcl core distribution.

 > If the TCT concurs that this is the case, then I would propose that
   this TIP be withdrawn.  tklib exists now, and to date, submissions
   are extremely rare.

 > Here we are, some time later, and no action still on either
   withdrawing or rejecting this TIP.  Perhaps some action could be
   taken on this TIP?

----

~ Copyright

This document has been placed in the public domain.

|

|

|

|

|

>
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
   by the component. A picture is worth a thousand words! The Tk,
   BWidgets, and Iwidgets demos are prime examples to be emulated.

 * Tklib components can be dependent on other tklib components.  If
   tklib and tcllib become coordinated efforts, the tklib components
   can be dependent on tcllib components.

 * The tklib can \(and hopefully will\) include megawidgets.

 * Tklib components shall be written in Tcl/Tk.

 * Tklib components shall be implemented in their own namespace and
   distributed in package form.

 * Tklib components do not have to be unique with regards to other
   tklib components, although there shall be differentiating
   characteristics between them. There is more then one way to skin a
   cat.

 * The tklib shall not contain applications, IDEs, or development
   tools.

# Notes

A tklib module has been created next to the aforementioned tcllib at
<http://tcllib.sf.net/>  This creates the basic infrastructure for
people to work in, but does not set any status related to the core as
yet.

----

_Larry W. Virden writes_:

 > It appears to me that tklib isn't going to be bundled with the tk
   source code distribution any more than tcllib getting distributed
   with the tcl core distribution.

 > If the TCT concurs that this is the case, then I would propose that
   this TIP be withdrawn.  tklib exists now, and to date, submissions
   are extremely rare.

 > Here we are, some time later, and no action still on either
   withdrawing or rejecting this TIP.  Perhaps some action could be
   taken on this TIP?

----

# Copyright

This document has been placed in the public domain.

Name change from tip/90.tip to tip/90.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55

56
57
58

59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248

249
250
251
252
253

254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271

272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297

298
299
300

301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316

317
318
319
320
321

322
323
324
325
326
327
328
329
330
331
332
333
334

335
336

337
338

339
340
341
342
343
344
345
346
347
348
349
350

351
352
353
354
355
356
357
358

359
360
361
362
363

364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415

416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502

TIP:            90
Title:          Enable [return -code] in Control Structure Procs
Version:        $Revision: 1.39 $
Author:         Don Porter <[email protected]>
Author:         Donal K. Fellows <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        15-Mar-2002
Post-History:   
Tcl-Version:    8.5

~ Abstract

This TIP analyzes existing limitations on the coding of control
structure commands as ''proc''s, and presents expanded forms of
''catch'' and ''return'' to remove those limitations.

~ Background

It is a distinguishing feature of Tcl that everything is a command,
including control structure functionality that in many other languages
are part of the language itself, such as ''if'', ''for'', and
''switch''.  The command interface of Tcl, including both a return
code and a result, allows extensions to create their own control
structure commands.

Control structure commands have the feature that one or more of their
arguments is a script, often called a ''body'', meant to be evaluated
in the caller's context.  The control structure command exists to
control whether, when, in what context, or how many times that script
is evaluated.  When the body is evaluated, however, it is intended to
behave as if it were interpreted directly in the place of the control
structure command.

The built-in commands of Tcl provide the ability for scripts
themselves to define new commands.  Notably, the ''proc'' command
makes this possible.  In addition, other commands such as ''catch'',
''return'', ''uplevel'', and ''upvar'' offer enough control and access
to the caller's context that it is possible to create new control
structure commands for Tcl, entirely at the script level.

Almost.

There is one limitation that separates control structure commands
created by ''proc'' from those created in C by a direct call to
''Tcl_Create(Obj)Command''.  It is most easily seen in the following
example that compares the built-in command ''while'' to the command
''control::do'' created by ''proc'' in the control package of tcllib.

|  % package require control
|  % proc a {} {while 1 {return -code error}}
|  % proc b {} {control::do {return -code error} while 1}
|  % catch a
|  1

|  % catch b
|  0

The control structure command ''control::do'' fails to evaluate
''return -code error'' in such a way that it acts the same as if
''return -code error'' was evaluated directly within proc ''b''.

~ Analysis

There are two deficiencies in Tcl's built-in commands that lead to
this incapacity in control structure commands defined by ''proc''.

First, ''catch'' is not able to capture the information.  Consider:

|   %  set code [catch {
|          return -code error -errorinfo foo -errorcode bar baz
|      } message]

After evaluation, ''code'' contains "2" (''TCL_RETURN''), and
''message'' contains "baz", but the other values are locked away in
internal fields of the ''Tcl_Interp'' structure as
''interp->returnCode'', ''interp->errorCode'', and 
''interp->errorInfo''.  The "-errorcode" and "-errorinfo" values
will be copied to the global variables "::errorCode" and 
"::errorInfo", respectively, but there will be no way at the
script level to get at the ''interp->returnCode'' value which
was the value of the original "-code" option.

Second, even if the information were available, there is no built-in
command in Tcl that can be evaluated within the body of a proc to make
the proc itself act as if it were the command ''return -code''.
Stated another way, it is not possible to create a command with
''proc'' that behaves exactly the same as ''return -code''.  Because
of that, it is also not possible to create a command with ''proc''
that behaves exactly the same as ''while'', ''if'', etc. - any
command that evaluates any of its arguments as a script in the
caller's context.

This is a curious, and likely unintentional, limitation.  Tcl goes to
great lengths to be sure I can create my own ''break'' replacement
with ''proc''.

| proc myBreak {} {return -code break}

It would be a welcome completion of Tcl's set of built-in commands to
be able to create a replacement for every one of them using ''proc''.

~ Specification

The ''return'' command shall have syntax:

| return ?option value ...? ?result?

There can be any number of ''option value'' pairs, and
any value at all is acceptable for an ''option'' argument.
The legal values of a ''value'' argument are limited for
some ''option''s, as follows:

 > the ''value'' after a "-code" must be either
   an integer (32-bit only), or one of the strings, "ok",
   "error", "return", "break", or "continue",
   just as in the 8.4 spec for ''return''.  The default ''value''
   for the "-code" option is "0".

 > the ''value'' after a "-level" must be a non-negative integer.
   The default ''value'' for the "-level" option is "1".

 > the ''value'' after a "-options" must be a dictionary ([111]).
   The default ''value'' for the "-options" option is an empty
   dictionary.

The keys and values in the dictionary ''value'' of the "-options"
option are pulled out and treated as additional ''option value''
arguments to the ''return'' command.  Note that this "-options" option
for option expansion is offered only because Tcl itself has no
syntax for argument expansion, as observed many,
many times before (for example, [103]).

The ''result'' argument, if any, is stored in the interp as the
result of the ''return'' command.  In default operation, this
becomes the result of the procedure in which the ''return'' command
is evaluated.

The return code of the ''return'' command is determined by the
''value''s of the "-code" and "-level" options.  If the ''value''
of the "-level" option is non-zero, then the return code of
''return'' is TCL_RETURN.  If the ''value'' of the "-level" option
is "0", then the return code of ''return'' is the ''value'' of the
"-code" option, translated from string, as needed.  In this way,

| return -level 0 -code break

is a synonym for

| break

while

| return -code break

spelled out with defaults filled in as:

| return -level 1 -code break

continues to function as before, causing the procedure in which
the ''return'' is evaluated to return the TCL_BREAK return code.

All ''option value'' arguments to ''return'' are stored in a
return options dictionary kept in the interp, just as the
''result'' argument gets stored in the result of the interp.

The TclUpdateReturnInfo() function is modified, so that each
level of procedure returning decrements the value of the "-level"
key in the return options dictionary.  When the value of the
"-level" key reaches "0", the return code from the current procedure
will be the value of the "-code" key in the return options dictionary.
Otherwise, the return code of the current procedure will be TCL_RETURN.

In this way,

| return -level 2 -code ok

is equivalent to

| return -code return

and should (absent some intervening ''catch'') cause a normal return
to the caller's caller.  Likewise,

| return -level 3 -code ok

would cause a normal return to the caller's caller's caller
(again absent an intervening ''catch''), something
that can't currently be accomplished.

The ''catch'' command shall have syntax:

| catch script ?resultVar? ?optionsVar?

The new argument ''optionsVar'', if present, will be the
name of a variable in which a dictionary of return options
should be stored.  The return options stored in that dictionary
are exactly those needed so that the evaluation of

| catch $script result options
| return -options $options $result

is completely indistinguishable (except for the existence
and values of variables "result" and "options") from the
direct evaluation of ''$script'' by the interpreter.  In
particular, any values of the "::errorCode" and "::errorInfo"
variables are the same as if there were never a ''catch'' in
the first place.

In addition, when the result of ''catch'' is TCL_ERROR, the
value in the ''errorLine'' field of the ''Interp'' struct
will be stored as the value of the "-errorline" key in the
return options dictionary.

This specification may seem a bit complex, but it makes possible
very simple solutions to the problems posed above.

~ Examples

First lets revisit the analysis:

|   %  set code [catch {
|          return -code error -errorinfo foo -errorcode bar baz
|      } message options]

After evaluation, ''code'' contains "2" (''TCL_RETURN''), ''message''
contains "baz", and now ''options'' contains:

| -errorcode bar -errorinfo foo -code 1 -level 1

So, the ''options'' variable now contains the information that
was previously inaccessible.  We can now

| return -options $options $message

to get the same results as if the ''catch'' had never been
there in the first place.

In 8.4 Tcl, it is not possible to implement a replacement
for the ''return'' command as a proc.  After this proposal,
such a replacement is:

| proc myReturn args {
|     set result ""
|     if {[llength $args] % 2} {
|         set result [lindex $args end]
|         set args [lrange $args 0 end-1]
|     }

|     set options [eval [list dict create -level 1] $args]
|     dict incr options -level
|     return -options $options $result
| }

In every way ''myReturn'' should be an equivalent to ''return''.

The new ability to exactly reproduce stack traces makes a
''catch'' of large scripts more attractive.  For example, a
procedure that allocates some resource, then performs operations,
and finally frees the resource before returning.  In order to
be sure the resource is freed, we must ''catch'' any errors
that might cause the procedure to return before the freeing
of the resource.  The solution looks like:

| proc doSomething {} {
|     set resource [allocate]
|     catch {
|          # Arbitrarily long script of operations
|     } result options
|     deallocate $resource
|     return -options $options $result
| }

With that structure, we are confident the resource is always
freed, but any error or exception will be presented to the
caller exactly as if it had never been caught in the first place.

Here are two examples of how to use the new features in a 
control structure proc.  The essence of a control structure
command is its ability to evaluate a script in the caller's
context, preserving the illusion that no additional stack
frame was ever used.  So, a proc replacement for ''eval''
illustrates the technique.

The first approach assumes one knows
the internal details of how the ''uplevel'' command adds to
the stack trace. This is straightforward, but will require a
rewrite if ''uplevel'' ever changes how it manipulates the
stack trace.

| proc myEval script {
|     if {[catch {uplevel 1 $script} result options] == 1} {
|         set stack [dict get $options -errorinfo]
|         regsub {\s+invoked from within\s+"uplevel 1 \$script"$} $stack {} stack
|         regsub {\("uplevel" body line (\d+)\)$} $stack [subst -nobackslashes \
|                 {("[lindex [info level 0] 0]" body line \1)}] stack
|         dict set options -errorinfo $stack
|     }

|     dict incr options -level
|     return -options $options $result
| }

A second, more robust solution is possible, but requires a bit
more context gymnastics.

| namespace eval control {
|     proc eval script {
|         variable result
|         variable options
|         set code [uplevel 1 \
|                 [list ::catch $script [namespace which -variable result] \
|                         [namespace which -variable options]]]
|         if {$code == 1} {
|             set line [dict get $options -errorline]
|             dict append options -errorinfo \
|                     "\n    (\"[lindex [info level 0] 0]\" body line $line)"
|         }

|         dict incr options -level
|         return -options $options $result
|     }
| }

Note that in the second solution we did not have to strip away the
contributions of ''uplevel'' to the stack trace, because we captured
the stack trace before ''uplevel'' added anything.  Then we could add
our own information (drawing in part on the new "-errorline" value
available to us now at the script level).

We confirm that either approach solves the original problem:

| % proc a {} {eval {return -code error}}
| % proc b {} {myEval {return -code error}}
| % proc c {} {control::eval {return -code error}}
| % catch a
| 1

| % catch b
| 1

| % catch c
| 1

Finally, the new features make possible a utility command that
can be of use to people making simple control structure commands,
or doing simple wrapping, where there is no need to augment the
stack trace, or to treat any return codes in a special way:

| namespace eval control {
|     proc ascaller script {
|         if {[info level] < 2} {
|             return -code error \
|                     "[lindex [info level 0] 0] called outside a proc"
|         }

|         variable result
|         variable options
|         set code [uplevel 2 \
|                 [list ::catch $script   [namespace which -variable result] \
|                                         [namespace which -variable options]]]
|         if {$code == 0} {
|             return $result
|         }

|         dict incr options -level 2
|         return -options $options $result
|     }
| }

Within a proc, ''ascaller $script'' will take care of all aspects
of evaluating ''$script'' in the caller context, and exiting as
appropriate for all non-TCL_OK return codes.

~ Extensibility

The ''return -code'' command has always accepted any integer value
as a valid argument, allowing package and application authors to
define their own new return codes as needed by their own control
structure commands.  Now that ''return'' will accept any ''option''
argument, and ''catch'' can capture all ''option value'' argument
pairs passed to the caught ''return'' command, package and application
authors now have the ability to augment their custom return codes
with additional data.  Some prefix convention should be established
to avoid key name conflicts in the return options dictionary.

~ Potential Concerns

Reviewers of drafts of this TIP wondered whether the new
"-level" option to ''return'' raised the possibility of
trouble with an attempt to return more levels than beyond
the top of the call stack.  

It should be understood that ''return -level N'' does not
take any shortcut past the intervening levels.  Each level
of the call stack gets a TCL_RETURN return code, and a "-level"
value, dropping by one each step up the stack.  Any level in
the stack might choose to ''catch'' the TCL_RETURN and treat
it as it wishes.  This is exactly the way the existing
''return -code return'' is handled.  Normally, it would cause
a normal return to the caller's caller, but if the caller
chooses to 'catch' it, then the caller has control.

At the toplevel we run out of callers.  Then the question becomes
how is a TCL_RETURN code at toplevel handled?

| % return -level 0       ;# same as a TCL_OK at toplevel
| % return -level 1       ;# same as [return]
| % return -level 2       ;# same as [return -code return]
| command returned bad code: 2

From the C level, ''Tcl_AllowExceptions()'' can be used to
modify this toplevel behavior.

The following proc will produce the same results as above, but
from any level in the call stack (absent an intervening ''catch''):

| % proc escape level {
|       set x [info level]
|       incr x $level
|       return -level $x
|   }

| % escape 0
| % escape 1
| % escape 2
| command returned bad code: 2

Another concern was whether this proposal gave slave interpreters
any new powers over their masters.  The return code from evaluation
of an untrusted script in a slave interpreter should always be
wrapped in a ''catch'' already, lest a TCL_ERROR in the script
blow the stack.  Given that, the only thing this proposal does is
give the ''catch'' command more information to use to decide
how to handle the misbehaving script.

~ Compatibility

It is the author's belief that this proposal is completely
compatible with prior Tcl 8.X releases.  Any error-free script
that ran before, should continue to run with the same results.
At the C level, only internal changes are made, and no new interfaces
are defined.  Any extension or embedding C program that sticks to the
public stubs interface should see no visible change.  

~ Prototype

This proposal is implemented by Tcl Patch 531640 at SourceForge.

The prototype covers all described functionality, but might be
further improved with more substantial bytecompiling of [return].

~ Future considerations

The main reason the global variables ''::errorInfo'' and
''::errorCode'' exist is to give the script level access to
stack and error code information following the ''catch''
of a script that raises an error.  After this proposal, the
''catch'' command itself provides access to that information,
so the global variables are not required.  One can imagine
deprecating them, asking users of Tcl 8.5 to stop writing
code that accesses them.  They could still have apparent
existence, to satisfy the needs of scripts written for earlier
Tcl 8.X releases, by means of read traces.  In time,
Tcl 9 could either continue the read trace scheme, or not
provide these global variables at all.

One part of Tcl itself that currently makes use of the
''::errorCode'' and ''::errorInfo'' variables is the
''bgerror'' command.  Currently, ''bgerror'' accepts exactly
one argument, the error message.  To make use of stack or
error code information, ''bgerror'' must retrieve them from
the global variables.  The proper values of these global
variables are re-set by ''Tcl_BackgroundError()'' prior to
evaluation of ''bgerror''.

As an alternative, ''Tcl_BackgroundError()'' could first attempt
to call ''bgerror'' with ''two'' arguments, first the message,
then a dictionary of options.  If that call returned TCL_ERROR,
then a second attempt could be made with a single message
argument.  In that way, cleaner ''bgerror'' commands that get
all data from arguments could be supported, while still keeping
support for those ''bgerror'' commands that were defined for
single argument use.

It has been noted several times that the processing of the
value of ''::errorInfo'' is rather difficult because it is
an arbitrary string with no documented structure.  A different,
more structured way of representing stack trace information would
be an improvement.  This proposal does not propose an alternative,
but because it offers an extensible dictionary for storing arbitrary
return options data, it does provide an infrastructure where such
approaches might be tried out.

~ Acknowledgments

This proposal is a synthesis of ideas from many sources.  As best
I can recall, major contributions came from Joe English, Andreas
Leitgeb, Reinhard Max, and Kevin Kenny.  If you like the idea,
give them some credit; it you don't, blame me for combining the
ideas badly.

~ See also

Documentation for tcllib's control package: 
http://tcllib.sf.net/doc/control.html

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
>

|

|
|

|

|
|

|

|
|
|

|
|
|
|

|
|
|
|
<
>
|
<
|
>
|
|
|

|

|

|

|
|
|

|
|
|
|
|

|

|

|
|
|

|
|

|

|

|

|

|

|
|
|
|

|
|

|

|
|

|
|

|
|
|

|

|
|
|

|
|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|

|
|
|

|

|
|

|

|
|
|

|
|

|

|

|

|

|

|
|
|
|
|
<
>
|
|
|
<
|
>
|

|

|

|
|
|
|
|
|
|
<
>

|

|

|

|
|
|
|
|
|
|
<
>
|
|
<
>

|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
<
<
|
>
>

|
|
|
|

|
|
|
|
<
>
|
<
>
|
<
>

|
|
|
|
|
<
>
|
|
|
|
|
|
|
<
>
|
|
<
<
|
>
>
|
|
|

|

|

|
|
|

|

|

|

|

|

|

|

|
|
|
|

|

|

|
|
|
|
<
>
|
|
|
|

|

|

|

|

|

|
|
|

|

|
|

|

|
|

|
|
|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53

54
55

56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246

247
248
249
250

251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269

270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295

296
297
298

299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314

315
316
317

318
319
320
321
322
323
324
325
326
327
328
329
330
331
332

333
334

335
336

337
338
339
340
341
342
343
344
345
346
347
348

349
350
351
352
353
354
355
356

357
358
359

360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413

414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502

# TIP 90: Enable [return -code] in Control Structure Procs

	Author:         Don Porter <[email protected]>
	Author:         Donal K. Fellows <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        15-Mar-2002
	Post-History:   
	Tcl-Version:    8.5
-----

# Abstract

This TIP analyzes existing limitations on the coding of control
structure commands as _proc_s, and presents expanded forms of
_catch_ and _return_ to remove those limitations.

# Background

It is a distinguishing feature of Tcl that everything is a command,
including control structure functionality that in many other languages
are part of the language itself, such as _if_, _for_, and
_switch_.  The command interface of Tcl, including both a return
code and a result, allows extensions to create their own control
structure commands.

Control structure commands have the feature that one or more of their
arguments is a script, often called a _body_, meant to be evaluated
in the caller's context.  The control structure command exists to
control whether, when, in what context, or how many times that script
is evaluated.  When the body is evaluated, however, it is intended to
behave as if it were interpreted directly in the place of the control
structure command.

The built-in commands of Tcl provide the ability for scripts
themselves to define new commands.  Notably, the _proc_ command
makes this possible.  In addition, other commands such as _catch_,
_return_, _uplevel_, and _upvar_ offer enough control and access
to the caller's context that it is possible to create new control
structure commands for Tcl, entirely at the script level.

Almost.

There is one limitation that separates control structure commands
created by _proc_ from those created in C by a direct call to
_Tcl\_Create\(Obj\)Command_.  It is most easily seen in the following
example that compares the built-in command _while_ to the command
_control::do_ created by _proc_ in the control package of tcllib.

	  % package require control
	  % proc a {} {while 1 {return -code error}}
	  % proc b {} {control::do {return -code error} while 1}
	  % catch a

	  1
	  % catch b

	  0

The control structure command _control::do_ fails to evaluate
_return -code error_ in such a way that it acts the same as if
_return -code error_ was evaluated directly within proc _b_.

# Analysis

There are two deficiencies in Tcl's built-in commands that lead to
this incapacity in control structure commands defined by _proc_.

First, _catch_ is not able to capture the information.  Consider:

	   %  set code [catch {
	          return -code error -errorinfo foo -errorcode bar baz
	      } message]

After evaluation, _code_ contains "2" \(_TCL\_RETURN_\), and
_message_ contains "baz", but the other values are locked away in
internal fields of the _Tcl\_Interp_ structure as
_interp->returnCode_, _interp->errorCode_, and 
_interp->errorInfo_.  The "-errorcode" and "-errorinfo" values
will be copied to the global variables "::errorCode" and 
"::errorInfo", respectively, but there will be no way at the
script level to get at the _interp->returnCode_ value which
was the value of the original "-code" option.

Second, even if the information were available, there is no built-in
command in Tcl that can be evaluated within the body of a proc to make
the proc itself act as if it were the command _return -code_.
Stated another way, it is not possible to create a command with
_proc_ that behaves exactly the same as _return -code_.  Because
of that, it is also not possible to create a command with _proc_
that behaves exactly the same as _while_, _if_, etc. - any
command that evaluates any of its arguments as a script in the
caller's context.

This is a curious, and likely unintentional, limitation.  Tcl goes to
great lengths to be sure I can create my own _break_ replacement
with _proc_.

	 proc myBreak {} {return -code break}

It would be a welcome completion of Tcl's set of built-in commands to
be able to create a replacement for every one of them using _proc_.

# Specification

The _return_ command shall have syntax:

	 return ?option value ...? ?result?

There can be any number of _option value_ pairs, and
any value at all is acceptable for an _option_ argument.
The legal values of a _value_ argument are limited for
some _option_s, as follows:

 > the _value_ after a "-code" must be either
   an integer \(32-bit only\), or one of the strings, "ok",
   "error", "return", "break", or "continue",
   just as in the 8.4 spec for _return_.  The default _value_
   for the "-code" option is "0".

 > the _value_ after a "-level" must be a non-negative integer.
   The default _value_ for the "-level" option is "1".

 > the _value_ after a "-options" must be a dictionary \([[111]](111.md)\).
   The default _value_ for the "-options" option is an empty
   dictionary.

The keys and values in the dictionary _value_ of the "-options"
option are pulled out and treated as additional _option value_
arguments to the _return_ command.  Note that this "-options" option
for option expansion is offered only because Tcl itself has no
syntax for argument expansion, as observed many,
many times before \(for example, [[103]](103.md)\).

The _result_ argument, if any, is stored in the interp as the
result of the _return_ command.  In default operation, this
becomes the result of the procedure in which the _return_ command
is evaluated.

The return code of the _return_ command is determined by the
_value_s of the "-code" and "-level" options.  If the _value_
of the "-level" option is non-zero, then the return code of
_return_ is TCL\_RETURN.  If the _value_ of the "-level" option
is "0", then the return code of _return_ is the _value_ of the
"-code" option, translated from string, as needed.  In this way,

	 return -level 0 -code break

is a synonym for

	 break

while

	 return -code break

spelled out with defaults filled in as:

	 return -level 1 -code break

continues to function as before, causing the procedure in which
the _return_ is evaluated to return the TCL\_BREAK return code.

All _option value_ arguments to _return_ are stored in a
return options dictionary kept in the interp, just as the
_result_ argument gets stored in the result of the interp.

The TclUpdateReturnInfo\(\) function is modified, so that each
level of procedure returning decrements the value of the "-level"
key in the return options dictionary.  When the value of the
"-level" key reaches "0", the return code from the current procedure
will be the value of the "-code" key in the return options dictionary.
Otherwise, the return code of the current procedure will be TCL\_RETURN.

In this way,

	 return -level 2 -code ok

is equivalent to

	 return -code return

and should \(absent some intervening _catch_\) cause a normal return
to the caller's caller.  Likewise,

	 return -level 3 -code ok

would cause a normal return to the caller's caller's caller
\(again absent an intervening _catch_\), something
that can't currently be accomplished.

The _catch_ command shall have syntax:

	 catch script ?resultVar? ?optionsVar?

The new argument _optionsVar_, if present, will be the
name of a variable in which a dictionary of return options
should be stored.  The return options stored in that dictionary
are exactly those needed so that the evaluation of

	 catch $script result options
	 return -options $options $result

is completely indistinguishable \(except for the existence
and values of variables "result" and "options"\) from the
direct evaluation of _$script_ by the interpreter.  In
particular, any values of the "::errorCode" and "::errorInfo"
variables are the same as if there were never a _catch_ in
the first place.

In addition, when the result of _catch_ is TCL\_ERROR, the
value in the _errorLine_ field of the _Interp_ struct
will be stored as the value of the "-errorline" key in the
return options dictionary.

This specification may seem a bit complex, but it makes possible
very simple solutions to the problems posed above.

# Examples

First lets revisit the analysis:

	   %  set code [catch {
	          return -code error -errorinfo foo -errorcode bar baz
	      } message options]

After evaluation, _code_ contains "2" \(_TCL\_RETURN_\), _message_
contains "baz", and now _options_ contains:

	 -errorcode bar -errorinfo foo -code 1 -level 1

So, the _options_ variable now contains the information that
was previously inaccessible.  We can now

	 return -options $options $message

to get the same results as if the _catch_ had never been
there in the first place.

In 8.4 Tcl, it is not possible to implement a replacement
for the _return_ command as a proc.  After this proposal,
such a replacement is:

	 proc myReturn args {
	     set result ""
	     if {[llength $args] % 2} {
	         set result [lindex $args end]
	         set args [lrange $args 0 end-1]

	     }
	     set options [eval [list dict create -level 1] $args]
	     dict incr options -level
	     return -options $options $result

	 }

In every way _myReturn_ should be an equivalent to _return_.

The new ability to exactly reproduce stack traces makes a
_catch_ of large scripts more attractive.  For example, a
procedure that allocates some resource, then performs operations,
and finally frees the resource before returning.  In order to
be sure the resource is freed, we must _catch_ any errors
that might cause the procedure to return before the freeing
of the resource.  The solution looks like:

	 proc doSomething {} {
	     set resource [allocate]
	     catch {
	          # Arbitrarily long script of operations
	     } result options
	     deallocate $resource
	     return -options $options $result

	 }

With that structure, we are confident the resource is always
freed, but any error or exception will be presented to the
caller exactly as if it had never been caught in the first place.

Here are two examples of how to use the new features in a 
control structure proc.  The essence of a control structure
command is its ability to evaluate a script in the caller's
context, preserving the illusion that no additional stack
frame was ever used.  So, a proc replacement for _eval_
illustrates the technique.

The first approach assumes one knows
the internal details of how the _uplevel_ command adds to
the stack trace. This is straightforward, but will require a
rewrite if _uplevel_ ever changes how it manipulates the
stack trace.

	 proc myEval script {
	     if {[catch {uplevel 1 $script} result options] == 1} {
	         set stack [dict get $options -errorinfo]
	         regsub {\s+invoked from within\s+"uplevel 1 \$script"$} $stack {} stack
	         regsub {\("uplevel" body line (\d+)\)$} $stack [subst -nobackslashes \
	                 {("[lindex [info level 0] 0]" body line \1)}] stack
	         dict set options -errorinfo $stack

	     }
	     dict incr options -level
	     return -options $options $result

	 }

A second, more robust solution is possible, but requires a bit
more context gymnastics.

	 namespace eval control {
	     proc eval script {
	         variable result
	         variable options
	         set code [uplevel 1 \
	                 [list ::catch $script [namespace which -variable result] \
	                         [namespace which -variable options]]]
	         if {$code == 1} {
	             set line [dict get $options -errorline]
	             dict append options -errorinfo \
	                     "\n    (\"[lindex [info level 0] 0]\" body line $line)"

	         }
	         dict incr options -level
	         return -options $options $result

	     }
	 }

Note that in the second solution we did not have to strip away the
contributions of _uplevel_ to the stack trace, because we captured
the stack trace before _uplevel_ added anything.  Then we could add
our own information \(drawing in part on the new "-errorline" value
available to us now at the script level\).

We confirm that either approach solves the original problem:

	 % proc a {} {eval {return -code error}}
	 % proc b {} {myEval {return -code error}}
	 % proc c {} {control::eval {return -code error}}
	 % catch a

	 1
	 % catch b

	 1
	 % catch c

	 1

Finally, the new features make possible a utility command that
can be of use to people making simple control structure commands,
or doing simple wrapping, where there is no need to augment the
stack trace, or to treat any return codes in a special way:

	 namespace eval control {
	     proc ascaller script {
	         if {[info level] < 2} {
	             return -code error \
	                     "[lindex [info level 0] 0] called outside a proc"

	         }
	         variable result
	         variable options
	         set code [uplevel 2 \
	                 [list ::catch $script   [namespace which -variable result] \
	                                         [namespace which -variable options]]]
	         if {$code == 0} {
	             return $result

	         }
	         dict incr options -level 2
	         return -options $options $result

	     }
	 }

Within a proc, _ascaller $script_ will take care of all aspects
of evaluating _$script_ in the caller context, and exiting as
appropriate for all non-TCL\_OK return codes.

# Extensibility

The _return -code_ command has always accepted any integer value
as a valid argument, allowing package and application authors to
define their own new return codes as needed by their own control
structure commands.  Now that _return_ will accept any _option_
argument, and _catch_ can capture all _option value_ argument
pairs passed to the caught _return_ command, package and application
authors now have the ability to augment their custom return codes
with additional data.  Some prefix convention should be established
to avoid key name conflicts in the return options dictionary.

# Potential Concerns

Reviewers of drafts of this TIP wondered whether the new
"-level" option to _return_ raised the possibility of
trouble with an attempt to return more levels than beyond
the top of the call stack.  

It should be understood that _return -level N_ does not
take any shortcut past the intervening levels.  Each level
of the call stack gets a TCL\_RETURN return code, and a "-level"
value, dropping by one each step up the stack.  Any level in
the stack might choose to _catch_ the TCL\_RETURN and treat
it as it wishes.  This is exactly the way the existing
_return -code return_ is handled.  Normally, it would cause
a normal return to the caller's caller, but if the caller
chooses to 'catch' it, then the caller has control.

At the toplevel we run out of callers.  Then the question becomes
how is a TCL\_RETURN code at toplevel handled?

	 % return -level 0       ;# same as a TCL_OK at toplevel
	 % return -level 1       ;# same as [return]
	 % return -level 2       ;# same as [return -code return]
	 command returned bad code: 2

From the C level, _Tcl\_AllowExceptions\(\)_ can be used to
modify this toplevel behavior.

The following proc will produce the same results as above, but
from any level in the call stack \(absent an intervening _catch_\):

	 % proc escape level {
	       set x [info level]
	       incr x $level
	       return -level $x

	   }
	 % escape 0
	 % escape 1
	 % escape 2
	 command returned bad code: 2

Another concern was whether this proposal gave slave interpreters
any new powers over their masters.  The return code from evaluation
of an untrusted script in a slave interpreter should always be
wrapped in a _catch_ already, lest a TCL\_ERROR in the script
blow the stack.  Given that, the only thing this proposal does is
give the _catch_ command more information to use to decide
how to handle the misbehaving script.

# Compatibility

It is the author's belief that this proposal is completely
compatible with prior Tcl 8.X releases.  Any error-free script
that ran before, should continue to run with the same results.
At the C level, only internal changes are made, and no new interfaces
are defined.  Any extension or embedding C program that sticks to the
public stubs interface should see no visible change.  

# Prototype

This proposal is implemented by Tcl Patch 531640 at SourceForge.

The prototype covers all described functionality, but might be
further improved with more substantial bytecompiling of [return].

# Future considerations

The main reason the global variables _::errorInfo_ and
_::errorCode_ exist is to give the script level access to
stack and error code information following the _catch_
of a script that raises an error.  After this proposal, the
_catch_ command itself provides access to that information,
so the global variables are not required.  One can imagine
deprecating them, asking users of Tcl 8.5 to stop writing
code that accesses them.  They could still have apparent
existence, to satisfy the needs of scripts written for earlier
Tcl 8.X releases, by means of read traces.  In time,
Tcl 9 could either continue the read trace scheme, or not
provide these global variables at all.

One part of Tcl itself that currently makes use of the
_::errorCode_ and _::errorInfo_ variables is the
_bgerror_ command.  Currently, _bgerror_ accepts exactly
one argument, the error message.  To make use of stack or
error code information, _bgerror_ must retrieve them from
the global variables.  The proper values of these global
variables are re-set by _Tcl\_BackgroundError\(\)_ prior to
evaluation of _bgerror_.

As an alternative, _Tcl\_BackgroundError\(\)_ could first attempt
to call _bgerror_ with _two_ arguments, first the message,
then a dictionary of options.  If that call returned TCL\_ERROR,
then a second attempt could be made with a single message
argument.  In that way, cleaner _bgerror_ commands that get
all data from arguments could be supported, while still keeping
support for those _bgerror_ commands that were defined for
single argument use.

It has been noted several times that the processing of the
value of _::errorInfo_ is rather difficult because it is
an arbitrary string with no documented structure.  A different,
more structured way of representing stack trace information would
be an improvement.  This proposal does not propose an alternative,
but because it offers an extensible dictionary for storing arbitrary
return options data, it does provide an infrastructure where such
approaches might be tried out.

# Acknowledgments

This proposal is a synthesis of ideas from many sources.  As best
I can recall, major contributions came from Joe English, Andreas
Leitgeb, Reinhard Max, and Kevin Kenny.  If you like the idea,
give them some credit; it you don't, blame me for combining the
ideas badly.

# See also

Documentation for tcllib's control package: 
<http://tcllib.sf.net/doc/control.html>

# Copyright

This document has been placed in the public domain.

Name change from tip/91.tip to tip/91.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75

TIP:		91
Title:		Backward Compatibility for Channel Types with 32-bit SeekProcs
State:		Final
Type:		Project
Tcl-Version:	8.4
Vote:		Done
Post-History:	
Version:	$Revision: 1.5 $
Author:		Donal K. Fellows <[email protected]>
Created:	03-May-2002

~ Abstract

[72] broke backward-compatibility for channels that supported the
[[seek]] command, and this TIP adds the ability for old-style channels
to work with the underlying 64-bit architecture.

~ Rationale

Although the ability to work with large files (as added by [72]) is
crucially useful in many situations, it has introduced a few problems,
one of which being that it broke backward compatibility for channel
types (see
http://sourceforge.net/tracker/?func=detail&atid=410295&aid=551677&group_id=34191
for details.)  Following discussion with the people concerned, I
believe it is possible to modify the channel type structure so that
old-style channels - i.e. those compiled against Tcl 8.3 - can still
be supported (though with a limited range of operation.)

~ Proposed Change

The ''Tcl_ChannelType'' structure will have an extra field appended of
type ''Tcl_DriverWideSeekProc'' called ''wideSeekProc'', which shall
be guaranteed to be present (though possibly NULL) whenever the
version of the ''Tcl_ChannelType'' structure is at least
''TCL_CHANNEL_VERSION_3''.  The type ''Tcl_DriverSeekProc'' shall be
reverted to its pre-[72] version, with the current type of
''Tcl_DriverSeekProc'' being transferred to the type
''Tcl_DriverWideSeekProc''.  In order to facilitate stacked channels,
an additional requirement shall be imposed that if a channel driver
implements a ''wideSeekProc'', then it shall also implement a
''seekProc'', so allowing stacked channels to work entirely in one
domain or the other (well, in simple cases at least.)

Semantically, the core will handle seeks by preferring to use a
''wideSeekProc'' if present, and using the ''seekProc'' otherwise.
Considering just the case where the ''seekProc'' is used, if the
offset argument exceeds that which is representable in a ''long'',
''Tcl_Seek'' will fail, simulating a system error of EOVERFLOW.

The only Tcl core channel which will need modification is the ''file''
channel; this will be adapted to generate an error of EOVERFLOW when
the resulting offset in a file would otherwise exceed that which can
be expressed in a ''long'' (which has the downside of making the seek
operation no longer atomic when using the old interface, since the
file offset will need to be restored to its old position in such
cases.)  On 64-bit platforms, both ''seekProc'' and ''wideSeekProc''
will be the same function.

~ Rejected Alternatives

I considered overloading the ''seekProc'' field to have different
types depending on the value of the ''version'' field, but that's
remarkably ugly and forces people to adapt rapidly at a source level.
I don't know about everyone else, but I don't use a lot of programs at
the moment that actually need access to files larger than 2GB.

I also considered allowing code to only implement the ''wideSeekProc''
but it was easier to code the way I ended up doing it and I don't
think there are that many people writing channel drivers that support
seeking anway.

~ Copyright

This document has been placed in the public domain.

<
|
|
|
|
|
|
<
|
|
>

|

|
|

|

|

|
|
|

|

|

|
|
|
|
|
|
|
|

|
|
|

|
|
|
|

|

|

|

|

|
|

|

|

>

1
2
3
4
5
6

7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75

# TIP 91: Backward Compatibility for Channel Types with 32-bit SeekProcs
	State:		Final
	Type:		Project
	Tcl-Version:	8.4
	Vote:		Done
	Post-History:	

	Author:		Donal K. Fellows <[email protected]>
	Created:	03-May-2002
-----

# Abstract

[[72]](72.md) broke backward-compatibility for channels that supported the
[seek] command, and this TIP adds the ability for old-style channels
to work with the underlying 64-bit architecture.

# Rationale

Although the ability to work with large files \(as added by [[72]](72.md)\) is
crucially useful in many situations, it has introduced a few problems,
one of which being that it broke backward compatibility for channel
types \(see
<http://sourceforge.net/tracker/?func=detail&atid=410295&aid=551677&group\_id=34191>
for details.\)  Following discussion with the people concerned, I
believe it is possible to modify the channel type structure so that
old-style channels - i.e. those compiled against Tcl 8.3 - can still
be supported \(though with a limited range of operation.\)

# Proposed Change

The _Tcl\_ChannelType_ structure will have an extra field appended of
type _Tcl\_DriverWideSeekProc_ called _wideSeekProc_, which shall
be guaranteed to be present \(though possibly NULL\) whenever the
version of the _Tcl\_ChannelType_ structure is at least
_TCL\_CHANNEL\_VERSION\_3_.  The type _Tcl\_DriverSeekProc_ shall be
reverted to its pre-[[72]](72.md) version, with the current type of
_Tcl\_DriverSeekProc_ being transferred to the type
_Tcl\_DriverWideSeekProc_.  In order to facilitate stacked channels,
an additional requirement shall be imposed that if a channel driver
implements a _wideSeekProc_, then it shall also implement a
_seekProc_, so allowing stacked channels to work entirely in one
domain or the other \(well, in simple cases at least.\)

Semantically, the core will handle seeks by preferring to use a
_wideSeekProc_ if present, and using the _seekProc_ otherwise.
Considering just the case where the _seekProc_ is used, if the
offset argument exceeds that which is representable in a _long_,
_Tcl\_Seek_ will fail, simulating a system error of EOVERFLOW.

The only Tcl core channel which will need modification is the _file_
channel; this will be adapted to generate an error of EOVERFLOW when
the resulting offset in a file would otherwise exceed that which can
be expressed in a _long_ \(which has the downside of making the seek
operation no longer atomic when using the old interface, since the
file offset will need to be restored to its old position in such
cases.\)  On 64-bit platforms, both _seekProc_ and _wideSeekProc_
will be the same function.

# Rejected Alternatives

I considered overloading the _seekProc_ field to have different
types depending on the value of the _version_ field, but that's
remarkably ugly and forces people to adapt rapidly at a source level.
I don't know about everyone else, but I don't use a lot of programs at
the moment that actually need access to files larger than 2GB.

I also considered allowing code to only implement the _wideSeekProc_
but it was easier to code the way I ended up doing it and I don't
think there are that many people writing channel drivers that support
seeking anway.

# Copyright

This document has been placed in the public domain.

Name change from tip/92.tip to tip/92.md.

1
2
3
4
5
6
7
8
9
10
11

12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97

98
99
100
101
102
103
104

105
106
107
108
109

110
111
112
113
114

115
116

117
118
119
120
121
122
123
124
125

126
127
128
129

130
131
132
133

134
135
136
137
138

139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173

TIP:		92
Title:		Move Package Load Decisions to Application Developer
Version:	$Revision: 1.3 $
Author:		Clif Flynt <[email protected]>
State:		Withdrawn
Type:		Project
Vote:		Pending
Created:	13-May-2002
Post-History: 
Tcl-Version:	8.4
Keywords:	package require, namespace, pkg_mkIndex

~ Abstract

This TIP makes the loading of packages far more flexible, so as to
better support their use by application authors in situations above
and beyond those foreseen by the developer of the package.

~ Overview

I believe that we've been misdirecting our efforts in solutions to
the Package issue.

 * The modifications to ''pkg_mkIndex'' give the library author (or
   package builder) the ability to define when a package will be
   loaded (immediate or deferred), which restricts an application
   developer to the decisions made by the library author.

 * If a package is built to be loaded immediately, it is loaded into
   the top-level namespace.  This breaks previous tricks to force a
   package to load inside an existing namespace.

These techniques limit the application writer to the behavior (and
uses) envisioned by the package author and is counter to the concept
that application developer best understands how they need a tool to
perform for their application.  The Tcl community, in particular, has
grown largely because the tools have had applications far beyond those
imagined by their initial developers.

Moving the decisions about when and how to load a package from
''pkg_mkIndex'' to the ''package require'' command allows the
application writer the freedom to find new styles of use that the
package author may not have conceived.

Being able to force an immediate load into the current namespace
rather than always loading packages into the global scope provides
support for lightweight object style data structures without the need
for extensions like Incr Tcl, OOTcl, etc.

Loading a package/namespace into the current namespace provides
mechanisms for lightweight inheritance, and since namespaces can
contain both code and data, loading a namespace multiple times (in
separate namespaces) is a lightweight aggregation model.

I do not propose that this power removes the need for full object
oriented programming models within the Tcl community.  However, I
believe that putting the power to develop these lightweight models
into the application developer provides the developer with a more
versatile tool kit than they currently have.  (One that I've been
using for several years, with workarounds.)

This proposal is to add new flags to the package require command,
allowing an application developer to determine when and how to load a
package.

 -current: Load the package into the current namespace rather than the
	global space.  Implies immediate.

 -multiple: Allow loading multiple copies of this package, for use
	with ''-current'' when the application programmer wishes to
	create multiple nested copies of a package.

 -immediate: Load immediately, rather than defer loading the package
	until needed.  This is the default behavior with Tcl 8.3 and
	later.

 -defer: Load package when required.  The default with Tcl 8.2 and
	earlier, or when ''pkg_mkIndex -lazy'' used with Tcl 8.3.

 -exact: No change to this option.  Requires an exact Major/Minor
	revision match to be an acceptable package.

~ Script Example

The code below implements a simple stack object that can be merged
into other namespaces to create objects that contain individual
stacks.

| package provide stack 1.0
| namespace eval stack {
|     namespace export push pop peek size
|     variable stack ""
|     
|     proc push {val} {
|         variable stack;
|         lappend stack $val
|     }

|      
|     proc pop {} {
|         variable stack;
|         set rtn [lindex $stack end]
|         set stack [lrange $stack 0 end-1]
|         return $rtn
|         }

| 
|     proc peek {{pos end}} {
|         variable stack;
|         return [lindex $stack $pos]
|     }

|      
|     proc size {} {
|         variable stack;
|         return [llength $stack]
|     }

|      
| }

|  

With this data structure available, the guts of a Tower of Hanoi
puzzle becomes simple:

| namespace eval left {
|         package require -current -multiple  stack 1.0
|         namespace import [namespace current]::stack::*
|     }   

| namespace eval center {
|         package require -current -multiple  stack 1.0
|         namespace import [namespace current]::stack::*
|     }

| namespace eval right {
|         package require -current -multiple  stack 1.0
|         namespace import [namespace current]::stack::*
|     }

|      
| proc move {from to} {
|         ${to}::push [${from}::pop]
|     }

This creates 3 'objects' each of which contains a private stack with
the stack methods.

~ Reference Implementation

A reference implementation of the ''-current'' and ''-multiple''
flags has been created for Tcl 8.4a4 and is available at
http://noucorp.com/PkgPatch8.4.zip

The implementation required these modifications to
''generic/tclPkg.c'':

 * ''Tcl_PackageObjCmd'' needs to be able to parse the new options and
   set the bitmapped flag.

 * ''Tcl_PkgRequireEx'' is modified to accept a bitmapped flag instead
   of the ''exact'' option.

 * The 0x0001 bitmap position is used to map for ''exact'' preserving
   the existing behavior of the ''Tcl_PackageObjCmd'' and
   ''Tcl_PkgRequireEx'' functions.

 * These bitmapped flags are defined exact, current, and multiple:

|#define PKG_EXACT    0x01   /* Use the exact version - as used for exact */
|#define PKG_CURRENT  0x02   /* Load into current namespace, not GLOBAL */
|#define PKG_MULTIPLE 0x04   /* Allow loading multiple copies of a package */

 * ''Tcl_PkgRequireEx'' is modified to process the MULTIPLE and
   CURRENT flags.

 * The Tcl tests have been reworked to understand the new error
   returns, etc.  Running "make tests" will accept the new code.

Minimal testing has been done using pure Tcl packages.  

<
|
<
|
|
|
|
|
|
|
|
>

|

|

|
|
|

|
|

|

|
|

|
|

|

|

|

|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
<
>
|
|
|
|
<
>
|
|
|
|
<
>
|
<
>
|

|
|
|
<
>
|
|
|
<
>
|
|
|
<
>
|
|
|
<
|
>

|

|

|

|

|

|
|

|
|
|

|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95

96
97
98
99
100
101
102

103
104
105
106
107

108
109
110
111
112

113
114

115
116
117
118
119
120
121
122
123

124
125
126
127

128
129
130
131

132
133
134
135

136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173

# TIP 92: Move Package Load Decisions to Application Developer

	Author:		Clif Flynt <[email protected]>
	State:		Withdrawn
	Type:		Project
	Vote:		Pending
	Created:	13-May-2002
	Post-History: 
	Tcl-Version:	8.4
	Keywords:	package require, namespace, pkg_mkIndex
-----

# Abstract

This TIP makes the loading of packages far more flexible, so as to
better support their use by application authors in situations above
and beyond those foreseen by the developer of the package.

# Overview

I believe that we've been misdirecting our efforts in solutions to
the Package issue.

 * The modifications to _pkg\_mkIndex_ give the library author \(or
   package builder\) the ability to define when a package will be
   loaded \(immediate or deferred\), which restricts an application
   developer to the decisions made by the library author.

 * If a package is built to be loaded immediately, it is loaded into
   the top-level namespace.  This breaks previous tricks to force a
   package to load inside an existing namespace.

These techniques limit the application writer to the behavior \(and
uses\) envisioned by the package author and is counter to the concept
that application developer best understands how they need a tool to
perform for their application.  The Tcl community, in particular, has
grown largely because the tools have had applications far beyond those
imagined by their initial developers.

Moving the decisions about when and how to load a package from
_pkg\_mkIndex_ to the _package require_ command allows the
application writer the freedom to find new styles of use that the
package author may not have conceived.

Being able to force an immediate load into the current namespace
rather than always loading packages into the global scope provides
support for lightweight object style data structures without the need
for extensions like Incr Tcl, OOTcl, etc.

Loading a package/namespace into the current namespace provides
mechanisms for lightweight inheritance, and since namespaces can
contain both code and data, loading a namespace multiple times \(in
separate namespaces\) is a lightweight aggregation model.

I do not propose that this power removes the need for full object
oriented programming models within the Tcl community.  However, I
believe that putting the power to develop these lightweight models
into the application developer provides the developer with a more
versatile tool kit than they currently have.  \(One that I've been
using for several years, with workarounds.\)

This proposal is to add new flags to the package require command,
allowing an application developer to determine when and how to load a
package.

 -current: Load the package into the current namespace rather than the
	global space.  Implies immediate.

 -multiple: Allow loading multiple copies of this package, for use
	with _-current_ when the application programmer wishes to
	create multiple nested copies of a package.

 -immediate: Load immediately, rather than defer loading the package
	until needed.  This is the default behavior with Tcl 8.3 and
	later.

 -defer: Load package when required.  The default with Tcl 8.2 and
	earlier, or when _pkg\_mkIndex -lazy_ used with Tcl 8.3.

 -exact: No change to this option.  Requires an exact Major/Minor
	revision match to be an acceptable package.

# Script Example

The code below implements a simple stack object that can be merged
into other namespaces to create objects that contain individual
stacks.

	 package provide stack 1.0
	 namespace eval stack {
	     namespace export push pop peek size
	     variable stack ""

	     proc push {val} {
	         variable stack;
	         lappend stack $val

	     }

	     proc pop {} {
	         variable stack;
	         set rtn [lindex $stack end]
	         set stack [lrange $stack 0 end-1]
	         return $rtn

	         }

	     proc peek {{pos end}} {
	         variable stack;
	         return [lindex $stack $pos]

	     }

	     proc size {} {
	         variable stack;
	         return [llength $stack]

	     }

	 }

With this data structure available, the guts of a Tower of Hanoi
puzzle becomes simple:

	 namespace eval left {
	         package require -current -multiple  stack 1.0
	         namespace import [namespace current]::stack::*

	     }   
	 namespace eval center {
	         package require -current -multiple  stack 1.0
	         namespace import [namespace current]::stack::*

	     }
	 namespace eval right {
	         package require -current -multiple  stack 1.0
	         namespace import [namespace current]::stack::*

	     }

	 proc move {from to} {
	         ${to}::push [${from}::pop]

	     }

This creates 3 'objects' each of which contains a private stack with
the stack methods.

# Reference Implementation

A reference implementation of the _-current_ and _-multiple_
flags has been created for Tcl 8.4a4 and is available at
<http://noucorp.com/PkgPatch8.4.zip>

The implementation required these modifications to
_generic/tclPkg.c_:

 * _Tcl\_PackageObjCmd_ needs to be able to parse the new options and
   set the bitmapped flag.

 * _Tcl\_PkgRequireEx_ is modified to accept a bitmapped flag instead
   of the _exact_ option.

 * The 0x0001 bitmap position is used to map for _exact_ preserving
   the existing behavior of the _Tcl\_PackageObjCmd_ and
   _Tcl\_PkgRequireEx_ functions.

 * These bitmapped flags are defined exact, current, and multiple:

		#define PKG_EXACT    0x01   /* Use the exact version - as used for exact */
		#define PKG_CURRENT  0x02   /* Load into current namespace, not GLOBAL */
		#define PKG_MULTIPLE 0x04   /* Allow loading multiple copies of a package */

 * _Tcl\_PkgRequireEx_ is modified to process the MULTIPLE and
   CURRENT flags.

 * The Tcl tests have been reworked to understand the new error
   returns, etc.  Running "make tests" will accept the new code.

Minimal testing has been done using pure Tcl packages.  

Name change from tip/93.tip to tip/93.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181

182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197

198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236

TIP:            93
Title:          Get/Delete Enhancement for the Tk Text Widget
Version:        $Revision: 1.8 $
Author:         Craig Votava <[email protected]>
Author:         Donal K. Fellows <[email protected]>
Author:         Jeff Hobbs <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        28-Dec-2001
Post-History:   
Tcl-Version:    8.4

~ Abstract

The Tk Text widget provides text tags, which are a very powerful
thing.  However, the current implementation does not provide an
efficient way for a Tk Text widget programmer to extract (get) all of
the actual text that has a given text tag.  This TIP proposes to
enhance the Tk Text widget to provide this functionality.

~ Rationale

While writing applications using the Tk Text widget, I find myself
wanting to extract all of the text that has a given text tag.
Although this is possible with the existing functionality of the Tk
Text widget, it can become extremely inefficient, depending on your
application.

Consider the example where we load a text widget with say, the
contents of a scene from a play, and we tag all of the spoken passages
with the name of the character that utters them.  How can we provide
an efficient way to allow an end user to print out all the spoken text
for a single given character?

My initial impulse was to design something like this (please excuse
the use of Perl-Tk syntax, that's what I'm most comfortable with):

|   $txt->tagGet($tag);

The problem with this design is what should this return? A string?  An
list? If a list, should it be a list of each tagged character?  A list
of strings containing all contiguous characters? In addition, Steve
Lidie points out that the corresponding tagDelete() command would
also have to be modified to mimic this change as well. This line of
thought got icky pretty fast.

My second impulse was to try to induce this functionality with as much
existing stuff as possible.  The ''tagRanges'' command returns a list
of index pairs for all contiguous characters with a given tag.  The
thought here was to combine that command with the ''get'' command to
get all the text with a given tag:

|   $txt->get( $txt->tagRanges($tag) );

This design seems to fit in well with much of the existing
functionality of the text widget.  The main problem here is that the
existing ''get'' command only allows for either one or two arguments,
and returns a single string.  For this design to be implemented, the
get interface would need to be enhanced.  This is the design I chose
to implement as a reference (prototype) implementation.  I believe
that the functionality should be provided in the Tk Text widget, and
believe that this prototype solution could be turned into a production
solution.  However those decisions I happily leave up to the Tk
developers who are more knowledgeable about the Tk Text implementation
than myself.

An additional concern here involves the corresponding text delete
command. Should the delete command also be modified in a similar way so
that it has this same functionality too? It seems like it should.

~ Specification

This specification will only describe how the reference implementation
was produced.  If it is decided that an alternate design is needed for
the final production solution, this specification can be scrapped.

The goal of this design is to enhance the Tk Text ''get'' command from
accepting only one or two arguments, to accepting any number of 1
(+NULL) or 2 arguments sets.  The Tcl-Tk manual page description would
change from this:

|   $t get i1 ?i2?

to something like this:

|   $t get i1 ?i2? ?(i3 ?i4? ...)?

By providing this enhancement, we give the programmer with the ability
to efficiently ''get'' all of the text that is tagged with a given
tag.  The programmer would do this by using a compound statement
utilizing the existing ''tag ranges'' command along with the enhanced
''get'' command, as follows (the examples are using the Perl-Tk
syntax):

|   $txt->get( $txt->tagRanges($tag) );

In addition, the enhancement will preserve compatibility with all of
the existing Tk ''get'' commands currently in use.

Currently, the ''get'' command simply returns a single string
containing all of the characters specified by the first and
(optionally) the second argument(s).  The enhanced ''get'' command
will preserve this existing functionality:

|   my $chr = $text->get('1.0');

 > This command functions exactly the same as the original ''get''
   command.  It will return a string containing the first character
   from the first line.

|   my $str = $text->get('1.0', '1.0 lineend');

 > This command functions exactly the same as the original ''get''
   command.  It will return a string containing all of the characters
   on the first line.

However, if the programmer provides more than one or two argument(s),
the enhanced ''get'' command will return a list of strings, just as if
the original ''get'' command was called multiple times and the results
were loaded into a programmer-defined list:

|   my @lines = $text->get('1.0', '1.0 lineend', '2.0');

 > This command returns a list whose first element (''$lines[[0]]'')
   is a string containing all of the characters from the first line,
   and the second element (''$lines[[1]]'') is a string containing
   just the first character of the second line.

|   my @lines = $text->get('1.0', '', '2.0', '2.0 lineend');

 > This command returns a list whose first element (''$lines[[0]]'')
   is a string containing just the first character from the first
   line, and the second element (''$lines[[1]]'') is a string
   containing all of the characters on the second line.

|   my @lines = $text->get('1.0', '1.0 lineend', '2.0', '2.0 lineend');

 > This command returns a list whose first element (''$lines[[0]]'')
   is a string containing the all of the characters from the first
   line, and the second element (''$lines[[1]]'') is a string
   containing all of the characters from the second line.

All of this paves the way for the programmer to use the compound command:

|   my @lines = $txt->get( $txt->tagRanges($tag) );

 > This command returns a list whose elements are strings of all the
   contiguous characters tagged with a given tag.

~ Example

The following Perl-Tk code illustrates how the enhanced ''get''
command could be used with the existing ''tag ranges'' command to
efficiently extract all of the text that is tagged with a given tag.

|   #! /usr/local/bin/perl -w
|   
|   require 5.005;
|   
|   use strict;
|   use English;
|   
|   use Tk;
|   
|   # Create main window with button and text widget in it...
|   my $top = MainWindow->new;
|   my $btn = $top->Button(-text=>'print odd lines')->pack;
|   my $txt = $top->Scrolled('Text', -relief=>'sunken', -borderwidth=>'2',
|	-setgrid=>'true', -height=>'30', -scrollbars=>'e');
|   $txt->pack(-expand=>'yes', -fill=>'both');
|   $btn->configure(-command=>sub{&GetText($txt)} );
|   
|   # Populate text widget with lines tagged odd and even...
|   my $lno;
|   my $oddeven;
|   foreach $lno (1..20) {
|	if($lno % 2) { $oddeven = "odd" } else { $oddeven = "even" };
|	$lno = "Line $lno ($oddeven)\n";
|	$txt->insert ('end', $lno, $oddeven);
|   }

|   
|   # Do the main processing loop...
|   MainLoop();
|   
|   sub GetText {
|	my $txtobj = shift;
|
|	$txtobj->tag('configure', 'odd', -background=>'lightblue');
|	$txtobj->tag('configure', 'even', -background=>'lightgreen');
|
|	# This is the goal of all the work...
|	my @lines = $txtobj->get($txtobj->tagRanges('odd'));
|
|	print STDERR join("", @lines);
|   }

~ Reference Implementation

The patch for this reference implementation has been posted to the ptk
mailing list. An archived version is available at:

http://faqchest.dynhost.com/prgm/ptk-l/ptk-01/ptk-0112/ptk-011201/ptk01122716_24437.html

I have written and run a single benchmark test (in Perl-Tk) to compare
this reference implementation against a traditional method of
extracting all text with a specific tag.  The results of this specific
benchmark test (tagging odd lines ''odd'' and even lines ''even'' in a
text window with 2000 entries), run on my computer are as follows:

|Reference Implementation   0.105 CPU Seconds (average over 10 runs)
|Traditional Method         0.443 CPU Seconds (average over 10 runs)

I believe that both the CPU the efficiency, and the coding efficiency
that this reference implementation provides, merit the change to the
Tk Widget.  In addition to the ''get'' enhancement, the symmetrical changes would be make to the ''delete'' subcommand.

''The patch has received little testing so far, so any testing is
encouraged.''

~ Notes on Equivalent Behaviour in Tcl/Tk

Tcl has less of a need for this than Perl because it has a striding
[[foreach]] allowing the list of indices returned by the [[$t tag
ranges]] subcommand to be traversed in a straight-forward fashion, but
this sort of functionality is still useful.  The motivating examples
above become (in order):

|   set lines [$t get 1.0 "1.0 lineend" 2.0]
|   set lines [$t get 1.0 {} 2.0 "2.0 lineend"]
|   set lines [$t get 1.0 "1.0 lineend" 2.0 "2.0 lineend"]
|   set lines [eval $t get [$t tag ranges]]

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|

|

|

|
|
|

|

|

|

|

|

|

|

|

|
|
|

|

|

|

|

|

|

|

|

|

|

|

|
|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<
|
>
|

|

|

|
|

|
|

|

|
|

|

|
|

|

|
|
|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179

180
181
182
183
184
185
186
187
188
189
190
191
192
193
194

195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236

# TIP 93: Get/Delete Enhancement for the Tk Text Widget

	Author:         Craig Votava <[email protected]>
	Author:         Donal K. Fellows <[email protected]>
	Author:         Jeff Hobbs <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        28-Dec-2001
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

The Tk Text widget provides text tags, which are a very powerful
thing.  However, the current implementation does not provide an
efficient way for a Tk Text widget programmer to extract \(get\) all of
the actual text that has a given text tag.  This TIP proposes to
enhance the Tk Text widget to provide this functionality.

# Rationale

While writing applications using the Tk Text widget, I find myself
wanting to extract all of the text that has a given text tag.
Although this is possible with the existing functionality of the Tk
Text widget, it can become extremely inefficient, depending on your
application.

Consider the example where we load a text widget with say, the
contents of a scene from a play, and we tag all of the spoken passages
with the name of the character that utters them.  How can we provide
an efficient way to allow an end user to print out all the spoken text
for a single given character?

My initial impulse was to design something like this \(please excuse
the use of Perl-Tk syntax, that's what I'm most comfortable with\):

	   $txt->tagGet($tag);

The problem with this design is what should this return? A string?  An
list? If a list, should it be a list of each tagged character?  A list
of strings containing all contiguous characters? In addition, Steve
Lidie points out that the corresponding tagDelete\(\) command would
also have to be modified to mimic this change as well. This line of
thought got icky pretty fast.

My second impulse was to try to induce this functionality with as much
existing stuff as possible.  The _tagRanges_ command returns a list
of index pairs for all contiguous characters with a given tag.  The
thought here was to combine that command with the _get_ command to
get all the text with a given tag:

	   $txt->get( $txt->tagRanges($tag) );

This design seems to fit in well with much of the existing
functionality of the text widget.  The main problem here is that the
existing _get_ command only allows for either one or two arguments,
and returns a single string.  For this design to be implemented, the
get interface would need to be enhanced.  This is the design I chose
to implement as a reference \(prototype\) implementation.  I believe
that the functionality should be provided in the Tk Text widget, and
believe that this prototype solution could be turned into a production
solution.  However those decisions I happily leave up to the Tk
developers who are more knowledgeable about the Tk Text implementation
than myself.

An additional concern here involves the corresponding text delete
command. Should the delete command also be modified in a similar way so
that it has this same functionality too? It seems like it should.

# Specification

This specification will only describe how the reference implementation
was produced.  If it is decided that an alternate design is needed for
the final production solution, this specification can be scrapped.

The goal of this design is to enhance the Tk Text _get_ command from
accepting only one or two arguments, to accepting any number of 1
\(\+NULL\) or 2 arguments sets.  The Tcl-Tk manual page description would
change from this:

	   $t get i1 ?i2?

to something like this:

	   $t get i1 ?i2? ?(i3 ?i4? ...)?

By providing this enhancement, we give the programmer with the ability
to efficiently _get_ all of the text that is tagged with a given
tag.  The programmer would do this by using a compound statement
utilizing the existing _tag ranges_ command along with the enhanced
_get_ command, as follows \(the examples are using the Perl-Tk
syntax\):

	   $txt->get( $txt->tagRanges($tag) );

In addition, the enhancement will preserve compatibility with all of
the existing Tk _get_ commands currently in use.

Currently, the _get_ command simply returns a single string
containing all of the characters specified by the first and
\(optionally\) the second argument\(s\).  The enhanced _get_ command
will preserve this existing functionality:

	   my $chr = $text->get('1.0');

 > This command functions exactly the same as the original _get_
   command.  It will return a string containing the first character
   from the first line.

	   my $str = $text->get('1.0', '1.0 lineend');

 > This command functions exactly the same as the original _get_
   command.  It will return a string containing all of the characters
   on the first line.

However, if the programmer provides more than one or two argument\(s\),
the enhanced _get_ command will return a list of strings, just as if
the original _get_ command was called multiple times and the results
were loaded into a programmer-defined list:

	   my @lines = $text->get('1.0', '1.0 lineend', '2.0');

 > This command returns a list whose first element \(_$lines[[0]](0.md)_\)
   is a string containing all of the characters from the first line,
   and the second element \(_$lines[[1]](1.md)_\) is a string containing
   just the first character of the second line.

	   my @lines = $text->get('1.0', '', '2.0', '2.0 lineend');

 > This command returns a list whose first element \(_$lines[[0]](0.md)_\)
   is a string containing just the first character from the first
   line, and the second element \(_$lines[[1]](1.md)_\) is a string
   containing all of the characters on the second line.

	   my @lines = $text->get('1.0', '1.0 lineend', '2.0', '2.0 lineend');

 > This command returns a list whose first element \(_$lines[[0]](0.md)_\)
   is a string containing the all of the characters from the first
   line, and the second element \(_$lines[[1]](1.md)_\) is a string
   containing all of the characters from the second line.

All of this paves the way for the programmer to use the compound command:

	   my @lines = $txt->get( $txt->tagRanges($tag) );

 > This command returns a list whose elements are strings of all the
   contiguous characters tagged with a given tag.

# Example

The following Perl-Tk code illustrates how the enhanced _get_
command could be used with the existing _tag ranges_ command to
efficiently extract all of the text that is tagged with a given tag.

	   #! /usr/local/bin/perl -w

	   require 5.005;

	   use strict;
	   use English;

	   use Tk;

	   # Create main window with button and text widget in it...
	   my $top = MainWindow->new;
	   my $btn = $top->Button(-text=>'print odd lines')->pack;
	   my $txt = $top->Scrolled('Text', -relief=>'sunken', -borderwidth=>'2',
		-setgrid=>'true', -height=>'30', -scrollbars=>'e');
	   $txt->pack(-expand=>'yes', -fill=>'both');
	   $btn->configure(-command=>sub{&GetText($txt)} );

	   # Populate text widget with lines tagged odd and even...
	   my $lno;
	   my $oddeven;
	   foreach $lno (1..20) {
		if($lno % 2) { $oddeven = "odd" } else { $oddeven = "even" };
		$lno = "Line $lno ($oddeven)\n";
		$txt->insert ('end', $lno, $oddeven);

	   }

	   # Do the main processing loop...
	   MainLoop();

	   sub GetText {
		my $txtobj = shift;

		$txtobj->tag('configure', 'odd', -background=>'lightblue');
		$txtobj->tag('configure', 'even', -background=>'lightgreen');

		# This is the goal of all the work...
		my @lines = $txtobj->get($txtobj->tagRanges('odd'));

		print STDERR join("", @lines);

	   }

# Reference Implementation

The patch for this reference implementation has been posted to the ptk
mailing list. An archived version is available at:

<http://faqchest.dynhost.com/prgm/ptk-l/ptk-01/ptk-0112/ptk-011201/ptk01122716\_24437.html>

I have written and run a single benchmark test \(in Perl-Tk\) to compare
this reference implementation against a traditional method of
extracting all text with a specific tag.  The results of this specific
benchmark test \(tagging odd lines _odd_ and even lines _even_ in a
text window with 2000 entries\), run on my computer are as follows:

	Reference Implementation   0.105 CPU Seconds (average over 10 runs)
	Traditional Method         0.443 CPU Seconds (average over 10 runs)

I believe that both the CPU the efficiency, and the coding efficiency
that this reference implementation provides, merit the change to the
Tk Widget.  In addition to the _get_ enhancement, the symmetrical changes would be make to the _delete_ subcommand.

_The patch has received little testing so far, so any testing is
encouraged._

# Notes on Equivalent Behaviour in Tcl/Tk

Tcl has less of a need for this than Perl because it has a striding
[foreach] allowing the list of indices returned by the [$t tag
ranges] subcommand to be traversed in a straight-forward fashion, but
this sort of functionality is still useful.  The motivating examples
above become \(in order\):

	   set lines [$t get 1.0 "1.0 lineend" 2.0]
	   set lines [$t get 1.0 {} 2.0 "2.0 lineend"]
	   set lines [$t get 1.0 "1.0 lineend" 2.0 "2.0 lineend"]
	   set lines [eval $t get [$t tag ranges]]

# Copyright

This document has been placed in the public domain.

Name change from tip/94.tip to tip/94.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51

TIP:		94
Title:		Add Listbox -activestyle Option
Version:	$Revision: 1.5 $
Author:		Jeff Hobbs <[email protected]>
State:		Final
Type:		Project
Created:	29-May-2002
Tcl-Version:	8.4
Vote:		Done
Post-History:	

~ Abstract

This TIP proposes to add a [[-activestyle]] option to the [[listbox]]
widget that would control what style the active element has when the
widget has focus (currently hard-coded to be underlined).

~ Rationale

Tk has always had an underline on the active item in listboxes, which
is shown when the listbox has focus.  However this in incompatible
with the style of listboxes on Windows, especially as used in drop-down
boxes.  They instead have a thin dotted line to indicate the active
item.  In order to improve native look and feel, we would allow the
user to request the style which indicates the active item.

~ Specification

|    $listbox configure -activestyle none|underline|dotbox

The default would be underline, which stays consistent with the
current behavior.  ''dotbox'' is the Windows style, which is
essentially the dotted focus ring that any item with focus receives.
While Windows does have a special API (''DrawFocusRect'') to draw
this, it should be possible with the features of the dash patch to
emulate on Unix.  It may not be possible to draw a dotbox easily on
MacOS, in which case the option will be allowed, but nothing would be
drawn (rather than dropping back to underline).

~ Reference Implementation

This implementation is simple and would only extend one check in
''DisplayListbox'' for whether the underline should be drawn.

File: ''tcl/generix/tkListbox.c''

Function: ''DisplayListbox''

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|

|

|

|

|

|

|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51

# TIP 94: Add Listbox -activestyle Option

	Author:		Jeff Hobbs <[email protected]>
	State:		Final
	Type:		Project
	Created:	29-May-2002
	Tcl-Version:	8.4
	Vote:		Done
	Post-History:	
-----

# Abstract

This TIP proposes to add a [-activestyle] option to the [listbox]
widget that would control what style the active element has when the
widget has focus \(currently hard-coded to be underlined\).

# Rationale

Tk has always had an underline on the active item in listboxes, which
is shown when the listbox has focus.  However this in incompatible
with the style of listboxes on Windows, especially as used in drop-down
boxes.  They instead have a thin dotted line to indicate the active
item.  In order to improve native look and feel, we would allow the
user to request the style which indicates the active item.

# Specification

	    $listbox configure -activestyle none|underline|dotbox

The default would be underline, which stays consistent with the
current behavior.  _dotbox_ is the Windows style, which is
essentially the dotted focus ring that any item with focus receives.
While Windows does have a special API \(_DrawFocusRect_\) to draw
this, it should be possible with the features of the dash patch to
emulate on Unix.  It may not be possible to draw a dotbox easily on
MacOS, in which case the option will be allowed, but nothing would be
drawn \(rather than dropping back to underline\).

# Reference Implementation

This implementation is simple and would only extend one check in
_DisplayListbox_ for whether the underline should be drawn.

File: _tcl/generix/tkListbox.c_

Function: _DisplayListbox_

# Copyright

This document has been placed in the public domain.

Name change from tip/95.tip to tip/95.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115

TIP:            95
Title:          Add [wm attributes] Command
Version:        $Revision: 1.5 $
Author:         Jeff Hobbs <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        29-May-2002
Post-History:   
Tcl-Version:    8.4

~ Abstract

This TIP proposes adding a [[wm attributes]] command in order to
control platform-specific aspects of a toplevel.  In addition, it
proposes making [[wm]] a ''Tcl_Obj''-based command and centralizing
the common functionality.

~ Rationale

While Tk has been proven useful over time as a cross-platform toolkit,
it has some serious drawbacks in acceptance due to small, but
important, lacking functionality in the handling of toplevel windows
on certain platforms.  Having a toplevel stay on top on Windows is a
prime example of a commonly requested feature for which there is no
core support.  Mac/Tk has long had a special unknown command to
support special styles needed for proper "look and feel" there.  I
hereby propose a [[wm attributes]] command (like the [[file
attributes]] command) to providing platform-specific functionality for
toplevel windows.

~ Specification

|   wm attributes $toplevel ...
|   [[WINDOWS]]
|        ?-disabled ?bool??
|        ?-toolwindow ?bool??
|        ?-topmost ?bool??
|        ?-minimizebox ?bool??
|        ?-maximizebox ?bool??
|        ?-sysmenu ?bool??
|   [[MAC]]
|        ?-style ?alert|moveablealert|modal|moveablemodal|
|                 floating|document ...??
|   [[UNIX]]
|        <empty at this time>

Because Tk started off on Unix, most potential attributes are already
in the wm command, whether they really make sense across platforms or
not (some equivalent has been emulated in most cases).  If someone
feels that there are some X window attributes that Tk does not
support, this would be the place to put them.

On Windows, most of the attribute settings can be combined (they are
OR-ed bits of special style fields on a toplevel), which is why they
are set or retrieved as booleans.  The names reflect their Win32 API
bits.  For Windows and the Mac, the naming of attributes and/or styles
mirrors the native API as closely as possible, as we are exposing
native platform functionality in this command.  More specifics about
Windows styles can be seen here:

 > http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnwui/html/msdn_styles32.asp

For Macs, styles are mutually exclusive, so you set one of a list of
available styles.  MacOS has nine standard window types and eight
standard floating window types.  More information can be seen here:

 > http://developer.apple.com/techpubs/quicktime/qtdevdocs/INMAC/MACWIN/imWindowMgrRef.3.htm

~ Reference Implementation

Mac/Tk has a reference implementation already that would just adapt
the existing ''unsupported1'' code to ''wm attributes''.  There are
two variant patches for the Windows work in Tk patch 553926 at SF.

File: ''tk/mac/tkMacWm.c''

File: ''tk/win/tkWinWm.c''

File: ''tk/unix/tkUnixWm.c''

Function: ''Tk_WmCmd''

~ Comments

Several names have been used for a command with similar functionality.
Mac/Tk uses the ''style'' command, as does Tk Patch 553926.  This is
only for platform-specific configuration of toplevel windows, and it
not necessarily limited to style.  I considered ''wm configure'', but
I chose ''wm attributes'' because that worked just as well and had the
equivalent of ''file attributes'' to support the naming.

Windows toplevels could have more special styles like
''-transparent'', Windows scrollbars on the toplevel and a few other
window styles that the Win32 API supports.  Only the styles that have
had user requests are supported at this time.  We may want to add
''-caption'' and ''-dialogmodal'' support if these seem useful.

It was recommended that commonality be reached where possible, but
this tip addresses most specifically what isn't common across the
minute aspects of toplevel window handing on different platforms.  For
examples, for Windows to allow TOPMOST, it keeps a special list of the
topmost windows.  This manager-level support is necessary to avoid
contention amongst topmost windows.  Macintosh has many special dialog
and window styles to represent both changing UI design over time, as
well as the latest in UI design that can be reached standard for Mac
toplevels but not Unix or Windows ones.

It may be that certain functionality will become cross-platform as
native APIs develop, but this is meant to allow access to key native
look and feel features that Tk lacks for serious developers now.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|
|

|

|
|
|
|
|
|
|
|
|
|
|
|
|

|

|
|

|

|

|

|

|

|

|

|

|

|

|
|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115

# TIP 95: Add [wm attributes] Command

	Author:         Jeff Hobbs <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        29-May-2002
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

This TIP proposes adding a [wm attributes] command in order to
control platform-specific aspects of a toplevel.  In addition, it
proposes making [wm] a _Tcl\_Obj_-based command and centralizing
the common functionality.

# Rationale

While Tk has been proven useful over time as a cross-platform toolkit,
it has some serious drawbacks in acceptance due to small, but
important, lacking functionality in the handling of toplevel windows
on certain platforms.  Having a toplevel stay on top on Windows is a
prime example of a commonly requested feature for which there is no
core support.  Mac/Tk has long had a special unknown command to
support special styles needed for proper "look and feel" there.  I
hereby propose a [wm attributes] command \(like the [file
attributes] command\) to providing platform-specific functionality for
toplevel windows.

# Specification

	   wm attributes $toplevel ...
	   [[WINDOWS]]
	        ?-disabled ?bool??
	        ?-toolwindow ?bool??
	        ?-topmost ?bool??
	        ?-minimizebox ?bool??
	        ?-maximizebox ?bool??
	        ?-sysmenu ?bool??
	   [[MAC]]
	        ?-style ?alert|moveablealert|modal|moveablemodal|
	                 floating|document ...??
	   [[UNIX]]
	        <empty at this time>

Because Tk started off on Unix, most potential attributes are already
in the wm command, whether they really make sense across platforms or
not \(some equivalent has been emulated in most cases\).  If someone
feels that there are some X window attributes that Tk does not
support, this would be the place to put them.

On Windows, most of the attribute settings can be combined \(they are
OR-ed bits of special style fields on a toplevel\), which is why they
are set or retrieved as booleans.  The names reflect their Win32 API
bits.  For Windows and the Mac, the naming of attributes and/or styles
mirrors the native API as closely as possible, as we are exposing
native platform functionality in this command.  More specifics about
Windows styles can be seen here:

 > <http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnwui/html/msdn\_styles32.asp>

For Macs, styles are mutually exclusive, so you set one of a list of
available styles.  MacOS has nine standard window types and eight
standard floating window types.  More information can be seen here:

 > <http://developer.apple.com/techpubs/quicktime/qtdevdocs/INMAC/MACWIN/imWindowMgrRef.3.htm>

# Reference Implementation

Mac/Tk has a reference implementation already that would just adapt
the existing _unsupported1_ code to _wm attributes_.  There are
two variant patches for the Windows work in Tk patch 553926 at SF.

File: _tk/mac/tkMacWm.c_

File: _tk/win/tkWinWm.c_

File: _tk/unix/tkUnixWm.c_

Function: _Tk\_WmCmd_

# Comments

Several names have been used for a command with similar functionality.
Mac/Tk uses the _style_ command, as does Tk Patch 553926.  This is
only for platform-specific configuration of toplevel windows, and it
not necessarily limited to style.  I considered _wm configure_, but
I chose _wm attributes_ because that worked just as well and had the
equivalent of _file attributes_ to support the naming.

Windows toplevels could have more special styles like
_-transparent_, Windows scrollbars on the toplevel and a few other
window styles that the Win32 API supports.  Only the styles that have
had user requests are supported at this time.  We may want to add
_-caption_ and _-dialogmodal_ support if these seem useful.

It was recommended that commonality be reached where possible, but
this tip addresses most specifically what isn't common across the
minute aspects of toplevel window handing on different platforms.  For
examples, for Windows to allow TOPMOST, it keeps a special list of the
topmost windows.  This manager-level support is necessary to avoid
contention amongst topmost windows.  Macintosh has many special dialog
and window styles to represent both changing UI design over time, as
well as the latest in UI design that can be reached standard for Mac
toplevels but not Unix or Windows ones.

It may be that certain functionality will become cross-platform as
native APIs develop, but this is meant to allow access to key native
look and feel features that Tk lacks for serious developers now.

# Copyright

This document has been placed in the public domain.

Name change from tip/96.tip to tip/96.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78

TIP:		96
Title:		Add [tk caret] Command and Tk_SetCaretPos API
Version:	$Revision: 1.4 $
Author:		Jeff Hobbs <[email protected]>
State:		Final
Type:		Project
Created:	29-May-2002
Tcl-Version:	8.4
Vote:		Done
Post-History:	

~ Abstract

This TIP proposes to add a [[tk caret]] command and [[Tk_SetCaretPos]]
C API to manage ''carets'' in Tk.  ''caret'' is the term for where
text of graphics will be inserted.  It is necessary for correct
accessibility functionality (to know where to shift focus), and for
location the IME or XIM input box to handle complex character input
(e.g. Asian character sets).

~ Rationale

Tk has up until now not managed the caret within its windows.  This
has led to it being not Windows Accessibility certifiable.  On
Windows, this also cause the IME window to show in the top-left corner
of the window (somewhat OK for entries, bad for text widgets).  On X,
this meant that Tk had to use the root-window style XIM input, which
is a poor second to over-the-spot XIM input.  Managing the caret
corrects these problems.

Exposing the functionality at the Tcl level allows extension writers
to use the functionality without having to make Tk version API checks.
A simple

|   catch {tk caret $w -x $x -y $y}

will suffice to work across versions.

~ Specification

|   tk caret window ?-x xPos? ?-y yPos? ?-height height?
|   void Tk_SetCaretPos (Tk_Window tkwin, int x, int y, int height)

''-height'' specifies the height of the input line and is important
because Windows and X interpret the x,y coordinates differently
(top-left and bottom-left respectively), so it must be adjusted by
height for X.  If no height is specified, the height of the window
passed in will be used.

I chose to use the ''-option value'' format because it allows for
future extensibility.  There are APIs to control the font and other
aspects of the IME/XIM input window that appears, but management of
these is not covered in this tip.

~ Reference Implementation

The Tk_SetCaretPos implementation is currently in the core.  It needs
to be modified to move the caret information to be per display,
instead of per process.

File: ''tk/mac/tkMacXStubs.c''

File: ''tk/win/tkWinX.c''

File: ''tk/unix/tkUnixKey.c''

Function: ''Tk_SetCaretPos''

~ Comments

The current implementation at the C level was implemented with the
assistance of Keiichi Takahashi (BitWalk), Koiichi Yamamoto, Moo Kim
(NCR), and Mike Fabian (SuSE).  It has been tested on Windows
98/2000/XP and SuSE 7.3 using kinput/canna2.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|
|

|

|

|

|

|

|

|
|

|

|

|

|

|

|

|

|

|

|

|
|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78

# TIP 96: Add [tk caret] Command and Tk_SetCaretPos API

	Author:		Jeff Hobbs <[email protected]>
	State:		Final
	Type:		Project
	Created:	29-May-2002
	Tcl-Version:	8.4
	Vote:		Done
	Post-History:	
-----

# Abstract

This TIP proposes to add a [tk caret] command and [Tk_SetCaretPos]
C API to manage _carets_ in Tk.  _caret_ is the term for where
text of graphics will be inserted.  It is necessary for correct
accessibility functionality \(to know where to shift focus\), and for
location the IME or XIM input box to handle complex character input
\(e.g. Asian character sets\).

# Rationale

Tk has up until now not managed the caret within its windows.  This
has led to it being not Windows Accessibility certifiable.  On
Windows, this also cause the IME window to show in the top-left corner
of the window \(somewhat OK for entries, bad for text widgets\).  On X,
this meant that Tk had to use the root-window style XIM input, which
is a poor second to over-the-spot XIM input.  Managing the caret
corrects these problems.

Exposing the functionality at the Tcl level allows extension writers
to use the functionality without having to make Tk version API checks.
A simple

	   catch {tk caret $w -x $x -y $y}

will suffice to work across versions.

# Specification

	   tk caret window ?-x xPos? ?-y yPos? ?-height height?
	   void Tk_SetCaretPos (Tk_Window tkwin, int x, int y, int height)

_-height_ specifies the height of the input line and is important
because Windows and X interpret the x,y coordinates differently
\(top-left and bottom-left respectively\), so it must be adjusted by
height for X.  If no height is specified, the height of the window
passed in will be used.

I chose to use the _-option value_ format because it allows for
future extensibility.  There are APIs to control the font and other
aspects of the IME/XIM input window that appears, but management of
these is not covered in this tip.

# Reference Implementation

The Tk\_SetCaretPos implementation is currently in the core.  It needs
to be modified to move the caret information to be per display,
instead of per process.

File: _tk/mac/tkMacXStubs.c_

File: _tk/win/tkWinX.c_

File: _tk/unix/tkUnixKey.c_

Function: _Tk\_SetCaretPos_

# Comments

The current implementation at the C level was implemented with the
assistance of Keiichi Takahashi \(BitWalk\), Koiichi Yamamoto, Moo Kim
\(NCR\), and Mike Fabian \(SuSE\).  It has been tested on Windows
98/2000/XP and SuSE 7.3 using kinput/canna2.

# Copyright

This document has been placed in the public domain.

Name change from tip/97.tip to tip/97.md.

1
2
3
4
5
6
7
8
9
10
11
12

13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93

TIP:		97
Title:		Moving Vertices of Canvas Items
Version:	$Revision: 1.10 $
Author:		Agnar Renolen <[email protected]>
Author:		Donal K. Fellows <[email protected]>
State:		Final
Type:		Project
Tcl-Version:	8.6
Vote:		Done
Created:	07-Jun-2002
Post-History:	
Keywords:	Tk

~ Abstract

This TIP proposes a canvas subcommand (or possibly two) that allows for
replacing characters in text objects and to move individual vertices of line
and polygon items.

~ Rationale

Interactive graphics programs often allow users to modify shapes of objects by
selecting and dragging the vertices. Moving one vertex of a canvas item in the
current version of Tk, (at least as far as I can find out from the
documentation), can only be done by first removing the coordinate by
'''dchars''' and then insert the new one by '''insert''', or for geometric
items like lines and polygons using the '''coords''' command to obtain and
reset the coordinates, after having modified the coordinate list by
'''lreplace'''.

The most important issue here, I think, is performance. I believe that the
current way of moving a vertex can be slow in some scenarios.

The '''rchars''' canvas subcommand is proposed merely to conform with the
'''dchars''' and '''insert''' commands, which both operate on lines, polygons
and text items, hence '''rchars''' should do that as well.

~ Specification

Two canvas widget subcommands are proposed: '''imove''' and '''rchars'''. The
following subcommand is proposed to move a vertex of any canvas item:

 > ''canvas'' '''imove''' ''tagOrID index x y''

This subcommand will move the ''index''th coordinate of the items identified by
''tagOrID'' to the new position given by ''x'' and ''y''. The ''index'' value
will be processed according to normal canvas index rules (see the INDICES
section of the '''canvas''' manual). The subcommand will only work for line
and polygon items (or any third party items that set the TK_MOVABLE_POINTS
flag).

The following command provides a similar functionality, but conforms to the
model of the current '''insert''' and '''dchars''' subcommands.

 > ''canvas'' '''rchars''' ''tagOrID first last string''

This command will:

 for text items: replace the characters in the range ''first'' and ''last''
    (inclusive) with the characters in ''string''.

 for line and polygon items: replace the coordinates in the range ''first''
    and ''last'' (inclusive) with the coordinate list specified in ''string''
    (subject to the requirement that the coordinate list is an even number of
    floating point numbers).

In both cases, ''first'' and ''last'' will be processed according to the rules
in the INDICES section of the '''canvas''' manual page.

At the C level, the only change is the addition of a new flag,
'''TK_MOVABLE_POINTS'''. If this flag is set in the ''alwaysRedraw'' field of
the item type structure, it implies that the item supplies non-NULL
''dcharsProc'', ''indexProc'' and ''insertProc'' fields, and gives them
semantics equivalent to the line and polygon items (i.e. that the methods will
work with the coordinate list). Note that text items, despite having all the
required methods, do not set the flag because those methods work with
character indices.

~~ Notes

The '''imove''' subcommand is not strictly necessary as the '''rchars'''
subcommand can be used to obtain the same result. However, I believe that a
separate '''imove''' subcommand will be easier to understand for users than
the '''rchars''' subcommand, though the latter is still necessary as it allows
for more complex processing such as insertion or deletion of points.

~ Reference Implementation

See Patch 2157629[https://sourceforge.net/support/tracker.php?aid=2157629].

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
|
|
>

|

|

|

|
|
|
|

|

|
|
|

|

|

|

|
|
|
|
|
|

|

|

|
|

|
|
|
|

|
|

|

|
|
|

|

|

|
|

|

|

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93

# TIP 97: Moving Vertices of Canvas Items

	Author:		Agnar Renolen <[email protected]>
	Author:		Donal K. Fellows <[email protected]>
	State:		Final
	Type:		Project
	Tcl-Version:	8.6
	Vote:		Done
	Created:	07-Jun-2002
	Post-History:	
	Keywords:	Tk
-----

# Abstract

This TIP proposes a canvas subcommand \(or possibly two\) that allows for
replacing characters in text objects and to move individual vertices of line
and polygon items.

# Rationale

Interactive graphics programs often allow users to modify shapes of objects by
selecting and dragging the vertices. Moving one vertex of a canvas item in the
current version of Tk, \(at least as far as I can find out from the
documentation\), can only be done by first removing the coordinate by
**dchars** and then insert the new one by **insert**, or for geometric
items like lines and polygons using the **coords** command to obtain and
reset the coordinates, after having modified the coordinate list by
**lreplace**.

The most important issue here, I think, is performance. I believe that the
current way of moving a vertex can be slow in some scenarios.

The **rchars** canvas subcommand is proposed merely to conform with the
**dchars** and **insert** commands, which both operate on lines, polygons
and text items, hence **rchars** should do that as well.

# Specification

Two canvas widget subcommands are proposed: **imove** and **rchars**. The
following subcommand is proposed to move a vertex of any canvas item:

 > _canvas_ **imove** _tagOrID index x y_

This subcommand will move the _index_th coordinate of the items identified by
_tagOrID_ to the new position given by _x_ and _y_. The _index_ value
will be processed according to normal canvas index rules \(see the INDICES
section of the **canvas** manual\). The subcommand will only work for line
and polygon items \(or any third party items that set the TK\_MOVABLE\_POINTS
flag\).

The following command provides a similar functionality, but conforms to the
model of the current **insert** and **dchars** subcommands.

 > _canvas_ **rchars** _tagOrID first last string_

This command will:

 for text items: replace the characters in the range _first_ and _last_
    \(inclusive\) with the characters in _string_.

 for line and polygon items: replace the coordinates in the range _first_
    and _last_ \(inclusive\) with the coordinate list specified in _string_
    \(subject to the requirement that the coordinate list is an even number of
    floating point numbers\).

In both cases, _first_ and _last_ will be processed according to the rules
in the INDICES section of the **canvas** manual page.

At the C level, the only change is the addition of a new flag,
**TK\_MOVABLE\_POINTS**. If this flag is set in the _alwaysRedraw_ field of
the item type structure, it implies that the item supplies non-NULL
_dcharsProc_, _indexProc_ and _insertProc_ fields, and gives them
semantics equivalent to the line and polygon items \(i.e. that the methods will
work with the coordinate list\). Note that text items, despite having all the
required methods, do not set the flag because those methods work with
character indices.

## Notes

The **imove** subcommand is not strictly necessary as the **rchars**
subcommand can be used to obtain the same result. However, I believe that a
separate **imove** subcommand will be easier to understand for users than
the **rchars** subcommand, though the latter is still necessary as it allows
for more complex processing such as insertion or deletion of points.

# Reference Implementation

See Patch 2157629<https://sourceforge.net/support/tracker.php?aid=2157629> .

# Copyright

This document has been placed in the public domain.

Name change from tip/98.tip to tip/98.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89

TIP:		98
Title:		Adding Transparency Compositing Rules to Photo Images
Author:		Donal K. Fellows <[email protected]>
Created:	09-Jun-2001
Version:	$Revision: 1.5 $
Type:		Project
State:		Final
Vote:		Done
Tcl-Version:	8.4
Post-History:	

~ Abstract

This TIP adds compositing rules to Tk's photo images to give
programmers better control over what happens when two transparent
images are combined.  This TIP also allows for several frames of an
animated GIF file to be correctly displayed in an image even when the
transparent area is not constant.

~ Rationale

This is a TIP that is inspired by the tkchat application in Tcllib,
and in particular by the image used to represent the LOL smiley.  The
problem with this image is that its transparent area changes over
time, and this is caused by the fact that ''Tk_PhotoPutBlock()'' only
allows one way of compositing a block with an image; it behaves as if
the data being added was on a sheet of cel (the material used to make
hand-drawn animated cartoons) allowing for sophisticated layering
effects.  Unfortunately, for many applications (and animated GIF
images are definitely among these) this sophistication works against
us.  In a GIF image, transparency is treated not as extra information
that is added to each pixel's colour, but rather as a special colour;
a pixel cannot be, for example, red and transparent at the same time.
Support for this requires a different (and indeed simpler) kind of
compositing rule.  And of course, once you have such a facility in the
underlying C code, it should be exposed to scripts.

There are other kinds of compositing rule (for example, acting like
the added block is placed under the image, and many others) but this
TIP does not propose adding anything other than a way to chose between
the current behaviour and the behaviour required for supporting GIF
animation, in the belief that those two compositing rules are the ones
most useful to programmers, and that once the general facility is
there, the other rules will be relatively easy to add in the future.

~ Specification

This TIP adds a ''compositingRule'' argument to ''Tk_PhotoPutBlock''
(and ''Tk_PhotoPutZoomedBlock'') to allow selection between the
current behaviour (overlaying) and the other one I wish to support
(setting/overriding.)  The permitted values of this argument will be
''TK_PHOTO_COMPOSITE_OVERLAY'' (the currently implemented behaviour)
and ''TK_PHOTO_COMPOSITE_SET'' (the behaviour required to support GIF
file animation.)

At the Tcl level, when copying from one image to another (the other
photo image subcommands do not currently support transparency at all)
the ''photo get'' will take an extra option ''-compositingrule'' to
allow selection of the compositing rule.  The permitted values of this
option will be ''overlay'' and ''set'' by analogy with the values
described above.

~ Implementation Notes

Proposed implementation patch:
https://sourceforge.net/tracker/index.php?func=detail&aid=566765&group_id=12997&atid=312997

The proposed implementation of this TIP naturally includes
backward-compatibility functions that allow pre-compiled extensions to
continue to operate without recompilation (provided they use Stubs for
linking.)  Furthermore, extension authors can also define the symbol
''USE_COMPOSITELESS_PHOTO_PUT_BLOCK'' when compiling and have
source-level compatibility with the old functions.

The proposed implementation also creates ''TkSubtractRegion'' as a new
internal function; it is the analogue of ''XSubtractRegion'' as
''TkIntersectRegion'' is the analogue of ''XIntersectRegion''.  It
might be useful to other parts of the core that manipulate regions.

Both the PPM and GIF file readers use the ''set'' compositing rule,
PPM because the format does not support transparency (and ''set''
should at least theoretically be faster) and GIF because it is
required semantically.  Other image formats are not required to do
this, of course.  The ''$img put $data'' photo image subcommand uses
''set'' compositing because it does not support transparency.

~ Copyright

This document is placed in the public domain.

<
|
|
|
<
|
|
|
|
|
>

|

|

|

|
|
|
|

|

|
|

|

|
|
|
|
|
|
|

|
|
|

|

|

|

|
|
|

|
|
|

|
|
|

|
|

|

>

1
2
3

4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89

# TIP 98: Adding Transparency Compositing Rules to Photo Images
	Author:		Donal K. Fellows <[email protected]>
	Created:	09-Jun-2001

	Type:		Project
	State:		Final
	Vote:		Done
	Tcl-Version:	8.4
	Post-History:	
-----

# Abstract

This TIP adds compositing rules to Tk's photo images to give
programmers better control over what happens when two transparent
images are combined.  This TIP also allows for several frames of an
animated GIF file to be correctly displayed in an image even when the
transparent area is not constant.

# Rationale

This is a TIP that is inspired by the tkchat application in Tcllib,
and in particular by the image used to represent the LOL smiley.  The
problem with this image is that its transparent area changes over
time, and this is caused by the fact that _Tk\_PhotoPutBlock\(\)_ only
allows one way of compositing a block with an image; it behaves as if
the data being added was on a sheet of cel \(the material used to make
hand-drawn animated cartoons\) allowing for sophisticated layering
effects.  Unfortunately, for many applications \(and animated GIF
images are definitely among these\) this sophistication works against
us.  In a GIF image, transparency is treated not as extra information
that is added to each pixel's colour, but rather as a special colour;
a pixel cannot be, for example, red and transparent at the same time.
Support for this requires a different \(and indeed simpler\) kind of
compositing rule.  And of course, once you have such a facility in the
underlying C code, it should be exposed to scripts.

There are other kinds of compositing rule \(for example, acting like
the added block is placed under the image, and many others\) but this
TIP does not propose adding anything other than a way to chose between
the current behaviour and the behaviour required for supporting GIF
animation, in the belief that those two compositing rules are the ones
most useful to programmers, and that once the general facility is
there, the other rules will be relatively easy to add in the future.

# Specification

This TIP adds a _compositingRule_ argument to _Tk\_PhotoPutBlock_
\(and _Tk\_PhotoPutZoomedBlock_\) to allow selection between the
current behaviour \(overlaying\) and the other one I wish to support
\(setting/overriding.\)  The permitted values of this argument will be
_TK\_PHOTO\_COMPOSITE\_OVERLAY_ \(the currently implemented behaviour\)
and _TK\_PHOTO\_COMPOSITE\_SET_ \(the behaviour required to support GIF
file animation.\)

At the Tcl level, when copying from one image to another \(the other
photo image subcommands do not currently support transparency at all\)
the _photo get_ will take an extra option _-compositingrule_ to
allow selection of the compositing rule.  The permitted values of this
option will be _overlay_ and _set_ by analogy with the values
described above.

# Implementation Notes

Proposed implementation patch:
<https://sourceforge.net/tracker/index.php?func=detail&aid=566765&group\_id=12997&atid=312997>

The proposed implementation of this TIP naturally includes
backward-compatibility functions that allow pre-compiled extensions to
continue to operate without recompilation \(provided they use Stubs for
linking.\)  Furthermore, extension authors can also define the symbol
_USE\_COMPOSITELESS\_PHOTO\_PUT\_BLOCK_ when compiling and have
source-level compatibility with the old functions.

The proposed implementation also creates _TkSubtractRegion_ as a new
internal function; it is the analogue of _XSubtractRegion_ as
_TkIntersectRegion_ is the analogue of _XIntersectRegion_.  It
might be useful to other parts of the core that manipulate regions.

Both the PPM and GIF file readers use the _set_ compositing rule,
PPM because the format does not support transparency \(and _set_
should at least theoretically be faster\) and GIF because it is
required semantically.  Other image formats are not required to do
this, of course.  The _$img put $data_ photo image subcommand uses
_set_ compositing because it does not support transparency.

# Copyright

This document is placed in the public domain.

Name change from tip/99.tip to tip/99.md.

1
2
3
4
5
6
7
8
9
10

11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95

96
97

98
99
100
101
102
103

TIP:            99
Title:          Add 'file link' to Tcl
Version:        $Revision: 1.23 $
Author:         Vince Darley <[email protected]>
State:          Final
Type:           Project
Vote:           Done
Created:        11-Jun-2002
Post-History:   
Tcl-Version:    8.4

~ Abstract

Tcl can read links, but cannot create them.  This TIP proposes adding
a ''file link'' subcommand to allow cross-platform creation of links.

~ Proposal

Add a new subcommand with the following syntax:

|      file link ?-linktype? linkName ?target?

If only one argument is given, that argument is assumed to be
''linkName'', and this command returns the value of the link given by
''linkName'' (i.e. the name of the file it points to).  If
''linkName'' isn't a link or its value cannot be read (as, for
example, seems to be the case with hard links, which look just like
ordinary files), then an error is returned.

If 2 arguments are given, then these are assumed to be ''linkName''
and ''target''.  If ''linkName'' already exists, or if ''target''
doesn't exist, an error will be returned.  Otherwise, Tcl creates a
new link called ''linkName'' which points to the existing filesystem
object at ''target'', where the type of the link is platform-specific
(on Unix a symbolic link will be the default).  This is useful for the
case where the user wishes to create a link in a cross-platform way,
and doesn't care what type of link is created.

If the user wishes to make a link of a ''specific type only'', (and
signal an error if for some reason that is not possible), then the
optional ''linktype'' argument should be given.  Accepted values for
linktype are ''-symbolic'' and ''-hard''.

When creating links on filesystems that either do not support any
links, or do not support the specific type requested, an error message
will be returned (in particular Windows 95, 98 and ME do not support
any symbolic links at present, but Unix, MacOS and Windows NT/2000/XP
(on NTFS drives) do).

The TIP proposes implementing:

|           Unix,MacOSX      Win-NTFS           MacOS
|symbolic:      yes        directories-only      yes
|hard:       files-only     files-only           no

This also leaves the avenue open, in the future, for the addition of
other link types (e.g. Windows shortcuts) through additions to list of
acceptable ''linktype''s.  This TIP only proposes adding the above
options.

This means that a general ''[[file link $linkname $target]]'' should
always succeed on the above platforms (for both files and
directories), but uses of ''-hard'' or ''-symbolic'' could fail,
depending on the current platform, and the type of the path.

~ Rationale

There are many requests on comp.lang.tcl for this functionality (see
http://groups.google.com/groups?dq=&hl=en&lr=&ie=UTF8&oe=UTF8&threadm=4dd3bea3.0206100250.95eeb4e%40posting.google.com&rnum=1&prev=/&frame=on
for a recent thread), and if Tcl can read links (''file readlink'',
''file lstat''), it really ought to be able to write them.

Discussion has shown that both symbolic and hard links are desirable,
and that for cross-platform use a general-purpose ''file link'' which
creates ''something'' is useful.

Some users would prefer hard links to be the default, but on balance
most people commenting seemed to prefer symbolic links as default.
This has the added benefit that symbolic links will then be the
default on MacOS, Unix and Windows for everything, ''except'' files on
WinTcl (where hard-links are required).

~ Alternatives

There is no cross-platform alternative available.  TclX provides a
''link'' command for Unix only, and Unix platforms can also use ''exec
ln ?-s?'' command to achieve the same effect.

~ Reference Implementation

Tcl contains a ''testfilelink'' command in ''generic/tclTest.c'',
which is a partial implementation used by the test suite.  For a full
implementation of this TIP, including the ''-linktype'' switch, see:

''

http://sourceforge.net/tracker/index.php?func=detail&aid=562970&group_id=10894&atid=310894
''

which includes extensive docs and tests.

~ Copyright

This document has been placed in the public domain.

<
|
<
|
|
|
|
|
|
|
>

|

|

|

|

|
|
|

|

|
|

|
|
|

|
|
|
|

|

|

|
|
|

|
|

|
|
|

|

|
|
|
|

|
|

|
|

|

|
|

|

|

|

<
>
|
<
>

|

>

1

2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93

94
95

96
97
98
99
100
101
102
103

# TIP 99: Add 'file link' to Tcl

	Author:         Vince Darley <[email protected]>
	State:          Final
	Type:           Project
	Vote:           Done
	Created:        11-Jun-2002
	Post-History:   
	Tcl-Version:    8.4
-----

# Abstract

Tcl can read links, but cannot create them.  This TIP proposes adding
a _file link_ subcommand to allow cross-platform creation of links.

# Proposal

Add a new subcommand with the following syntax:

	      file link ?-linktype? linkName ?target?

If only one argument is given, that argument is assumed to be
_linkName_, and this command returns the value of the link given by
_linkName_ \(i.e. the name of the file it points to\).  If
_linkName_ isn't a link or its value cannot be read \(as, for
example, seems to be the case with hard links, which look just like
ordinary files\), then an error is returned.

If 2 arguments are given, then these are assumed to be _linkName_
and _target_.  If _linkName_ already exists, or if _target_
doesn't exist, an error will be returned.  Otherwise, Tcl creates a
new link called _linkName_ which points to the existing filesystem
object at _target_, where the type of the link is platform-specific
\(on Unix a symbolic link will be the default\).  This is useful for the
case where the user wishes to create a link in a cross-platform way,
and doesn't care what type of link is created.

If the user wishes to make a link of a _specific type only_, \(and
signal an error if for some reason that is not possible\), then the
optional _linktype_ argument should be given.  Accepted values for
linktype are _-symbolic_ and _-hard_.

When creating links on filesystems that either do not support any
links, or do not support the specific type requested, an error message
will be returned \(in particular Windows 95, 98 and ME do not support
any symbolic links at present, but Unix, MacOS and Windows NT/2000/XP
\(on NTFS drives\) do\).

The TIP proposes implementing:

	           Unix,MacOSX      Win-NTFS           MacOS
	symbolic:      yes        directories-only      yes
	hard:       files-only     files-only           no

This also leaves the avenue open, in the future, for the addition of
other link types \(e.g. Windows shortcuts\) through additions to list of
acceptable _linktype_s.  This TIP only proposes adding the above
options.

This means that a general _[file link $linkname $target]_ should
always succeed on the above platforms \(for both files and
directories\), but uses of _-hard_ or _-symbolic_ could fail,
depending on the current platform, and the type of the path.

# Rationale

There are many requests on comp.lang.tcl for this functionality \(see
<http://groups.google.com/groups?dq=&hl=en&lr=&ie=UTF8&oe=UTF8&threadm=4dd3bea3.0206100250.95eeb4e%40posting.google.com&rnum=1&prev=/&frame=on>
for a recent thread\), and if Tcl can read links \(_file readlink_,
_file lstat_\), it really ought to be able to write them.

Discussion has shown that both symbolic and hard links are desirable,
and that for cross-platform use a general-purpose _file link_ which
creates _something_ is useful.

Some users would prefer hard links to be the default, but on balance
most people commenting seemed to prefer symbolic links as default.
This has the added benefit that symbolic links will then be the
default on MacOS, Unix and Windows for everything, _except_ files on
WinTcl \(where hard-links are required\).

# Alternatives

There is no cross-platform alternative available.  TclX provides a
_link_ command for Unix only, and Unix platforms can also use _exec
ln ?-s?_ command to achieve the same effect.

# Reference Implementation

Tcl contains a _testfilelink_ command in _generic/tclTest.c_,
which is a partial implementation used by the test suite.  For a full
implementation of this TIP, including the _-linktype_ switch, see:

_
<http://sourceforge.net/tracker/index.php?func=detail&aid=562970&group\_id=10894&atid=310894>

_

which includes extensive docs and tests.

# Copyright

This document has been placed in the public domain.

~~1 2 3 4 5 6 7 8 9 10 11~~ 12 13 14 15 16 17 ~~18 19~~ 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 ~~35 36 37~~ 38 39 40 ~~41 42 43~~ 44 45 ~~46 47~~ 48 49 50 51 52 53 54 55 ~~56 57 58~~ 59 ~~60 61~~ 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 ~~80 81~~ 82 83 84 85 86 ~~87 88~~ 89 90 91 92 93 94 95 96 ~~97 98~~ 99 100 ~~101~~ 102 103 ~~104~~ 105 106 107 108 109 110 111 112 113 ~~114~~ 115 116 117 118 ~~119~~ 120 121 122 ~~123~~ 124 125 126 ~~127~~ 128 ~~129~~ 130 131 ~~132 133 134 135~~ 136 ~~137~~ 138 139 ~~140~~ 141 ~~142~~ 143 ~~144 145 146 147 148 149~~ 150 151 152 ~~153~~ 154 ~~155 156 157 158 159 160~~ 161 162 ~~163~~ 164 165 ~~166~~ 167 168 169 170 171 172 ~~173~~ 174 175 176 ~~177~~ 178 179 ~~180~~ 181 182 183 184 ~~185~~ 186 187 ~~188~~ 189 190 191 192 ~~193~~ 194 195 196 197 198 199 ~~200~~ 201 202 203 ~~204~~ 205 206 ~~207~~ 208 209 ~~210~~ 211 212 ~~213 214~~ 215 ~~216~~ 217 218 ~~219~~ 220 221 222 ~~223 224~~ 225 226 227 228 ~~229 230~~ 231 232 233 234 235 236 237 ~~238 239~~ 240 241 ~~242~~ 243 ~~244~~ 245 ~~246 247 248 249 250~~ 251 252 253 ~~254 255 256~~ 257 ~~258 259~~ 260 261 ~~262~~ 263 264 265 266 ~~267 268 269~~ 270 271 272 ~~273 274~~ 275 276 ~~277 278 279~~ 280 281 282 ~~283~~ 284 285 286 287 288 289 290 291 292 293 294 295 296 ~~297~~ 298 ~~299~~ 300 301 ~~302~~ 303 ~~304 305~~ 306 307 308 309 310 ~~311~~ 312 313 314 315 316 317 ~~318 319~~ 320 321 322 323 324 325 326 327 328 329 330 ~~331~~ 332 333 334 335 336 337 338	~~TIP: 173~~ T~~itle~~: Internationalisation and Refactoring of the 'clock' Command ~~Version: $Revision: 1.22 $~~ Author: Kevin Kenny <[email protected]> State: Final Type: Project Vote: Done Created: 11-Mar-2004 Post-History: Discussions-To: news:comp.lang.tcl Tcl-Version: 8.5 ~~~ Abstract~~ ~~The [~~[clock]~~] command provides Tcl's fundamental facilities for~~ computing with dates and times. It has served Tcl faithfully since 7.6, but the computing world has advanced significantly in the decade ~~that it has been in service. This TIP proposes a (nearly entirely compatible) reimplementation of [~~[clock]~~] that will allow for fewer~~ ambiguities on input, improved localisation, more portability, and less exposure of platform-dependent bugs. A significantly greater ~~fraction of [~~[clock]~~] shall be implemented in Tcl than it is today,~~ and the code shall be refactored to use the ensemble mechanism ~~introducted for Tcl 8.5 (see [~~112]~~).~~ ~~~ Rationale~~ There is an embarrassing number of open bugs and feature requests ~~against the [~~[clock]~~] command. As the maintainer of [~~[clock]~~], the~~ author of this TIP has also received a number of informal feature requests that are not logged at SourceForge. Unfortunately, many of the requested fixes and enhancements cannot be effectively addressed ~~with the current architecture of [~~[clock]~~].~~ ~~1. Several users have requested additional input formats to [[clock scan]], notably the full range of ISO8601 time formats (including formats based on week number and day-of-week); year and~~ day-of-year; Apache "web log" dates and times; numeric dates placing the month before the day; and localised names of months and days of the week. Unfortunately, these formats simply cannot be added in the current architecture of [[clock scan]]; in fact, there are several outstanding bugs in [[clock scan]] (for example, the parsing of numeric time zones east of Greenwich) that cannot be fixed without breaking something else. ~~> The fundamental issue is that [[clock scan]] is asked to process input with too many ambiguities. An input token such as ''2000'',~~ for example, may be interpreted as a year, a time of day, or a ~~number ("now + 2000 seconds"). ''1000'' may (perhaps) not be a~~ year, but could be a time of day, a number, or a time zone. Localisation would only make this problem worse. Without additional guidance, there is, even in theory, no way to determine ~~whether ''03-11-2004'' represents the third of November or the~~ eleventh of March. ~~> To solve this problem, a radical redesign of [[clock scan]] is required; the programmer ''must'' be allowed to specify an expected input format (or set of expected formats).~~ ~~> A side effect of such a redesign would be improved ease of maintenance. The current [[clock scan]] is a YACC-derived parser;~~ the build process, however, runs a script on the output of YACC to modify its memory management and alter its external symbol names to make it compatible with Tcl's conventions. This script is fragile; at present, it is known to work only with the version of YACC distributed with Solaris. ~~> There are a number of other issues with [[clock scan]] that could~~ be addressed at the same time with such a redesign. For instance, there is a known problem at present that an input string that specifies time and time zone but not date can return a time that is one day too early or late; this problem arises because the existing ~~parser presumes the current ''local'' date when parsing such a~~ string, rather than the current date in the given time zone. The problem is difficult to address because of the left-to-right nature ~~of the LALR(1) parser.~~ ~~2. A few enhancements have been requested to [[clock format]]; most~~ notably, proper localization on all platforms. In addition, the ~~documentation of [[clock format]] is at best approximate, because it depends on the ''strftime'' function in the Standard C Library.~~ This function differs among platforms, because the C standard, the Posix standard, and the Single Unix Specification have gone through evolution over time, and few platforms support all the features of the current generation of any of them. ~~> In addition, the Year 2038 bug looms large on the horizon. On most 32-bit platforms, ''time~~_t''~~ (used in the C library funtions) is a~~ 32-bit count of seconds from 1 January 1970; dates beyond 2038 cannot be represented in this format. ~~> The dependence on a complex library function such as ''strftime''~~ introduces obscure platform-dependent bugs. Several open bugs in ~~[[clock format]], for instance, fail only on HP-UX, or only on~~ Windows. ~~> Date formats have been requested (specifically, the Japanese civil calendar) that are beyond the capabilities of the Standard C~~ Library functions. ~~~~> [~~[clock format]] does not honor user preferences for date/time~~ format on Windows. ~~> All of these concerns seem to indicate that our current dependency~~ upon vendor-supplied date and time manipulation routines is ill advised. A single implementation that we control will make the behavior consistent among platforms, allow the localisation to follow Tcl's conventions, and let us lead rather than follow the vendor in fixing bugs. 3. Server applications frequently require support of multiple locales and multiple time zones within a single process, because they need to parse input and format output according to the client's ~~environment. The current [~~[clock]~~] facilities either do not~~ support localization at all, or else support a change to locale only by changing environment variables. This technique, once again, exposes bugs in the vendor libraries. It also introduces difficulties with thread safety; Tcl does not have a single ~~mechanism whereby the ~~''TZ''~~ and ~~''LC~~_TIME'' environment variables~~ are protected. 4. The only mechanism for performing calculations like "one month ~~after the current date" is [[clock scan]]. While this works well~~ in practice, using a parser to perform arithmetic seems somewhat perverse. ~~~ Specification~~ ~~The [~~[clock]~~] command shall be reimplemented as an ensemble [~~112]~~,~~ with most of the subcommands implemented in Tcl. A minimal set of the existing C code shall be refactored and placed inside a ''::tcl::clock'' namespace. The existing subcommands ''seconds'' and ''clicks'' shall be exposed. The existing ''scan'' shall be hidden inside the namespace. [[clock scan]~~] and [~~[clock format]] shall be reimplemented in Tcl. In addition, a new [[clock add]] command shall be added. ~~The syntax and semantics of the [[clock clicks]~~] and [~~[clock seconds]]~~ commands will remain unchanged. ~~~~~~~~~clock scan~~ ~~The [[clock scan]] command shall have the syntax:~~ > ~~'''~~clock scan~~''' ''~~string'' ?~~'''~~-base~~''' ''~~baseTime''? ?~~'''~~-format~~''' ''~~format''? ?~~'''~~-gmt~~''' ''~~boolean''? ?~~'''~~-locale~~''' ''~~name''? ?~~'''~~-timezone~~''' ''~~timeZone''? It accepts a character string representing a date and time and returns the time that the string represents, expressed as a count of seconds ~~from the Posix epoch (1 January 1970, 0000 UTC).~~ If a ~~'''~~-format~~'''~~ option is not supplied, the scan is a ''free format'' scan. The existing YACC parser for ''clock scan'' will be used to interpret the input string. ''This form of the command is explicitly deprecated'' because of the inherent ambiguities in interpreting the input string. The free-format version of [[clock scan]] does not accept ~~'''~~-locale~~'''~~ or ~~'''~~-timezone~~'''~~ options, since the legacy code does not support multiple locales or time zones. ~~If the ~~'''~~-format~~'''~~ options is supplied, it is interpreted as a~~ specification for the expected input form. If the given string matches the input form, it is converted to a count of seconds and ~~returned; otherwise, an error is thrown. See ''FORMATS'' below for a~~ discussion of the available format groups and their interpretation. Extraction of the date from the input string is guided by what fields are present in the format. The order of preference, from highest to lowest, is: ~~{seconds from epoch}, {starDate}: Date fields that specify both date~~ and time take highest precedence. If format groups for these fields appear multiple times, the rightmost takes precedence. ~~{Julian Day Number}: The Julian Day Number uniquely specifies a~~ calendar date. ~~{century, year, month, day of month}, {century, year, day of year}, {century, year, week of year, day of week}, {locale era, locale year, month, day of month}:~~ Formats with complete year are preferred to formats with a two-digit year. For a two digit year, the date range is constrained to lie between 1938 and 2037. ~~{year, month, day of month}, {year, day of year}, {year, week of year, day of week}, {year of locale era, month, day of month}:~~ Formats that specify the year are preferred to those that do not. ~~{month, day of month}, {day of year}, {week of year, day of week}:~~ Formats that specify a day within the year are preferred to those that specify merely the day of week or day of month. Formats that do not specify the year are presumed to designate the base year. ~~{day of month}, {day of week}: If none of the above rules apply, a~~ day of the month or day of the week standing alone is interpreted as belonging to the base month or week. None of the above: If no combination of fields that specifies a date is found, the base date is used. ~~The time of day returned by [[clock scan]] is determined by the~~ presence of fields in the format string, in the following order of preference. ~~{seconds from epoch, StarDate}: If either of these fields is present,~~ it uniquely determines date and time. ~~{am/pm indicator, hour am/pm, minute, second}, {hour, minute, second}:~~ Time with seconds is preferred to time without seconds. ~~{am/pm indicator, hour am/pm, minute}, {hour, minute}: Time can be~~ interpreted without the seconds. ~~{am/pm indicator, hour am/pm}, {hour}: Time can be expressed as an hour alone, ''e.g.'',~~ ~~\| clock scan "6 pm" -format "%I %p"~~ None of the above: If none of the above indicators is present, ~~''00:00:00'' (the start of the day) in the given time zone is used.~~ In all of the foregoing discussion, the 'base date', 'base month', 'base week', and 'base year' refer to the day, month, week or year ~~designated by the ~~'''~~-base~~'''~~ parameter, which is a count of seconds from the Posix epoch. If no ~~'''~~-base~~'''~~ parameter is supplied, the~~ current date is used as the base date. The year, month, week and day are obtained by interpreting the base date in the time zone specified by the date/time string. If the given format does not include a time zone, then the base time is interpreted in the default time zone; see ~~''TIME ZONES'' below for the way that the default time zone is determined, and the interpretation of the ~~'''~~-timezone~~'''~~ and ~~'''~~-gmt~~'''~~~~ options. The locale is used to determine the spelling of native language words such as the names of months, names of weekdays, am/pm indicators, and locale eras. It is also used in the interpretation of the format groups, '%X', '%x', and '%c'. In addition, the locale determines the date at which the calendar in use changes from the Julian calendar to ~~the Gregorian. If no ~~'''~~-locale~~'''~~ parameter is supplied, the default is to use the root locale. See ''LOCALISATION'' below for more~~ information. ~~~~~~~~~clock format~~ ~~The [[clock format]] command shall have the syntax:~~ > ~~'''~~clock format~~''' ''~~string'' ?~~'''~~-format~~''' ''~~format''? ?~~'''~~-gmt~~''' ''~~boolean''? ?~~'''~~-locale~~''' ''~~name''? ?~~'''~~-timezone~~''' ''~~timeZone''? It accepts a time, expressed in seconds from the Posix epoch of 1 January 1970, 00:00 UTC, and formats it according to the given format ~~string. See ''FORMATS'' below for a discussion of the available format codes. If no format string is supplied, a default format, {%a %b %d %H:%M:%S %Z %Y} is used.~~ ~~The ~~'''~~-timezone~~'''~~, ~~'''~~-gmt~~'''~~, and ~~'''~~-locale~~'''~~ options are interpreted as for [[clock scan]]. See ''TIME ZONES'' and ''LOCALISATION'' below~~ for how these options work. ~~~~~~~~~clock add~~ This command performs arithmetic on dates and times. The syntax is: ~~> ~~'''~~clock add~~''' ''~~time~~'' ?''~~count unit''?... ?~~'''~~-gmt~~''' ''~~boolean''? ?~~'''~~-timezone~~''' ''~~timeZone''? ?~~'''~~-locale~~''' ''~~name''?~~ It accepts a time, expressed in seconds from the Posix epoch of 1 January 1970, 00:00 UTC, and adds or subtracts units of time from it ~~according to the alternating ''count'' and ''unit'' parameters. Each ''count'' must be a wide integer; each ''unit'' is one of the~~ following: ~~\| years year months month \| weeks week days day \| hours hour minutes minute seconds second~~ The command works by converting the given time to a calendar day and time of day in the given locale and time zone. To that day and time ~~of day, it adds or subtracts the given offsets ''in sequence''. It~~ reconverts the resulting time to a count of seconds, again using the given locale and time zone, and returns that count of seconds. There are subtle differences in many cases between adding seemingly similar offsets. For instance, on the day before Daylight Saving Time goes into effect, adding 24 hours will give "the time 24 hours from the base time, irrespective of any clock change", while adding 1 day will give "the time it will be at the same time of day on the following day." Similarly, adding 1 month on 30 January will give either 28 or 29 February. There are equally strange effects when performing date/time arithmetic across the change between the Julian and Gregorian calendars. ~~The ~~'''~~-timezone~~'''~~, ~~'''~~-gmt~~'''~~, and ~~'''~~-locale~~'''~~ options are used to~~ control the interpretation of the count of seconds as a calendar day ~~and time. Refer to ''TIME ZONES'' and ''LOCALIZATION'' below for a~~ fuller discussion. ~~~~Formats~~ ~~The [[clock scan]~~] and [~~[clock format]] commands will be implemented in Tcl, without depending on the local ''strftime'' and ''strptime''~~ functions. For this reason, format groups will function identically on all platforms. The format groups will be interpreted as follows. %a: On output, receives the abbreviation for the day of the week in the given locale. On input, matches the name of the day of the ~~week (in the given locale) in either abbreviated or full form,~~ and may be used to determine the calendar date. %A: On output, receives the full name of the day of the week in the given locale. On input, treated identically with %a. %b: On output, receives the abbreviation for the name of the month in ~~the given locale. On input, matches the name of the month (in the given locale) in either abbreviated or full form, and may be~~ used to determine the calendar date. %B: On output, receives the full name of the month in the given locale. On input, treated identically with %b. %C: On output, receives the number of the century, in Indo-Arabic numerals. On input, matches one or two digits, and accepts the number of the century in Indo-Arabic numerals. May be used to determine the calendar date. %c: On output, produces a correct locale-dependent representation of ~~date and time of day. On input, matches whatever format ~~''%c''~~~~ produces in the given locale, and may be used to determine calendar date and time. %d: On output, produces the number of the day of the month, in Indo-Arabic numerals, with a leading zero. On input, matches one or two digits, accepts the day of the month, and may be used to determine calendar date.	< \| < \| \| \| \| \| \| \| \| > \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337	# TIP 173: Internationalisation and Refactoring of the 'clock' Command Author: Kevin Kenny <[email protected]> State: Final Type: Project Vote: Done Created: 11-Mar-2004 Post-History: Discussions-To: news:comp.lang.tcl Tcl-Version: 8.5 ----- # Abstract The [clock] command provides Tcl's fundamental facilities for computing with dates and times. It has served Tcl faithfully since 7.6, but the computing world has advanced significantly in the decade that it has been in service. This TIP proposes a \(nearly entirely compatible\) reimplementation of [clock] that will allow for fewer ambiguities on input, improved localisation, more portability, and less exposure of platform-dependent bugs. A significantly greater fraction of [clock] shall be implemented in Tcl than it is today, and the code shall be refactored to use the ensemble mechanism introducted for Tcl 8.5 \(see [[112]](112.md)\). # Rationale There is an embarrassing number of open bugs and feature requests against the [clock] command. As the maintainer of [clock], the author of this TIP has also received a number of informal feature requests that are not logged at SourceForge. Unfortunately, many of the requested fixes and enhancements cannot be effectively addressed with the current architecture of [clock]. 1. Several users have requested additional input formats to [clock scan], notably the full range of ISO8601 time formats \(including formats based on week number and day-of-week\); year and day-of-year; Apache "web log" dates and times; numeric dates placing the month before the day; and localised names of months and days of the week. Unfortunately, these formats simply cannot be added in the current architecture of [clock scan]; in fact, there are several outstanding bugs in [clock scan] \(for example, the parsing of numeric time zones east of Greenwich\) that cannot be fixed without breaking something else. > The fundamental issue is that [clock scan] is asked to process input with too many ambiguities. An input token such as _2000_, for example, may be interpreted as a year, a time of day, or a number \("now \+ 2000 seconds"\). _1000_ may \(perhaps\) not be a year, but could be a time of day, a number, or a time zone. Localisation would only make this problem worse. Without additional guidance, there is, even in theory, no way to determine whether _03-11-2004_ represents the third of November or the eleventh of March. > To solve this problem, a radical redesign of [clock scan] is required; the programmer _must_ be allowed to specify an expected input format \(or set of expected formats\). > A side effect of such a redesign would be improved ease of maintenance. The current [clock scan] is a YACC-derived parser; the build process, however, runs a script on the output of YACC to modify its memory management and alter its external symbol names to make it compatible with Tcl's conventions. This script is fragile; at present, it is known to work only with the version of YACC distributed with Solaris. > There are a number of other issues with [clock scan] that could be addressed at the same time with such a redesign. For instance, there is a known problem at present that an input string that specifies time and time zone but not date can return a time that is one day too early or late; this problem arises because the existing parser presumes the current _local_ date when parsing such a string, rather than the current date in the given time zone. The problem is difficult to address because of the left-to-right nature of the LALR\(1\) parser. 2. A few enhancements have been requested to [clock format]; most notably, proper localization on all platforms. In addition, the documentation of [clock format] is at best approximate, because it depends on the _strftime_ function in the Standard C Library. This function differs among platforms, because the C standard, the Posix standard, and the Single Unix Specification have gone through evolution over time, and few platforms support all the features of the current generation of any of them. > In addition, the Year 2038 bug looms large on the horizon. On most 32-bit platforms, _time\_t_ \(used in the C library funtions\) is a 32-bit count of seconds from 1 January 1970; dates beyond 2038 cannot be represented in this format. > The dependence on a complex library function such as _strftime_ introduces obscure platform-dependent bugs. Several open bugs in [clock format], for instance, fail only on HP-UX, or only on Windows. > Date formats have been requested \(specifically, the Japanese civil calendar\) that are beyond the capabilities of the Standard C Library functions. > [clock format] does not honor user preferences for date/time format on Windows. > All of these concerns seem to indicate that our current dependency upon vendor-supplied date and time manipulation routines is ill advised. A single implementation that we control will make the behavior consistent among platforms, allow the localisation to follow Tcl's conventions, and let us lead rather than follow the vendor in fixing bugs. 3. Server applications frequently require support of multiple locales and multiple time zones within a single process, because they need to parse input and format output according to the client's environment. The current [clock] facilities either do not support localization at all, or else support a change to locale only by changing environment variables. This technique, once again, exposes bugs in the vendor libraries. It also introduces difficulties with thread safety; Tcl does not have a single mechanism whereby the _TZ_ and _LC\_TIME_ environment variables are protected. 4. The only mechanism for performing calculations like "one month after the current date" is [clock scan]. While this works well in practice, using a parser to perform arithmetic seems somewhat perverse. # Specification The [clock] command shall be reimplemented as an ensemble [[112]](112.md), with most of the subcommands implemented in Tcl. A minimal set of the existing C code shall be refactored and placed inside a _::tcl::clock_ namespace. The existing subcommands _seconds_ and _clicks_ shall be exposed. The existing _scan_ shall be hidden inside the namespace. [clock scan] and [clock format] shall be reimplemented in Tcl. In addition, a new [clock add] command shall be added. The syntax and semantics of the [clock clicks] and [clock seconds] commands will remain unchanged. ### clock scan The [clock scan] command shall have the syntax: > clock scan _string_ ?-base _baseTime_? ?-format _format_? ?-gmt _boolean_? ?-locale _name_? ?-timezone _timeZone_? It accepts a character string representing a date and time and returns the time that the string represents, expressed as a count of seconds from the Posix epoch \(1 January 1970, 0000 UTC\). If a -format option is not supplied, the scan is a _free format_ scan. The existing YACC parser for _clock scan_ will be used to interpret the input string. _This form of the command is explicitly deprecated_ because of the inherent ambiguities in interpreting the input string. The free-format version of [clock scan] does not accept -locale or -timezone options, since the legacy code does not support multiple locales or time zones. If the -format options is supplied, it is interpreted as a specification for the expected input form. If the given string matches the input form, it is converted to a count of seconds and returned; otherwise, an error is thrown. See _FORMATS_ below for a discussion of the available format groups and their interpretation. Extraction of the date from the input string is guided by what fields are present in the format. The order of preference, from highest to lowest, is: \{seconds from epoch\}, \{starDate\}: Date fields that specify both date and time take highest precedence. If format groups for these fields appear multiple times, the rightmost takes precedence. \{Julian Day Number\}: The Julian Day Number uniquely specifies a calendar date. \{century, year, month, day of month\}, \{century, year, day of year\}, \{century, year, week of year, day of week\}, \{locale era, locale year, month, day of month\}: Formats with complete year are preferred to formats with a two-digit year. For a two digit year, the date range is constrained to lie between 1938 and 2037. \{year, month, day of month\}, \{year, day of year\}, \{year, week of year, day of week\}, \{year of locale era, month, day of month\}: Formats that specify the year are preferred to those that do not. \{month, day of month\}, \{day of year\}, \{week of year, day of week\}: Formats that specify a day within the year are preferred to those that specify merely the day of week or day of month. Formats that do not specify the year are presumed to designate the base year. \{day of month\}, \{day of week\}: If none of the above rules apply, a day of the month or day of the week standing alone is interpreted as belonging to the base month or week. None of the above: If no combination of fields that specifies a date is found, the base date is used. The time of day returned by [clock scan] is determined by the presence of fields in the format string, in the following order of preference. \{seconds from epoch, StarDate\}: If either of these fields is present, it uniquely determines date and time. \{am/pm indicator, hour am/pm, minute, second\}, \{hour, minute, second\}: Time with seconds is preferred to time without seconds. \{am/pm indicator, hour am/pm, minute\}, \{hour, minute\}: Time can be interpreted without the seconds. \{am/pm indicator, hour am/pm\}, \{hour\}: Time can be expressed as an hour alone, _e.g._, clock scan "6 pm" -format "%I %p" None of the above: If none of the above indicators is present, _00:00:00_ \(the start of the day\) in the given time zone is used. In all of the foregoing discussion, the 'base date', 'base month', 'base week', and 'base year' refer to the day, month, week or year designated by the -base parameter, which is a count of seconds from the Posix epoch. If no -base parameter is supplied, the current date is used as the base date. The year, month, week and day are obtained by interpreting the base date in the time zone specified by the date/time string. If the given format does not include a time zone, then the base time is interpreted in the default time zone; see _TIME ZONES_ below for the way that the default time zone is determined, and the interpretation of the -timezone and -gmt options. The locale is used to determine the spelling of native language words such as the names of months, names of weekdays, am/pm indicators, and locale eras. It is also used in the interpretation of the format groups, '%X', '%x', and '%c'. In addition, the locale determines the date at which the calendar in use changes from the Julian calendar to the Gregorian. If no -locale parameter is supplied, the default is to use the root locale. See _LOCALISATION_ below for more information. ### clock format The [clock format] command shall have the syntax: > clock format _string_ ?-format _format_? ?-gmt _boolean_? ?-locale _name_? ?-timezone _timeZone_? It accepts a time, expressed in seconds from the Posix epoch of 1 January 1970, 00:00 UTC, and formats it according to the given format string. See _FORMATS_ below for a discussion of the available format codes. If no format string is supplied, a default format, \{%a %b %d %H:%M:%S %Z %Y\} is used. The -timezone, -gmt, and -locale options are interpreted as for [clock scan]. See _TIME ZONES_ and _LOCALISATION_ below for how these options work. ### clock add This command performs arithmetic on dates and times. The syntax is: > clock add _time_ ?_count unit_?... ?-gmt _boolean_? ?-timezone _timeZone_? ?-locale _name_? It accepts a time, expressed in seconds from the Posix epoch of 1 January 1970, 00:00 UTC, and adds or subtracts units of time from it according to the alternating _count_ and _unit_ parameters. Each _count_ must be a wide integer; each _unit_ is one of the following: years year months month weeks week days day hours hour minutes minute seconds second The command works by converting the given time to a calendar day and time of day in the given locale and time zone. To that day and time of day, it adds or subtracts the given offsets _in sequence_. It reconverts the resulting time to a count of seconds, again using the given locale and time zone, and returns that count of seconds. There are subtle differences in many cases between adding seemingly similar offsets. For instance, on the day before Daylight Saving Time goes into effect, adding 24 hours will give "the time 24 hours from the base time, irrespective of any clock change", while adding 1 day will give "the time it will be at the same time of day on the following day." Similarly, adding 1 month on 30 January will give either 28 or 29 February. There are equally strange effects when performing date/time arithmetic across the change between the Julian and Gregorian calendars. The -timezone, -gmt, and -locale options are used to control the interpretation of the count of seconds as a calendar day and time. Refer to _TIME ZONES_ and _LOCALIZATION_ below for a fuller discussion. ## Formats The [clock scan] and [clock format] commands will be implemented in Tcl, without depending on the local _strftime_ and _strptime_ functions. For this reason, format groups will function identically on all platforms. The format groups will be interpreted as follows. %a: On output, receives the abbreviation for the day of the week in the given locale. On input, matches the name of the day of the week \(in the given locale\) in either abbreviated or full form, and may be used to determine the calendar date. %A: On output, receives the full name of the day of the week in the given locale. On input, treated identically with %a. %b: On output, receives the abbreviation for the name of the month in the given locale. On input, matches the name of the month \(in the given locale\) in either abbreviated or full form, and may be used to determine the calendar date. %B: On output, receives the full name of the month in the given locale. On input, treated identically with %b. %C: On output, receives the number of the century, in Indo-Arabic numerals. On input, matches one or two digits, and accepts the number of the century in Indo-Arabic numerals. May be used to determine the calendar date. %c: On output, produces a correct locale-dependent representation of date and time of day. On input, matches whatever format _%c_ produces in the given locale, and may be used to determine calendar date and time. %d: On output, produces the number of the day of the month, in Indo-Arabic numerals, with a leading zero. On input, matches one or two digits, accepts the day of the month, and may be used to determine calendar date.
︙			︙
359 360 361 362 363 364 365 ~~366~~ 367 368 369 370 371 372 373	and may be used to determine calendar date. %EX: On output, produces the time of day in the locale's alternative representation. On input, accepts whatever %EX produces and may be used to determine time of day. %Ey: On output, produces the number of the current year relative to ~~the locale's current era ~~''%EC''~~, expressed in the locale's~~ alternative numerals. On input, accepts the number of the year relative to the current era in the locale's alternative numerics, and may be used to determine calendar date. %EY: On output, produces an unambiguous representation of the current year in the locale's alternative calendar and alternative numerals. This group is often synonymous with %EC%Ey. On	\|	358 359 360 361 362 363 364 365 366 367 368 369 370 371 372	and may be used to determine calendar date. %EX: On output, produces the time of day in the locale's alternative representation. On input, accepts whatever %EX produces and may be used to determine time of day. %Ey: On output, produces the number of the current year relative to the locale's current era _%EC_, expressed in the locale's alternative numerals. On input, accepts the number of the year relative to the current era in the locale's alternative numerics, and may be used to determine calendar date. %EY: On output, produces an unambiguous representation of the current year in the locale's alternative calendar and alternative numerals. This group is often synonymous with %EC%Ey. On
︙			︙
383 384 385 386 387 388 389 ~~390~~ 391 392 393 ~~394~~ 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 ~~410~~ 411 412 413 ~~414~~ 415 416 ~~417 418~~ 419 420 421 ~~422 423~~ 424 425 426 427 428 429 430 431 432 433 434 435 436 437 ~~438~~ 439 440 441 442 ~~443~~ 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 ~~463~~ 464 465 466 467 468 469 470 471 ~~472~~ 473 474 475 ~~476 477~~ 478 479 480 ~~481~~ 482 ~~483 484~~ 485 486 487 ~~488~~ 489 ~~490~~ 491 492 493 494 495 496 497	with the ISO8601 week number. On input, accepts a four-digit year number, and may be used to determine calendar date if the %V format group is also present. %h: Synonymous with %b. %H: On output, produces the two-digit hour of the day on a 24-hour ~~clock (00-24). On input, matches two digits, and may be used to~~ determine time of day. %I: On output, produces the two-digit hour of the day on a 12-hour ~~clock (12-11). On input, matches two digits, and may be used to~~ determine time of day. %j: On output, produces the three-digit number of the day of the year. On input, matches three digits, and may be used to determine the day of the year. %J: On output, produces the number of the Julian Day Number beginning at noon of the given date. The Julian Day Number is a representation popular with astronomers; it is a count of days in which Day 1 is 1 January, 4713 B.C.E., on the proleptic Julian calendar; in this system, 1 January 2000 is Julian Day 2451545. On input, matches any string of digits and interprets it as a Julian Day; may be used to determine calendar date. %k: On output, produces the number of the hour on a 24-hour clock ~~~~(0-24~~) without a leading zero. On input, matches one or two~~ digits and may be used to determine time of day. %l: On output, produces the number of the hour on a 12-hour clock ~~(12-11) without a leading zero. On input, matches one or two~~ digits and may be used to determine time of day. ~~%m: On output, produces the number of the month (01-12), with exactly two digits (using a leading zero if necessary). On input,~~ matches exactly two digits and may be used to determine calendar date. ~~%M: On output, produces the number of the minute of the hour (00-59) with exactly two digits (using a leading zero if necessary). On~~ input, matches exactly two digits and may be used to determine time of day. %N: On output, produces the number of the month, with no leading zero. On input, matches one or two digits, and may be used to determine time of day. %Od, %Oe, %OH, %OI, %Ok, %Ol, %Om, %OM, %OS, %Ou, %ow, %Oy: All of these format groups are synonymous with their counterparts without the 'O', except that the string is produced and parsed in the locale-dependent alternative numerals. %p: On output, produces the indicator for 'a.m.', or 'p.m.' appropriate for the given locale, converted to upper case. On ~~input, accepts whatever %p produces (in upper or lower case) and~~ may be used to determine time of day. %P: On output, produces the indicator for 'a.m.', or 'p.m.' appropriate for the given locale. On input, accepts whatever %p ~~produces (in upper or lower case) and may be used to determine~~ time of day. %Q: On output, produces a StarDate. On input, accepts a StarDate and may be used to determine calendar date and time of day. %r: On output, produces a locale-dependent time of day representation on a 12-hour clock. On input, accepts whatever %r produces and may be used to determine time of day. %R: On output, produces a locale-dependent time of day representation on a 24-hour clock. On input, accepts whatever %R produces and may be used to determine time of day. %s: On output, produces a string of digits representing the count of seconds since 1 January 1970, 00:00 UTC. On input, accepts a string of digits and accepts it as such a count; may be used to determine date and time of day. %S: On output, produces a two-digit number of the second of the ~~minute (00-59). On input, accepts two digits. May be used to~~ determine time of day. %t: On output, produces a TAB character. On input, matches a TAB character. %T: Synonymous with %H:%M:%S. %u: On output, produces the number of the day of the week ~~(1-Monday,7-Sunday). On input, accepts a single digit. May be~~ used to determine calendar day. %U: On output, produces the ordinal number of the week of the year ~~(00-53). The first Sunday of the year is the first day of week 01. On input accepts two digits ''which are otherwise ignored.''~~ This format group is never used in determining an input date. %V: On output, produces the number of the ISO8601 week as a two digit ~~number (01-53). Week 01 is the week containing January 4; or the~~ first week of the year containing at least 4 days; or the week ~~containing the first Thursday of the year (the three statements are equivalent). Each week begins on a Monday. On input, accepts~~ the ISO8601 week number, and may be used to determine the calendar day. ~~%w: On output, produces a week number (00-53) within the year; week~~ 01 begins on the first Monday of the year. On input, accepts two ~~digits, ''which are otherwise ignored.'' This format group is~~ never used in determining an input date. %x: On output, produces the date in a locale-dependent representation. On input, accepts whatever %x produces and may be used to determine calendar date. %X: On output, produces the time of day in a locale-dependent	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496	with the ISO8601 week number. On input, accepts a four-digit year number, and may be used to determine calendar date if the %V format group is also present. %h: Synonymous with %b. %H: On output, produces the two-digit hour of the day on a 24-hour clock \(00-24\). On input, matches two digits, and may be used to determine time of day. %I: On output, produces the two-digit hour of the day on a 12-hour clock \(12-11\). On input, matches two digits, and may be used to determine time of day. %j: On output, produces the three-digit number of the day of the year. On input, matches three digits, and may be used to determine the day of the year. %J: On output, produces the number of the Julian Day Number beginning at noon of the given date. The Julian Day Number is a representation popular with astronomers; it is a count of days in which Day 1 is 1 January, 4713 B.C.E., on the proleptic Julian calendar; in this system, 1 January 2000 is Julian Day 2451545. On input, matches any string of digits and interprets it as a Julian Day; may be used to determine calendar date. %k: On output, produces the number of the hour on a 24-hour clock \(0-24\) without a leading zero. On input, matches one or two digits and may be used to determine time of day. %l: On output, produces the number of the hour on a 12-hour clock \(12-11\) without a leading zero. On input, matches one or two digits and may be used to determine time of day. %m: On output, produces the number of the month \(01-12\), with exactly two digits \(using a leading zero if necessary\). On input, matches exactly two digits and may be used to determine calendar date. %M: On output, produces the number of the minute of the hour \(00-59\) with exactly two digits \(using a leading zero if necessary\). On input, matches exactly two digits and may be used to determine time of day. %N: On output, produces the number of the month, with no leading zero. On input, matches one or two digits, and may be used to determine time of day. %Od, %Oe, %OH, %OI, %Ok, %Ol, %Om, %OM, %OS, %Ou, %ow, %Oy: All of these format groups are synonymous with their counterparts without the 'O', except that the string is produced and parsed in the locale-dependent alternative numerals. %p: On output, produces the indicator for 'a.m.', or 'p.m.' appropriate for the given locale, converted to upper case. On input, accepts whatever %p produces \(in upper or lower case\) and may be used to determine time of day. %P: On output, produces the indicator for 'a.m.', or 'p.m.' appropriate for the given locale. On input, accepts whatever %p produces \(in upper or lower case\) and may be used to determine time of day. %Q: On output, produces a StarDate. On input, accepts a StarDate and may be used to determine calendar date and time of day. %r: On output, produces a locale-dependent time of day representation on a 12-hour clock. On input, accepts whatever %r produces and may be used to determine time of day. %R: On output, produces a locale-dependent time of day representation on a 24-hour clock. On input, accepts whatever %R produces and may be used to determine time of day. %s: On output, produces a string of digits representing the count of seconds since 1 January 1970, 00:00 UTC. On input, accepts a string of digits and accepts it as such a count; may be used to determine date and time of day. %S: On output, produces a two-digit number of the second of the minute \(00-59\). On input, accepts two digits. May be used to determine time of day. %t: On output, produces a TAB character. On input, matches a TAB character. %T: Synonymous with %H:%M:%S. %u: On output, produces the number of the day of the week \(1-Monday,7-Sunday\). On input, accepts a single digit. May be used to determine calendar day. %U: On output, produces the ordinal number of the week of the year \(00-53\). The first Sunday of the year is the first day of week 01. On input accepts two digits _which are otherwise ignored._ This format group is never used in determining an input date. %V: On output, produces the number of the ISO8601 week as a two digit number \(01-53\). Week 01 is the week containing January 4; or the first week of the year containing at least 4 days; or the week containing the first Thursday of the year \(the three statements are equivalent\). Each week begins on a Monday. On input, accepts the ISO8601 week number, and may be used to determine the calendar day. %w: On output, produces a week number \(00-53\) within the year; week 01 begins on the first Monday of the year. On input, accepts two digits, _which are otherwise ignored._ This format group is never used in determining an input date. %x: On output, produces the date in a locale-dependent representation. On input, accepts whatever %x produces and may be used to determine calendar date. %X: On output, produces the time of day in a locale-dependent
︙			︙
505 506 507 508 509 510 511 ~~512 513~~ 514 515 516 517 ~~518 519 520 521 522~~ 523 524 525 526 527 528 ~~529~~ 530 ~~531~~ 532 533 ~~534~~ 535 536 ~~537 538~~ 539 540 541 ~~542 543 544 545 546 547 548 549 550~~ 551 ~~552~~ 553 554 555 556 ~~557 558 559 560 561~~ 562 ~~563~~ 564 ~~565~~ 566 567 568 569 570 571 572 573 574 575 ~~576~~ 577 578 579 ~~580~~ 581 ~~582~~ 583 584 ~~585~~ 586 ~~587~~ 588 589 ~~590 591~~ 592 ~~593~~ 594 ~~595~~ 596 ~~597~~ 598 599 600 601 602 603 ~~604 605 606 607~~ 608 609 610 611 ~~612~~ 613 ~~614 615~~ 616 617 618 ~~619~~ 620 ~~621~~ 622 623 ~~624~~ 625 ~~626 627~~ 628 629 ~~630~~ 631 ~~632 633~~ 634 ~~635~~ 636 ~~637~~ 638 639 ~~640 641 642~~ 643 644 ~~645~~ 646 ~~647~~ 648 649 ~~650 651 652~~ 653 654 655 ~~656~~ 657 658 ~~659 660~~ 661 662 ~~663~~ 664 ~~665~~ 666 667 ~~668~~ 669 ~~670~~ 671 672 673 ~~674~~ 675 ~~676 677~~ 678 679 680 681 682 ~~683~~ 684 685 686 687 ~~688 689~~ 690 691 692 ~~693~~ 694 ~~695~~ 696 697 ~~698~~ 699 700 701 ~~702~~ 703 704 705 706 707 ~~708~~ 709 710 711 712 713 ~~714~~ 715 ~~716~~ 717 ~~718 719~~ 720 ~~721 722~~ 723 ~~724 725~~ 726 727 ~~728~~ 729 730 ~~731 732~~ 733 ~~734 735~~ 736 ~~737 738~~ 739 ~~740 741~~ 742 ~~743 744~~ 745 ~~746 747~~ 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 ~~844 845 846~~ 847 848 849 ~~850~~ 851 ~~852~~ 853 ~~854~~ 855 856 857 858 ~~859 860~~ 861 862 863 864 865 866 ~~867 868~~ 869 870 871 872 873 874 875 876 ~~877 878~~ 879 880 881 882 883 884 885 886 ~~887~~ 888 ~~889~~ 890 891 892 893 894 895 ~~896~~ 897 ~~898~~ 899 900 ~~901 902 903 904 905 906~~ 907 908 909 910 ~~911 912~~ 913 914 915 916 917 ~~918~~ 919 920 921 ~~922 923 924~~ 925 926 927 ~~928~~ 929 930 931 932 933 ~~934 935~~ 936 937 938 939 940 941 942 ~~943~~ 944 ~~945~~ 946 947 948 ~~949~~ 950 ~~951~~ 952 953 954 955 956 957	%Y: On output, produces the four-digit calendar year. On input, accepts four digits and may be used to determine calendar date. Note that %Y does not yield a year appropriate for use with the ISO8601 week number %V; programs should use %G for that purpose. %z: On output, produces the current time zone, expressed in hours and ~~minutes east (+hhmm) or west (-hhmm) of Greenwich. On input, accepts a time zone specifier (see ''TIME ZONES'' below) that~~ will be used to determine the time zone. %Z: On output, produces the current time zone's name, possibly translated to the given locale. On input, accepts a time zone specifier (see ''TIME ZONES'' below) that will be used to determine the time zone. ''This option should, in general, be used on input only when parsing RFC822 dates.'' Other uses are fraught with ambiguity; for instance, the string ~~''BST''~~ may represent ''British Summer Time'' or ''Brazilian Standard Time''. It is recommended that date/time strings for use by computers use numeric time zones instead. %%: On output, produces a literal '%' charater. On input, matches a literal '%' character. ~~%+: Synonymous with "%a %b %e %H:%M:%S %Z %Y".~~ ~~~~Time Zones~~ There are several ways that a time zone may be specified for use with ~~[[clock scan]~~], [~~[clock format]~~] and [~~[clock add]]. In order of preference:~~ * The time zone may appear in the input string matched by a %z or %Z ~~format group in [[clock scan]]. These format groups match time zones in the forms +hhmm, +hhmmss, -hhmm, -hhmmss, and alphanumeric~~ strings. The numeric representations are self explanatory; an alphanumeric string must be the one of: \| gmt ut utc bst wet wat at \| nft nst ndt ast adt est edt \| cst cdt mst mdt pst pdt yst \| ydt hst hdt cat ahst nt idlw \| cet cest met mewt mest swt sst \| eet eest bt it zp4 zp5 ist \| zp6 wast wadt jt cct jst cast \| cadt east eadt gst nzt nzst nzdt \| idle ~~> or a single letter other than J. Generally speaking, numeric time~~ zones should be preferred for communication among computers; the alphanumeric time zones are provided primarily for the parsing of legacy RFC822 time stamps. * The time zone may appear in the ~~'''~~-timezone~~'''~~ argument to the [~~[clock]~~] command, or may be implied by the presence of ~~'''~~-gmt 1~~'''~~. It is an error to use ~~'''~~-timezone~~'''~~ and ~~'''~~-gmt~~'''~~ in the same call. The ~~'''~~-gmt 1~~'''~~ option may be regarded as an obsolete synonym of ~~'''~~-timezone :UTC~~'''~~. * The time zone may appear in the environment variable, ''TCL~~_TZ''~~. * The time zone may appear in the environment variable, ~~''TZ''~~. * Failing all of these, on Windows systems, the time zone will be obtained from the Registry. * As a last resort, the time zone is set to ':localtime'. Once the time zone is obtained by one of these means, it is interpreted as follows: ":localtime": This specifier requests that the C library functions ~~''localtime~~()''~~ and ''mktime~~()''~~ be used whenever converting times~~ between local and Greenwich. It is generally used as a last resort if the time zone can be determined in no other way. ~~"+hhmm", "+hhmmss", "-hhmm", "-hhmmss": These specifiers give the~~ time zone explicitly in terms of hours, minutes and seconds east ~~(+) or west (-) of Greenwich.~~ ":filename": The given file name is interpreted as a path name ~~relative to [[info library]]/tzdata, and the specified file is~~ loaded as a Tcl script. The script is expected to set the ~~'':filename'' element in the ''tzdata'' array to a list of~~ transitions. Each transition is a four-element list comprising: ~~> * the time at which the transition takes place, expressed in seconds from the Posix Epoch (1 January 1970, 00:00 UTC)~~ ~~> * the offset (in seconds east of Greenwich) to apply.~~ ~~> * an indicator (0=Standard Time, 1=Daylight Saving Time)~~ ~~> * the name to use when displaying the given time zone in the root~~ locale. > The first transition is expected to take place at time -9223372036854775808, the smallest value of a wide integer. Any string recognizable as a Posix time zone specifier: A time zone may be specified in Posix syntax (see [http://www.opengroup.org/onlinepubs/007904975/basedefs/xbd_chap08.html]), for example ''EST5EDT'' or ~~''EST~~+05:00EDT+04:00,M4.1.0/01:00,M10.5.0/02:00''. Any other string is processed by prefixing a colon and attempting to load the given file, as shown above. ~~~~Localisation~~ ~~The [~~[clock]~~] command is localised by a set of message catalogs located in [[file join [[info library]] clock msgs]] and loaded into~~ the namespace, ::tcl::clock. The possible strings to be translated include: ~~AM: The string that identifies ''ante meridiem'' times when~~ expressing a time of day in the given locale. This string has ~~the value, ~~{am~~} in the root locale.~~ BCE: The string that identifies dates before the Common Era in the ~~given locale. This string has the value, {B.C.E.} in the root~~ locale. Those localising this string should be aware that, ~~depending on local culture, a name such as "B.C." (before Christ) may be offensive.~~ CE: The string that identifies dates of the Common Era in the given ~~locale. This string has the value, ~~{C.E.~~} in the root locale.~~ Those localising this string should be aware that, depending on ~~local culture, a name such as "A.D." (Latin, ''anno Domini'', "in the year of Our Lord") may be offensive.~~ ~~DATE_FORMAT: The format specifier for calendar dates in the given~~ locale. In the root locale, %m/%d/%Y is used for compatibility ~~with earlier versions of the [~~[clock]~~] command, even though~~ %Y-%m-%d would probably be preferable. DATE~~_TIME~~_FORMAT: The format specifier for combined date and time in the given locale. In the root locale, {%a %b %e %H:%M:%S %Y} is used for compatibility with earlier versions of the [~~[clock]~~] command, even though %Y-%m-%dT%H:%M:%S would be preferable. ~~DAYS~~_OF~~_WEEK_ABBREV: Abbreviations of the days of the week in the~~ given locale. In the root locale, this string has the value, ~~{Sun Mon Tue Wed Thu Fri Sat}. In any locale, this string~~ is expected to represent a valid Tcl list. ~~DAYS~~_OF~~_WEEK_FULL: Full names of the days of the week in the given locale. In the root locale, this string has the value, {Sunday Monday Tuesday Wednesday Thursday Friday Saturday}.~~ In any locale, this string is expected to represent a valid Tcl list. ~~GREGORIAN_CHANGE_DATE: The date on which the change from the Julian~~ to the Gregorian calendar takes place, expressed as a Julian Day Number. In the root locale, this string has the value, ~~{2299161}, corresponding to 15 October 1582 New Style. In the 'en' locale, this value is {2361222}, 14 September 1752 New~~ Style. ~~LOCALE~~_DATE~~_FORMAT: The format to use when formatting dates in the~~ locale's alternative calendar. In the root locale, ~~LOCALE~~_DATE~~_FORMAT is ~~''%x''~~, which causes formatting without~~ alternative numerals. ~~LOCALE_DATE~~_TIME~~_FORMAT: The format to use when formatting date/time~~ strings in the locale's alternative calendar. In the root locale, ~~LOCALE_DATE~~_TIME~~_FORMAT is ''%Ex %EX'', which causes concatenation~~ of the locale's format for date, a space character, and the locale's format for time. ~~LOCALE_ERAS: In a locale where a calendar with multiple eras is in~~ use, gives a list of triples. The first element of each triple ~~is the time (in seconds from the Posix epoch of 1 January 1970, 00:00 UTC) at which the era begins; the second is the name of the~~ era, and the third is a constant offset to be subtracted from the Gregorian year to give the year of the era. In any locale, this string is expected to represent a valid Tcl list. ~~LOCALE_NUMERALS: In a locale where alternative numerals may be used,~~ gives a list containing the numerals that represent the numbers from zero to ninety-nine. Note that these numerals are the ones typically used on calendars, not the ones that represent currencies or quantities. For instance, in a Han locale, the ~~number twenty-one is represented by \~~u5eff~~\u4e00, not by \u4e8c\~~u5341~~\u4e00.~~ In any locale, this string is expected to represent a valid Tcl list. ~~LOCALE~~_TIME~~_FORMAT: The time format to use when formatting a time of~~ day using a locale's alternative numerals. In the root locale, ~~this string is ~~''%X''~~, which causes formatting without alternative~~ numerals. ~~LOCALE~~_YEAR~~_FORMAT: The time format to use when formatting a year in~~ the locale's alternative calendar. In the root locale, this string is %Y. ~~MONTHS_ABBREV: Abbreviated names of the months in the given locale.~~ In the root locale, consists of three-letter abbreviations for the English months: Jan-Dec. In any locale, this string is expected to represent a valid Tcl list. ~~MONTHS_FULL: Full names of the months in the given locale. In the~~ root locale, consists of the names of the English months in order from 'January' to 'December'. In any locale, this string is expected to represent a valid Tcl list. ~~PM: The string that identifies ''post meridiem'' times when~~ expressing a time of day in the given locale. This string has ~~the value, ~~{pm~~} in the root locale.~~ ~~TIME_FORMAT: String that specifies the default time format in the given locale. In the root locale, this string is {%H:%M:%S}~~ ~~TIME_FORMAT_12: String that formats time on a 12-hour clock in the given locale. In the root locale, this string is {%I:%M:%S %p}.~~ ~~TIME_FORMAT_24: String that formats time on a 24-hour clock in the given locale. In the root locale, this string is {%H:%M}.~~ There is a defined order for substitution of locale strings, which ~~constrains the format groups that can appear in the ''_FORMAT'' strings.~~ Specifically: * DATE~~_TIME~~_FORMAT and LOCALE_DATE~~_TIME~~_FORMAT may contain any format groups other than ~~''%c''~~ and ~~''%Ec''~~. * LOCALE~~_DATE~~_FORMAT and LOCALE~~_TIME~~_FORMAT may not contain ''%c~~'', ''~~%Ec~~'', ''~~%Ex'', or ~~''%EX''~~. * DATE_FORMAT and TIME_FORMAT may not contain ''%c~~'', ''~~%Ec'', ''%x~~'', ''~~%Ex'', ~~''%X''~~, or ~~''%EX''~~. * TIME_FORMAT_12 and TIME_FORMAT_24 may not contain ''%c~~'', ''~~%Ec'', ''%r'', ''%R'', ~~''%T'', ''%x'', ''~~%Ex~~'', ''%X''~~, or ~~''%EX''~~. * LOCALE~~_YEAR~~_FORMAT may not contain ''%c~~'', ''~~%Ec'', ''%r'', ''%R'', ~~''%T'', ''%x'', ''~~%Ex~~'', ''%X'', ''%EX''~~, or ~~''%Ey''~~. ~~''Example.'' The following file is "ja.msg", which localises the [~~[clock]~~] command to a Japanese locale.~~ \|namespace eval ::tcl::clock { \| ::msgcat::mcset ja DAYS_OF_WEEK_ABBREV [list \ \| "\u65e5"\ \| "\u6708"\ \| "\u706b"\ \| "\u6c34"\ \| "\u6728"\ \| "\u91d1"\ \| "\u571f"] \| ::msgcat::mcset ja DAYS_OF_WEEK_FULL [list \ \| "\u65e5\u66dc\u65e5"\ \| "\u6708\u66dc\u65e5"\ \| "\u706b\u66dc\u65e5"\ \| "\u6c34\u66dc\u65e5"\ \| "\u6728\u66dc\u65e5"\ \| "\u91d1\u66dc\u65e5"\ \| "\u571f\u66dc\u65e5"] \| ::msgcat::mcset ja MONTHS_ABBREV [list \ \| "1"\ \| "2"\ \| "3"\ \| "4"\ \| "5"\ \| "6"\ \| "7"\ \| "8"\ \| "9"\ \| "10"\ \| "11"\ \| "12"\ \| ""] \| ::msgcat::mcset ja MONTHS_FULL [list \ \| "1\u6708"\ \| "2\u6708"\ \| "3\u6708"\ \| "4\u6708"\ \| "5\u6708"\ \| "6\u6708"\ \| "7\u6708"\ \| "8\u6708"\ \| "9\u6708"\ \| "10\u6708"\ \| "11\u6708"\ \| "12\u6708"\ \| ""] \| ::msgcat::mcset ja BCE "\u7d00\u5143\u524d" \| ::msgcat::mcset ja CE "\u897f\u66a6" \| ::msgcat::mcset ja AM "\u5348\u524d" \| ::msgcat::mcset ja PM "\u5348\u5f8c" \| ::msgcat::mcset ja DATE_FORMAT "%Y/%m/%d" \| ::msgcat::mcset ja TIME_FORMAT "%k:%M:%S" \| ::msgcat::mcset ja DATE_TIME_FORMAT "%Y/%m/%d %k:%M:%S %z" \| ::msgcat::mcset ja LOCALE_NUMERALS "\u3007 \u4e00 \u4e8c \u4e09 \u56db \| \u4e94 \u516d \u4e03 \u516b \u4e5d \u5341 \u5341\u4e00 \u5341\u4e8c \| \u5341\u4e09 \u5341\u56db \u5341\u4e94 \u5341\u516d \u5341\u4e03 \| \u5341\u516b \u5341\u4e5d \u4e8c\u5341 \u5eff\u4e00 \u5eff\u4e8c \| \u5eff\u4e09 \u5eff\u56db \u5eff\u4e94 \u5eff\u516d \u5eff\u4e03 \| \u5eff\u516b \u5eff\u4e5d \u4e09\u5341 \u5345\u4e00 \u5345\u4e8c \| \u5345\u4e09 \u5345\u56db \u5345\u4e94 \u5345\u516d \u5345\u4e03 \| \u5345\u516b \u5345\u4e5d \u56db\u5341 \u56db\u5341\u4e00 \| \u56db\u5341\u4e8c \u56db\u5341\u4e09 \u56db\u5341\u56db \| \u56db\u5341\u4e94 \u56db\u5341\u516d \u56db\u5341\u4e03 \| \u56db\u5341\u516b \u56db\u5341\u4e5d \u4e94\u5341 \| \u4e94\u5341\u4e00 \| \u4e94\u5341\u4e8c \u4e94\u5341\u4e09 \u4e94\u5341\u56db \| \u4e94\u5341\u4e94 \u4e94\u5341\u516d \u4e94\u5341\u4e03 \| \u4e94\u5341\u516b \u4e94\u5341\u4e5d \u516d\u5341 \| \u516d\u5341\u4e00 \u516d\u5341\u4e8c \u516d\u5341\u4e09 \| \u516d\u5341\u56db \u516d\u5341\u4e94 \u516d\u5341\u516d \| \u516d\u5341\u4e03 \u516d\u5341\u516b \u516d\u5341\u4e5d \| \u4e03\u5341 \| \u4e03\u5341\u4e00 \u4e03\u5341\u4e8c \u4e03\u5341\u4e09 \| \u4e03\u5341\u56db \u4e03\u5341\u4e94 \u4e03\u5341\u516d \| \u4e03\u5341\u4e03 \u4e03\u5341\u516b \u4e03\u5341\u4e5d \| \u516b\u5341 \| \u516b\u5341\u4e00 \u516b\u5341\u4e8c \u516b\u5341\u4e09 \| \u516b\u5341\u56db \u516b\u5341\u4e94 \u516b\u5341\u516d \| \u516b\u5341\u4e03 \u516b\u5341\u516b \u516b\u5341\u4e5d \| \u4e5d\u5341 \| \u4e5d\u5341\u4e00 \u4e5d\u5341\u4e8c \u4e5d\u5341\u4e09 \| \u4e5d\u5341\u56db \u4e5d\u5341\u4e94 \u4e5d\u5341\u516d \| \u4e5d\u5341\u4e03 \u4e5d\u5341\u516b \u4e5d\u5341\u4e5d" \| ::msgcat::mcset ja LOCALE_DATE_FORMAT "%EY\u5e74%B%Od\u65e5" \| ::msgcat::mcset ja LOCALE_TIME_FORMAT "%OH\u6642%OM\u5206%OS\u79d2" \| ::msgcat::mcset ja LOCALE_DATE_TIME_FORMAT \ \| "%A %EY\u5e74%B%Od\u65e5%OH\u6642%OM\u5206%OS\u79d2 %z" \| ::msgcat::mcset ja LOCALE_ERAS " \| {-9223372036854775808 \u897f\u66a6 0} \| {-3060979200 \u660e\u6cbb 1867} \| {-1812153600 \u5927\u6b63 1911} \| {-1357603200 \u662d\u548c 1925} \| {568512000 \u5e73\u6210 1987}" \|} In addition to the standard locales, two special locales may appear on the ~~'''~~-locale~~'''~~ parameter; ~~'''~~current~~'''~~, which designates the result of evaluating [[mclocale]], and ~~'''~~system~~'''~~, which designates the current "system" locale, which is determined by (in order of preference): * the date/time format settings on the Windows control panel * the environment variable LC_TIME * the current locale from [[mclocale]]. ~~~~ Build System~~ Several tools are provided for the use of maintainers: loadICU.tcl: ~~Given a distribution of IBM's ''icu4c'' [http://oss.software.ibm.com/icu/index.html],~~ this program analyzes the source code of the message catalogs and extracts appropriate Tcl-based messages for the date and time formats in the supported locales. loadtzif.tcl: Given a time zone information file used by the Olson version of ~~'tzset' (for a description, see the latest 'tzcode' file in [ftp://elsie.nci.nih.gov/pub/]), creates the corresponding Tcl~~ 'tzdata' file. makeTestCases.tcl: Makes several thousand auto-generated test cases to exercise the time conversion algorithms. tclZIC.tcl: Given the source code for the Olson time zone descriptions ~~(obtainable as the latest 'tzdata' file in [ftp://elsie.nci.nih.gov/pub/]), creates the full set of Tcl~~ 'tzdata' files. Since these tools depend on third party source, they will not be included in the usual build steps; instead, maintainers will be expected to run them whenever changing files on which they depend. It will be a good practice to update the ICU and Olson files just before cutting a release. ~~~ Reference Implementation~~ ~~The implementation of a refactored [~~[clock]~~] command is a work~~ in progress, and interested developers are urged to contact the TIP author if they want to help with implementation, documentation, or testing. The code is available in the same SourceForge repository as the Tcl core, and Tcl maintainers can obtain it with ~~\| cvs -d:ext:[email protected]:/cvsroot/tcl co newclock~~ ~~~ Notes on the cost of implementation~~ Since it is well known that Tcl code is typically 30-50 times slower than the equivalent C, it is to be expected that [[clock scan]], [[clock format]], and [[clock add]] will be in that performance range. [[clock seconds]] and [[clock clicks]] will still be C code and are not expected to suffer a measurable change in performance. (If they do, the implementors plan to address the issue.) The cost of the time zone data files and the message catalogs is not trivial; they occupy about 1.6 megabytes exclusive of file system fragmentation and may occupy multiple megabytes depending ~~on the minimum size of a file. The implementors assume (and are working to ensure) that some sort of compressed virtual file system~~ will be available as core functionality in the 8.5 final release. With zlib compression, the message catalogs and time zone data total less than half a megabyte. It is worth noting that a distribution that must run in the absolute minimum space may omit both message catalogs and time zone data; if this is done, named time zones ~~(e.g., :America/New~~_York~~) will not be available on systems such~~ as Windows that lack 'zoneinfo', and will suffer from Y2038 bugs on systems such as Solaris and Linux that have 'zoneinfo'. Without the message catalogs, the only supported locale will be the root locale (and on Windows, the 'system' locale). This combination provides functionality comparable to the [~~[clock]~~] command prior to this TIP. The Tcl code that implements [~~[clock]~~] is less than eighty kilobytes with comments and blank lines removed; this amount of overhead is thought to be negligible. ~~~ Bugs~~ The reference implementation does not attempt any calendars not based on the hybrid Julian/Gregorian calendar. This implementation is adequate for the Western countries and for the Japanese civil calendar, but does not address the Hijri, Hebraic, Thai, Chinese or ~~Korean calendars. (No Tcl user has requested these, to the best of the knowledge of the author of this TIP.)~~ The Gregorian change date is not supplied in most locales. Localisation in most locales was done by an American who is probably excessively ignorant in such matters. This TIP makes no effort to be compliant with RFC 2550 ~~[http://www.faqs.org/rfcs/rfc2550.html].~~ ~~~ Copyright~~ Copyright 2004, by Kevin B. Kenny. Redistribution permitted under the terms of the Open Publication License ~~[http://www.opencontent.org/openpub/].~~ ~~~ Acknowledgments~~ The author of this TIP wishes to thank all the Tcl'ers who have taken the time to read and comment on it, most notably Joe English, Donal K. Fellows, Jeff Hobbs, Arjen Markus, Reinhard Max, Christopher Nelson, Donald G. Porter, Pascal Scheffers, and Peter da Silva.	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| < > \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| >	504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957	%Y: On output, produces the four-digit calendar year. On input, accepts four digits and may be used to determine calendar date. Note that %Y does not yield a year appropriate for use with the ISO8601 week number %V; programs should use %G for that purpose. %z: On output, produces the current time zone, expressed in hours and minutes east \(\+hhmm\) or west \(-hhmm\) of Greenwich. On input, accepts a time zone specifier \(see _TIME ZONES_ below\) that will be used to determine the time zone. %Z: On output, produces the current time zone's name, possibly translated to the given locale. On input, accepts a time zone specifier \(see _TIME ZONES_ below\) that will be used to determine the time zone. _This option should, in general, be used on input only when parsing RFC822 dates._ Other uses are fraught with ambiguity; for instance, the string _BST_ may represent _British Summer Time_ or _Brazilian Standard Time_. It is recommended that date/time strings for use by computers use numeric time zones instead. %%: On output, produces a literal '%' charater. On input, matches a literal '%' character. %\+: Synonymous with "%a %b %e %H:%M:%S %Z %Y". ## Time Zones There are several ways that a time zone may be specified for use with [clock scan], [clock format] and [clock add]. In order of preference: * The time zone may appear in the input string matched by a %z or %Z format group in [clock scan]. These format groups match time zones in the forms \+hhmm, \+hhmmss, -hhmm, -hhmmss, and alphanumeric strings. The numeric representations are self explanatory; an alphanumeric string must be the one of: gmt ut utc bst wet wat at nft nst ndt ast adt est edt cst cdt mst mdt pst pdt yst ydt hst hdt cat ahst nt idlw cet cest met mewt mest swt sst eet eest bt it zp4 zp5 ist zp6 wast wadt jt cct jst cast cadt east eadt gst nzt nzst nzdt idle > or a single letter other than J. Generally speaking, numeric time zones should be preferred for communication among computers; the alphanumeric time zones are provided primarily for the parsing of legacy RFC822 time stamps. * The time zone may appear in the -timezone argument to the [clock] command, or may be implied by the presence of -gmt 1. It is an error to use -timezone and -gmt in the same call. The -gmt 1 option may be regarded as an obsolete synonym of -timezone :UTC. * The time zone may appear in the environment variable, _TCL\_TZ_. * The time zone may appear in the environment variable, _TZ_. * Failing all of these, on Windows systems, the time zone will be obtained from the Registry. * As a last resort, the time zone is set to ':localtime'. Once the time zone is obtained by one of these means, it is interpreted as follows: ":localtime": This specifier requests that the C library functions _localtime\(\)_ and _mktime\(\)_ be used whenever converting times between local and Greenwich. It is generally used as a last resort if the time zone can be determined in no other way. "\+hhmm", "\+hhmmss", "-hhmm", "-hhmmss": These specifiers give the time zone explicitly in terms of hours, minutes and seconds east \(\+\) or west \(-\) of Greenwich. ":filename": The given file name is interpreted as a path name relative to [info library]/tzdata, and the specified file is loaded as a Tcl script. The script is expected to set the _:filename_ element in the _tzdata_ array to a list of transitions. Each transition is a four-element list comprising: > \* the time at which the transition takes place, expressed in seconds from the Posix Epoch \(1 January 1970, 00:00 UTC\) > \* the offset \(in seconds east of Greenwich\) to apply. > \* an indicator \(0=Standard Time, 1=Daylight Saving Time\) > \* the name to use when displaying the given time zone in the root locale. > The first transition is expected to take place at time -9223372036854775808, the smallest value of a wide integer. Any string recognizable as a Posix time zone specifier: A time zone may be specified in Posix syntax \(see <http://www.opengroup.org/onlinepubs/007904975/basedefs/xbd_chap08.html> \), for example _EST5EDT_ or _EST\+05:00EDT\+04:00,M4.1.0/01:00,M10.5.0/02:00_. Any other string is processed by prefixing a colon and attempting to load the given file, as shown above. ## Localisation The [clock] command is localised by a set of message catalogs located in [file join [info library] clock msgs] and loaded into the namespace, ::tcl::clock. The possible strings to be translated include: AM: The string that identifies _ante meridiem_ times when expressing a time of day in the given locale. This string has the value, \{am\} in the root locale. BCE: The string that identifies dates before the Common Era in the given locale. This string has the value, \{B.C.E.\} in the root locale. Those localising this string should be aware that, depending on local culture, a name such as "B.C." \(before Christ\) may be offensive. CE: The string that identifies dates of the Common Era in the given locale. This string has the value, \{C.E.\} in the root locale. Those localising this string should be aware that, depending on local culture, a name such as "A.D." \(Latin, _anno Domini_, "in the year of Our Lord"\) may be offensive. DATE\_FORMAT: The format specifier for calendar dates in the given locale. In the root locale, %m/%d/%Y is used for compatibility with earlier versions of the [clock] command, even though %Y-%m-%d would probably be preferable. DATE\_TIME\_FORMAT: The format specifier for combined date and time in the given locale. In the root locale, \{%a %b %e %H:%M:%S %Y\} is used for compatibility with earlier versions of the [clock] command, even though %Y-%m-%dT%H:%M:%S would be preferable. DAYS\_OF\_WEEK\_ABBREV: Abbreviations of the days of the week in the given locale. In the root locale, this string has the value, \{Sun Mon Tue Wed Thu Fri Sat\}. In any locale, this string is expected to represent a valid Tcl list. DAYS\_OF\_WEEK\_FULL: Full names of the days of the week in the given locale. In the root locale, this string has the value, \{Sunday Monday Tuesday Wednesday Thursday Friday Saturday\}. In any locale, this string is expected to represent a valid Tcl list. GREGORIAN\_CHANGE\_DATE: The date on which the change from the Julian to the Gregorian calendar takes place, expressed as a Julian Day Number. In the root locale, this string has the value, \{2299161\}, corresponding to 15 October 1582 New Style. In the 'en' locale, this value is \{2361222\}, 14 September 1752 New Style. LOCALE\_DATE\_FORMAT: The format to use when formatting dates in the locale's alternative calendar. In the root locale, LOCALE\_DATE\_FORMAT is _%x_, which causes formatting without alternative numerals. LOCALE\_DATE\_TIME\_FORMAT: The format to use when formatting date/time strings in the locale's alternative calendar. In the root locale, LOCALE\_DATE\_TIME\_FORMAT is _%Ex %EX_, which causes concatenation of the locale's format for date, a space character, and the locale's format for time. LOCALE\_ERAS: In a locale where a calendar with multiple eras is in use, gives a list of triples. The first element of each triple is the time \(in seconds from the Posix epoch of 1 January 1970, 00:00 UTC\) at which the era begins; the second is the name of the era, and the third is a constant offset to be subtracted from the Gregorian year to give the year of the era. In any locale, this string is expected to represent a valid Tcl list. LOCALE\_NUMERALS: In a locale where alternative numerals may be used, gives a list containing the numerals that represent the numbers from zero to ninety-nine. Note that these numerals are the ones typically used on calendars, not the ones that represent currencies or quantities. For instance, in a Han locale, the number twenty-one is represented by \\u5eff\\u4e00, not by \\u4e8c\\u5341\\u4e00. In any locale, this string is expected to represent a valid Tcl list. LOCALE\_TIME\_FORMAT: The time format to use when formatting a time of day using a locale's alternative numerals. In the root locale, this string is _%X_, which causes formatting without alternative numerals. LOCALE\_YEAR\_FORMAT: The time format to use when formatting a year in the locale's alternative calendar. In the root locale, this string is %Y. MONTHS\_ABBREV: Abbreviated names of the months in the given locale. In the root locale, consists of three-letter abbreviations for the English months: Jan-Dec. In any locale, this string is expected to represent a valid Tcl list. MONTHS\_FULL: Full names of the months in the given locale. In the root locale, consists of the names of the English months in order from 'January' to 'December'. In any locale, this string is expected to represent a valid Tcl list. PM: The string that identifies _post meridiem_ times when expressing a time of day in the given locale. This string has the value, \{pm\} in the root locale. TIME\_FORMAT: String that specifies the default time format in the given locale. In the root locale, this string is \{%H:%M:%S\} TIME\_FORMAT\_12: String that formats time on a 12-hour clock in the given locale. In the root locale, this string is \{%I:%M:%S %p\}. TIME\_FORMAT\_24: String that formats time on a 24-hour clock in the given locale. In the root locale, this string is \{%H:%M\}. There is a defined order for substitution of locale strings, which constrains the format groups that can appear in the _\_FORMAT_ strings. Specifically: * DATE\_TIME\_FORMAT and LOCALE\_DATE\_TIME\_FORMAT may contain any format groups other than _%c_ and _%Ec_. * LOCALE\_DATE\_FORMAT and LOCALE\_TIME\_FORMAT may not contain _%c_, _%Ec_, _%Ex_, or _%EX_. * DATE\_FORMAT and TIME\_FORMAT may not contain _%c_, _%Ec_, _%x_, _%Ex_, _%X_, or _%EX_. * TIME\_FORMAT\_12 and TIME\_FORMAT\_24 may not contain _%c_, _%Ec_, _%r_, _%R_, _%T_, _%x_, _%Ex_, _%X_, or _%EX_. * LOCALE\_YEAR\_FORMAT may not contain _%c_, _%Ec_, _%r_, _%R_, _%T_, _%x_, _%Ex_, _%X_, _%EX_, or _%Ey_. _Example._ The following file is "ja.msg", which localises the [clock] command to a Japanese locale. namespace eval ::tcl::clock { ::msgcat::mcset ja DAYS_OF_WEEK_ABBREV [list \ "\u65e5"\ "\u6708"\ "\u706b"\ "\u6c34"\ "\u6728"\ "\u91d1"\ "\u571f"] ::msgcat::mcset ja DAYS_OF_WEEK_FULL [list \ "\u65e5\u66dc\u65e5"\ "\u6708\u66dc\u65e5"\ "\u706b\u66dc\u65e5"\ "\u6c34\u66dc\u65e5"\ "\u6728\u66dc\u65e5"\ "\u91d1\u66dc\u65e5"\ "\u571f\u66dc\u65e5"] ::msgcat::mcset ja MONTHS_ABBREV [list \ "1"\ "2"\ "3"\ "4"\ "5"\ "6"\ "7"\ "8"\ "9"\ "10"\ "11"\ "12"\ ""] ::msgcat::mcset ja MONTHS_FULL [list \ "1\u6708"\ "2\u6708"\ "3\u6708"\ "4\u6708"\ "5\u6708"\ "6\u6708"\ "7\u6708"\ "8\u6708"\ "9\u6708"\ "10\u6708"\ "11\u6708"\ "12\u6708"\ ""] ::msgcat::mcset ja BCE "\u7d00\u5143\u524d" ::msgcat::mcset ja CE "\u897f\u66a6" ::msgcat::mcset ja AM "\u5348\u524d" ::msgcat::mcset ja PM "\u5348\u5f8c" ::msgcat::mcset ja DATE_FORMAT "%Y/%m/%d" ::msgcat::mcset ja TIME_FORMAT "%k:%M:%S" ::msgcat::mcset ja DATE_TIME_FORMAT "%Y/%m/%d %k:%M:%S %z" ::msgcat::mcset ja LOCALE_NUMERALS "\u3007 \u4e00 \u4e8c \u4e09 \u56db \u4e94 \u516d \u4e03 \u516b \u4e5d \u5341 \u5341\u4e00 \u5341\u4e8c \u5341\u4e09 \u5341\u56db \u5341\u4e94 \u5341\u516d \u5341\u4e03 \u5341\u516b \u5341\u4e5d \u4e8c\u5341 \u5eff\u4e00 \u5eff\u4e8c \u5eff\u4e09 \u5eff\u56db \u5eff\u4e94 \u5eff\u516d \u5eff\u4e03 \u5eff\u516b \u5eff\u4e5d \u4e09\u5341 \u5345\u4e00 \u5345\u4e8c \u5345\u4e09 \u5345\u56db \u5345\u4e94 \u5345\u516d \u5345\u4e03 \u5345\u516b \u5345\u4e5d \u56db\u5341 \u56db\u5341\u4e00 \u56db\u5341\u4e8c \u56db\u5341\u4e09 \u56db\u5341\u56db \u56db\u5341\u4e94 \u56db\u5341\u516d \u56db\u5341\u4e03 \u56db\u5341\u516b \u56db\u5341\u4e5d \u4e94\u5341 \u4e94\u5341\u4e00 \u4e94\u5341\u4e8c \u4e94\u5341\u4e09 \u4e94\u5341\u56db \u4e94\u5341\u4e94 \u4e94\u5341\u516d \u4e94\u5341\u4e03 \u4e94\u5341\u516b \u4e94\u5341\u4e5d \u516d\u5341 \u516d\u5341\u4e00 \u516d\u5341\u4e8c \u516d\u5341\u4e09 \u516d\u5341\u56db \u516d\u5341\u4e94 \u516d\u5341\u516d \u516d\u5341\u4e03 \u516d\u5341\u516b \u516d\u5341\u4e5d \u4e03\u5341 \u4e03\u5341\u4e00 \u4e03\u5341\u4e8c \u4e03\u5341\u4e09 \u4e03\u5341\u56db \u4e03\u5341\u4e94 \u4e03\u5341\u516d \u4e03\u5341\u4e03 \u4e03\u5341\u516b \u4e03\u5341\u4e5d \u516b\u5341 \u516b\u5341\u4e00 \u516b\u5341\u4e8c \u516b\u5341\u4e09 \u516b\u5341\u56db \u516b\u5341\u4e94 \u516b\u5341\u516d \u516b\u5341\u4e03 \u516b\u5341\u516b \u516b\u5341\u4e5d \u4e5d\u5341 \u4e5d\u5341\u4e00 \u4e5d\u5341\u4e8c \u4e5d\u5341\u4e09 \u4e5d\u5341\u56db \u4e5d\u5341\u4e94 \u4e5d\u5341\u516d \u4e5d\u5341\u4e03 \u4e5d\u5341\u516b \u4e5d\u5341\u4e5d" ::msgcat::mcset ja LOCALE_DATE_FORMAT "%EY\u5e74%B%Od\u65e5" ::msgcat::mcset ja LOCALE_TIME_FORMAT "%OH\u6642%OM\u5206%OS\u79d2" ::msgcat::mcset ja LOCALE_DATE_TIME_FORMAT \ "%A %EY\u5e74%B%Od\u65e5%OH\u6642%OM\u5206%OS\u79d2 %z" ::msgcat::mcset ja LOCALE_ERAS " {-9223372036854775808 \u897f\u66a6 0} {-3060979200 \u660e\u6cbb 1867} {-1812153600 \u5927\u6b63 1911} {-1357603200 \u662d\u548c 1925} {568512000 \u5e73\u6210 1987}" } In addition to the standard locales, two special locales may appear on the -locale parameter; current, which designates the result of evaluating [mclocale], and system, which designates the current "system" locale, which is determined by \(in order of preference\): * the date/time format settings on the Windows control panel * the environment variable LC\_TIME * the current locale from [mclocale]. ## Build System Several tools are provided for the use of maintainers: loadICU.tcl: Given a distribution of IBM's _icu4c_ <http://oss.software.ibm.com/icu/index.html> , this program analyzes the source code of the message catalogs and extracts appropriate Tcl-based messages for the date and time formats in the supported locales. loadtzif.tcl: Given a time zone information file used by the Olson version of 'tzset' \(for a description, see the latest 'tzcode' file in [ftp://elsie.nci.nih.gov/pub/]\), creates the corresponding Tcl 'tzdata' file. makeTestCases.tcl: Makes several thousand auto-generated test cases to exercise the time conversion algorithms. tclZIC.tcl: Given the source code for the Olson time zone descriptions \(obtainable as the latest 'tzdata' file in [ftp://elsie.nci.nih.gov/pub/]\), creates the full set of Tcl 'tzdata' files. Since these tools depend on third party source, they will not be included in the usual build steps; instead, maintainers will be expected to run them whenever changing files on which they depend. It will be a good practice to update the ICU and Olson files just before cutting a release. # Reference Implementation The implementation of a refactored [clock] command is a work in progress, and interested developers are urged to contact the TIP author if they want to help with implementation, documentation, or testing. The code is available in the same SourceForge repository as the Tcl core, and Tcl maintainers can obtain it with cvs -d:ext:[email protected]:/cvsroot/tcl co newclock # Notes on the cost of implementation Since it is well known that Tcl code is typically 30-50 times slower than the equivalent C, it is to be expected that [clock scan], [clock format], and [clock add] will be in that performance range. [clock seconds] and [clock clicks] will still be C code and are not expected to suffer a measurable change in performance. \(If they do, the implementors plan to address the issue.\) The cost of the time zone data files and the message catalogs is not trivial; they occupy about 1.6 megabytes exclusive of file system fragmentation and may occupy multiple megabytes depending on the minimum size of a file. The implementors assume \(and are working to ensure\) that some sort of compressed virtual file system will be available as core functionality in the 8.5 final release. With zlib compression, the message catalogs and time zone data total less than half a megabyte. It is worth noting that a distribution that must run in the absolute minimum space may omit both message catalogs and time zone data; if this is done, named time zones \(e.g., :America/New\_York\) will not be available on systems such as Windows that lack 'zoneinfo', and will suffer from Y2038 bugs on systems such as Solaris and Linux that have 'zoneinfo'. Without the message catalogs, the only supported locale will be the root locale \(and on Windows, the 'system' locale\). This combination provides functionality comparable to the [clock] command prior to this TIP. The Tcl code that implements [clock] is less than eighty kilobytes with comments and blank lines removed; this amount of overhead is thought to be negligible. # Bugs The reference implementation does not attempt any calendars not based on the hybrid Julian/Gregorian calendar. This implementation is adequate for the Western countries and for the Japanese civil calendar, but does not address the Hijri, Hebraic, Thai, Chinese or Korean calendars. \(No Tcl user has requested these, to the best of the knowledge of the author of this TIP.\) The Gregorian change date is not supplied in most locales. Localisation in most locales was done by an American who is probably excessively ignorant in such matters. This TIP makes no effort to be compliant with RFC 2550 <http://www.faqs.org/rfcs/rfc2550.html> . # Copyright Copyright 2004, by Kevin B. Kenny. Redistribution permitted under the terms of the Open Publication License <http://www.opencontent.org/openpub/> . # Acknowledgments The author of this TIP wishes to thank all the Tcl'ers who have taken the time to read and comment on it, most notably Joe English, Donal K. Fellows, Jeff Hobbs, Arjen Markus, Reinhard Max, Christopher Nelson, Donald G. Porter, Pascal Scheffers, and Peter da Silva.

~~1 2 3 4 5 6 7 8 9 10 11~~ 12 13 14 15 16 ~~17 18 19~~ 20 21 22 23 24 25 26 27 ~~28 29~~ 30 31 32 33 34 35 36	~~TIP: 219~~ T~~itle~~: Tcl Channel Reflection API ~~Version: $Revision: 1.27 $~~ Author: Andreas Kupries <[email protected]> Author: Andreas Kupries <[email protected]> State: Final Type: Project Vote: Done Created: 09-Sep-2004 Post-History: Tcl-Version: 8.5 ~~~ Abstract~~ This document describes an API which reflects the Channel Driver API of the core I/O system up into the Tcl level, for the implementation of channel types in Tcl. It is built on top of [~~208]~~ ('Add a chan command') and also an independent companion to [~~230]~~ ('Tcl Channel Transformation Reflection API') and [~~228]~~ ('Tcl Filesystem Reflection API'). As the later TIPs bring the ability of writing channel transformations and filesystems in Tcl itself into the core so this TIP provides the facilities for the implementation of new ~~channel types in Tcl. This document specifies version ~~''1''~~ of the channel~~ reflection API. ~~~ Motivation / Rationale~~ The purpose of this and the other reflection TIPs is to provide all the ~~facilities required for the creation and usage of wrapped files (= virtual filesystems attached to executables and binary libraries) within the core.~~ While it is possible to implement and place all the proposed reflectivity in separate and external packages, this however means that the core itself cannot make use of wrapping technology and virtual filesystems to encapsulate and attach its own data and library files to itself. This is something which is desirable as it can make the deployment and embedding of the core easier, due to having less files to deal with, and a higher degree of self-containment.	< \| < \| \| \| \| \| \| \| \| > \| \| \| \| \| \| \| \|	1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35	# TIP 219: Tcl Channel Reflection API Author: Andreas Kupries <[email protected]> Author: Andreas Kupries <[email protected]> State: Final Type: Project Vote: Done Created: 09-Sep-2004 Post-History: Tcl-Version: 8.5 ----- # Abstract This document describes an API which reflects the Channel Driver API of the core I/O system up into the Tcl level, for the implementation of channel types in Tcl. It is built on top of [[208]](208.md) \('Add a chan command'\) and also an independent companion to [[230]](230.md) \('Tcl Channel Transformation Reflection API'\) and [[228]](228.md) \('Tcl Filesystem Reflection API'\). As the later TIPs bring the ability of writing channel transformations and filesystems in Tcl itself into the core so this TIP provides the facilities for the implementation of new channel types in Tcl. This document specifies version _1_ of the channel reflection API. # Motivation / Rationale The purpose of this and the other reflection TIPs is to provide all the facilities required for the creation and usage of wrapped files \(= virtual filesystems attached to executables and binary libraries\) within the core. While it is possible to implement and place all the proposed reflectivity in separate and external packages, this however means that the core itself cannot make use of wrapping technology and virtual filesystems to encapsulate and attach its own data and library files to itself. This is something which is desirable as it can make the deployment and embedding of the core easier, due to having less files to deal with, and a higher degree of self-containment.
︙			︙
44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 ~~74 75~~ 76 77 78 79 80 ~~81 82 83~~ 84 85 86 87 ~~88 89~~ 90 91 92 93 ~~94 95 96 97 98~~ 99 ~~100 101~~ 102 103 104 ~~105~~ 106 107 108 109 110 ~~111~~ 112 ~~113~~ 114 115 ~~116~~ 117 118 119 ~~120 121~~ 122 123 ~~124 125~~ 126 ~~127~~ 128 ~~129 130~~ 131 132 ~~133~~ 134 135 136 137 138 ~~139 140 141~~ 142 143 144 ~~145~~ 146 ~~147~~ 148 ~~149~~ 150 151 ~~152 153~~ 154 155 ~~156~~ 157 ~~158~~ 159 160 161 162 163 ~~164~~ 165 ~~166~~ 167 168 169 170 ~~171~~ 172 173 ~~174~~ 175 ~~176 177~~ 178 179 180 181 182 ~~183~~ 184 ~~185 186~~ 187 ~~188~~ 189 190 ~~191~~ 192 ~~193 194~~ 195 ~~196~~ 197 ~~198 199~~ 200 201 202 ~~203~~ 204 ~~205 206~~ 207 208 ~~209 210~~ 211 ~~212~~ 213 214 ~~215~~ 216 ~~217~~ 218 ~~219 220 221~~ 222 223 224 225 ~~226~~ 227 228 229 230 231 232 ~~233 234~~ 235 ~~236~~ 237 238 ~~239~~ 240 ~~241~~ 242 ~~243~~ 244 245 246 ~~247~~ 248 ~~249~~ 250 ~~251~~ 252 ~~253~~ 254 ~~255~~ 256 ~~257~~ 258 259 260 ~~261~~ 262 263 ~~264~~ 265 266 267 ~~268 269~~ 270 ~~271~~ 272 273 ~~274~~ 275 276 277 ~~278 279~~ 280 ~~281~~ 282 ~~283~~ 284 ~~285~~ 286 ~~287~~ 288 ~~289 290~~ 291 292 ~~293~~ 294 295 ~~296~~ 297 ~~298~~ 299 ~~300 301~~ 302 303 ~~304~~ 305 ~~306 307~~ 308 309 ~~310~~ 311 ~~312~~ 313 ~~314 315~~ 316 317 ~~318~~ 319 320 ~~321 322~~ 323 324 ~~325~~ 326 327 ~~328~~ 329 ~~330~~ 331 ~~332 333 334~~ 335 336 337 338 339 ~~340~~ 341 342 ~~343~~ 344 ~~345~~ 346 347 ~~348~~ 349 ~~350 351~~ 352 353 ~~354~~ 355 ~~356 357~~ 358 ~~359~~ 360 361 ~~362~~ 363 364 365 ~~366~~ 367 368 ~~369~~ 370 371 ~~372~~ 373 374 ~~375~~ 376 377 ~~378~~ 379 380 ~~381~~ 382 383 ~~384~~ 385 386 387 388 389 390 391 392 ~~393~~ 394 ~~395~~ 396 397 398 399 400 401 ~~402 403 404 405~~ 406 ~~407~~ 408 ~~409~~ 410 411 412 413 414 ~~415~~ 416 417 ~~418~~ 419 ~~420 421~~ 422 423 424 425 426 427 428 429 430 431 ~~432~~ 433 434 435 436 437 438 439 440 ~~441~~ 442 443 444 ~~445~~ 446 447 448 ~~449 450~~ 451 452 ~~453 454 455~~ 456 ~~457~~ 458 459 ~~460~~ 461 ~~462~~ 463 ~~464~~ 465 ~~466~~ 467 ~~468~~ 469 ~~470~~ 471 ~~472~~ 473 ~~474~~ 475 ~~476~~ 477 ~~478~~ 479 480 ~~481~~ 482 ~~483~~ 484 485 ~~486~~ 487 ~~488~~ 489 490 ~~491~~ 492 ~~493~~ 494 ~~495~~ 496 ~~497~~ 498 499 ~~500~~ 501 ~~502~~ 503 504 505 506 507 508 509 ~~510~~ 511 ~~512~~ 513 ~~514~~ 515 ~~516~~ 517 ~~518~~ 519 ~~520~~ 521 ~~522~~ 523 ~~524~~ 525 ~~526~~ 527 ~~528~~ 529 ~~530~~ 531 ~~532~~ 533 ~~534~~ 535 536 537 538 ~~539~~ 540 ~~541~~ 542 ~~543~~ 544 545 546 ~~547~~ 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 ~~567~~ 568 569 570 571 572 573 574 575 576 ~~577~~ 578 ~~579~~ 580 ~~581~~ 582 ~~583~~ 584 ~~585~~ 586 587 588 589 590 591 592	give users of the core the freedom to experiment with their own ideas, instead of constraining them to what we managed to envision. Another use for reflected channels was found when creating the reference implementation: As helper for testing the generic I/O system of Tcl, by creating channels which forcibly return errors, bogus data, and the like. ~~~ Specification~~ ~~~~ Introduction~~ This specification has to address two questions to make the reflection work. * How are the driver functions reflected into the Tcl level? * How are file events generated in the Tcl level communicated back to the C level? This includes routing to the correct channel. ~~~~ C Level API~~ ~~Four functions are added to the public C API. See section "''Error Handling''"~~ for their detailed specification. ~~~~ Tcl Level API~~ The Tcl Level API consists of two new subcommands added to the ensemble ~~command ~~'''~~chan~~'''~~ specified by [~~208]~~. The new subcommands are:~~ * ~~'''~~chan create~~''' ''~~mode cmdprefix'' ~~> This subcommand creates a new script level channel using the command prefix ''cmdprefix'' as its handler. The ''cmdprefix'' has to be a list. The API~~ this handler has to provide is specified below, in the section "Command Handler API". The handle of the new channel is returned as the result of ~~the command, and the channel is open. Use the regular ~~'''~~close~~'''~~ command~~ to remove the channel. > The argument ''mode'' specifies if the channel is opened for reading, writing, or both. It is a list containing any of the strings ~~'''~~read~~'''~~ or ~~'''~~write~~'''~~. The list has to have at least one element, as a channel you can neither write to nor read from makes no sense. The handler command for the new channel has to support the chosen mode. An error is thrown if that is not the case. ~~> We have chosen to use ''late binding'' of the handler command. See the section "''Early versus Late Binding of the Handler Command''" for more~~ detailed explanations. * ~~'''~~chan postevent~~''' ''~~channel eventspec'' > This subcommand is for use by command handlers, it notifies the channel represented by ''channel'' that the event(s) listed in the ''eventspec'' have occurred. The argument ''eventspec'' is a list containing any of ~~'''~~read~~'''~~ and ~~'''~~write~~'''~~. At least one element is required (It does not make sense to invoke the command if there are no events to post). ~~> Note that this subcommand can be used only on channel handles which were created/opened with the subcommand ~~'''~~create~~'''~~. Application to channels~~ like files, sockets, etc. is not possible and will cause the generation of an error. ~~> As only the Tcl level of a channel, i.e. its command handler, should post~~ events to it we also restrict the usage of this command to the interpreter the handler command is in. In other words, posting events to a reflected channel from a different interpreter than its implementation is in is not allowed. ~~> Another restriction is that it is not possible to post events the I/O core~~ has not registered interest in. Trying to do so will cause the method to ~~throw an error. See the method ~~'''~~watch~~'''~~ in section "Command Handler API"~~ as well. ~~~~ Command Handler API~~ The Tcl-level handler command for a reflected channel is an ensemble that has to support the following subcommands, as listed below. Note that the term ~~''ensemble'' is used to generically describe all command (prefixes) which are able to process subcommands. This TIP is ~~''not''~~ tied to the recently~~ introduced 'namespace ensemble's. ~~Of the available methods the handler ~~'''~~has to~~'''~~ support ~~'''~~initialize~~'''~~, ~~'''~~finalize~~'''~~, and ~~'''~~watch~~'''~~, always. The other methods are optional.~~ * ''handlerCmd~~'' '''~~initialize~~''' ''~~channel mode'' ~~> This is the first call the command handler will receive for the given new ''channel''. It is his responsibility to set up any internal data~~ structures it needs to keep track of the channel and its state. ~~> The return value of the method has to be a list containing the names of all~~ methods which are supported by this handler. This implicitly tells the C level the version of the API used by the command handler making a separate version number redundant. Hence our decision to leave such a number out of the API. Any changes to the API will be either the elimination of methods, or the introduction of new ones. An existing method cannot change its signature (arguments, and result), a new method has to be introduced for this. All of this implies that this method, ~~'''~~initialize~~'''~~, ~~'''~~is unchangeable~~'''~~ after the TIP has been committed, as it is the entry point through which the C level will determine the API version before it knows anything else. ~~> Any error thrown by the method will abort the creation of the channel and~~ no channel will be created. The thrown error will appear as error thrown by ~~~~'''~~chan create~~'''~~.~~ ~~> Any exception beyond ''error'', like ''break'', etc. is treated as and~~ converted to an error. ~~> ~~'''~~Important~~'''~~ - If the creation of the channel was aborted due to failures in ~~'''~~initialize~~'''~~ then the method ~~'''~~finalize~~'''~~ will ~~''not''~~ be~~ called. ~~> This method has no equivalent at the C level.~~ ~~> It was considered to return only the list of optional methods supported by~~ the handler. The chosen method however should make the code in the C layer more regular. Another advantage of this is that it allows the C level to better check if the API it expects is matching the API provided by the handler. ~~> The argument ''mode'' tells the handler if the channel was opened for~~ reading, writing, or both. It is a list containing any of the strings ~~~~'''~~read~~'''~~ or ~~'''~~write~~'''~~. The C-level doing the call will never generate~~ abbreviations of these strings. The list will always contain at least one element, as a channel you can neither write to nor read from makes no sense. ~~> The method has to throw an error if the chosen mode is not supported by the~~ handler command. * ''handlerCmd~~'' '''~~finalize~~''' ''~~channel'' ~~> The method is called when the channel was ~~'''~~close~~'''~~d, and is the last call a handler can receive for the given ''channel''. This happens just~~ before the destruction of the C level data structures. Still, the command handler must not access the channel anymore in no way. It is now his responsibility to clean up any internal resources it allocated to this channel. ~~> The return value of the method is ignored.~~ ~~> If the method throws an error the command which caused its invocation (usually ~~'''~~close~~'''~~) will appear to have thrown this error.~~ ~~> Any exception beyond ''error'', like ''break'', etc. is treated as and~~ converted to an error. ~~> The equivalent C-level function is ~~''Tcl~~_DriverCloseProc''.~~ ~~> This method is not invoked if the creation of the channel was aborted during ~~'''~~initialize~~'''~~.~~ * ''handlerCmd~~'' '''~~read~~''' ''~~channel count'' ~~> This method is ''optional''. It is called when the user requests data from a channel. ''count'' specifies how many ''bytes'' have been requested. If~~ the method is not supported then it is not possible to read from the channel handled by the command. ~~> The return value of the method is taken as the requested data. If the~~ returned data contains more bytes than requested an error will be signaled ~~and later thrown by the command which performed the read (usually ~~'''~~gets~~'''~~ or ~~'''~~read~~'''~~). Returning less bytes than requested is~~ acceptable however. ~~> If the method throws an error the command which caused its invocation (usually ~~'''~~gets~~'''~~, or ~~'''~~read~~'''~~) will appear to have thrown this error.~~ ~~> Any exception beyond ''error'', like ''break'', etc. is treated as and~~ converted to an error. ~~> The equivalent C-level function is ~~''Tcl~~_DriverInputProc''.~~ * ''handlerCmd~~'' '''~~write~~''' ''~~channel data'' > This method is ''optional''. It is called when the user writes data to the channel. Note that the ''data'' are bytes, not characters (The underlying Tcl_ObjType is ''ByteArray''). Any type of transformation (EOL, encoding) configured for the channel has already been applied at this point. If the method is not supported then it is not possible to write to the channel handled by the command. ~~> The return value of the method is taken as the number of bytes written by~~ the channel. Anything non-numeric will cause an error to be signaled and later thrown by the command which performed the write. A negative value implies that the write failed. Returning a value greater than the number of bytes given to the handler, or zero, is forbidden and will cause the C level to throw errors. ~~> If the method throws an error the command which caused its invocation (usually ~~'''~~puts~~'''~~) will appear to have thrown this error.~~ ~~> Any exception beyond ''error'', like ''break'', etc. is treated as and~~ converted to an error. ~~> The equivalent C-level function is ~~''Tcl~~_DriverOutputProc''.~~ * ''handlerCmd~~'' '''~~seek~~''' ''~~channel offset base'' ~~> This method is ''optional''. It is responsible for the handling of seek and~~ tell requests on the channel. If it is not supported then seeking will not be possible for the channel. ~~~~> ''~~base'' is one of~~ ~~> * '''start~~'''~~ - Seeking is relative to the beginning of the channel.~~ ~~> * '''current~~'''~~ - Seeking is relative to the current seek position.~~ ~~> * '''end~~'''~~ - Seeking is relative to the end of the channel.~~ ~~> The base argument of the builtin ~~'''~~seek~~'''~~ command takes the same names.~~ ~~> The ''offset'' is an integer number specifying the amount of ''bytes'' to~~ seek forward or backward. A positive number will seek forward, and a negative number will seek backward. ~~> A channel may provide only limited seeking. For example sockets can seek~~ forward, but not backward. ~~> The return value of the method is taken as the ~~(new~~) location of the~~ channel, counted from the start. This has to be an integer number greater than or equal to zero. ~~> If the method throws an error the command which caused its invocation (usually ~~'''~~seek~~'''~~) will appear to have thrown this error.~~ ~~> Any exception beyond ''error'', like ''break'', etc. is treated as and~~ converted to an error. ~~> The offset/base combination of 0/"current" signals a ~~'''~~tell~~'''~~ request,~~ i.e. seek nothing relative to the current location, making the new location identical to the current one, which is then returned. ~~> The equivalent C-level functions are ~~''Tcl~~_DriverSeekProc'', and ~~''Tcl~~_DriverWideSeekProc'' (where possible).~~ * ''handlerCmd~~'' '''~~configure~~''' ''~~channel option value'' ~~> This method is ''optional''. It is for writing the type specific options.~~ ~~> Per call one option has to be written.~~ ~~> The return value of the method is ignored.~~ ~~> If the method throws an error the command which performed the ~~(re~~)configuration or query (usually ~~'''~~fconfigure~~'''~~) will appear to have~~ thrown this error. ~~> Any exception beyond ''error'', like ''break'', etc. is treated as and~~ converted to an error. ~~> The equivalent C-level function is ~~''Tcl~~_DriverSetOptionProc''.~~ * ''handlerCmd~~'' '''~~cget~~''' ''~~channel option'' ~~> This method is ''optional''. It is used when reading a single type specific option. If this method is supported then the method ~~'''~~cgetall~~'''~~ has to be~~ supported as well. ~~> The call has to return the value of the specified option.~~ ~~> If the method throws an error the command which performed the ~~(re~~)configuration or query (usually ~~'''~~fconfigure~~'''~~) will appear to have~~ thrown this error. ~~> The equivalent C-level function is ~~''Tcl~~_DriverGetOptionProc''.~~ * ''handlerCmd~~'' '''~~cgetall~~''' ''~~channel'' ~~> This method is ''optional''. It is used for reading all type specific options. If this method is supported then the method ~~'''~~cget~~'''~~ has to be~~ supported as well. ~~> It has to return a list of all options and their values. This list has to~~ have an even number of elements. ~~> If the method throws an error the command which performed the ~~(re~~)configuration or query (usually ~~'''~~fconfigure~~'''~~) will appear to have~~ thrown this error. ~~> Any exception beyond ''error'', like ''break'', etc. is treated as and~~ converted to an error. ~~> The equivalent C-level function is ~~''Tcl~~_DriverGetOptionProc''.~~ * ''handlerCmd~~'' '''~~watch~~''' ''~~channel eventspec'' > This methods notifies the Tcl level that the specified channel is interesting in the events listed in the ''eventspec''. This is a list containing any of ~~'''~~read~~'''~~ and ~~'''~~write~~'''~~. The C-level doing the call will never generate abbreviations of these strings. The empty list is allowed as well and signals that the channel does not wish to be notified of any events. In other words, it has to disable event generation at the Tcl level. ~~> Any return value of the method is ignored. This includes errors thrown by~~ the method, break, continue, and custom return codes. ~~> The equivalent C-level function is ~~''Tcl~~_DriverWatchProc''.~~ ~~> This method interacts with ~~'''~~chan postevent~~'''~~. Trying to post an event~~ not listed in the last call to this method will cause an error. * ''handlerCmd~~'' '''~~blocking~~''' ''~~channel mode'' ~~> This method is ''optional''. It handles changes to the blocking mode of the channel. The ''mode'' is a boolean flag. True means that the channel has to~~ be set to blocking. False means that the channel should be non-blocking. ~~> The return value of the method is ignored.~~ ~~> If the method throws an error the command which caused its invocation (usually ~~'''~~fconfigure~~'''~~) will appear to have thrown this error.~~ ~~> Any exception beyond ''error'', like ''break'', etc. is treated as and~~ converted to an error. ~~> The equivalent C-level function is ~~''Tcl~~_DriverBlockModeProc''.~~ Notes: * The function ~~''Tcl~~_DriverGetHandleProc'' is not supported. There is no equivalent handler method at the Tcl level. * The function ~~''Tcl~~_DriverHandlerProc'' is not supported. There is no equivalent handler method at the Tcl level. The function has no relevance to base channels, which we work with here, only for channel ~~transformations. See [~~230]~~ ('Tcl Channel Transformation Reflection API')~~ for more information on the issue. * The function ~~''Tcl~~_DriverFlushProc'' is not supported. The reason for this: The current generic I/O layer of Tcl does not use this function at all, nowhere. Therefore support at the Tcl level makes no sense either. We can ~~always extend the API defined here (and change its version number) should~~ the function be used at some time in the future. ~~~~ Error handling~~ The current I/O core's ability to handle arbitrary Tcl error messages is very ~~limited. ~~''Tcl~~_DriverGetOptionProc'' and ~~''Tcl~~_DriverSetOptionProc'' are the~~ only driver functions for which this is possible directly. Everywhere else the API is restricted to returning POSIX error codes. This limitation makes the debugging of problems in a channel command handler at least very difficult. As such it is considered not acceptable. It is proposed to solve this problem through the addition of four new functions to Tcl's public stub table. ~~> void ~~'''Tcl~~_SetChannelError~~'''~~(Tcl_Channel ~~''chan''~~, Tcl_Obj* ''msg'')~~ ~~> void ~~'''Tcl~~_SetChannelErrorInterp~~'''~~(Tcl_Interp* ''ip'', Tcl_Obj* ''msg'')~~ > > These functions store error information in a channel or interpreter. Previously stored information will be discarded. They have to be used by channel drivers wishing to pass regular Tcl error information to the generic layer of the I/O core. > > The refCount of ~~''msg''~~ is unchanged when the functions had to rewrite ~~''msg''~~ per the safety precautions explained below, as a properly modified copy of ~~''msg''~~ is stored, and not ~~''msg''~~ itself. Otherwise the refCount of ~~''msg''~~ is incremented by one. ~~> void ~~'''Tcl~~_GetChannelError~~'''~~(Tcl_Channel ~~''chan''~~, Tcl_Obj** ''msg'')~~ ~~> void ~~'''Tcl~~_GetChannelErrorInterp~~'''~~(Tcl_Interp* ''ip'', Tcl_Obj** ''msg'')~~ > > These function retrieve error information stored in a channel or interpreter O, and also resets O to have no information stored in it. They will return NULL if no information was stored to begin with. ~~> > i.e. After an invocation of ~~'''~~Tcl_GetChannelError''' for a~~ channel/interpreter object O, all following invocations will return NULL for that object, until an intervening invocation of ~~~~'''~~Tcl_SetChannelError''' again stored information in O.~~ ~~> > The ~~''msg''~~ argument is not allowed to be NULL. Nor are the ''chan'' and ~~''ip''~~ arguments.~~ > > The refCount of the returned information is not touched. The reference previously held by the channel or interpreter is now held by the caller of the function and it is its responsibility to release that reference when it is done with the object. This solution is not very elegant, but anything else will require an incompatible redefinition of the whole channel driver structure and of the driver functions. ~~It should also be noted that usage of ~~'''~~Tcl_Obj~~'''~~ects for the information~~ storage binds the information to a single thread. I.e. a transfer across thread boundaries is not possible. This however is not required here and thus no limitation. The four functions have been made public as I can imagine that even C level drivers might wish to use this facility to generate more explicit and readable error messages than is provided through POSIX error codes and the errno API. ~~The information talked about in the API specifications above is ~~'''~~not~~'''~~ a~~ plain string, but has to be a list of uneven length. The last element will be interpreted as the actual error message in question, and the preceding elements are considered as option/value pairs containing additional ~~information about the error, like the ''errorCode'', etc. I.e. they are an~~ extensible dictionary containing the details of the error beyond the basic message. ~~As a ~~'''~~safety precaution~~'''~~ any ''-level'' specification submitted by the driver and a non-zero value will be rewritten to a value of ~~''0''~~ to prevent~~ the driver from being able to force the user application into the execution of arbitrary multi-level returns, i.e. from arbitrarily changing the control-flow ~~of the application itself. Analogously any ''-code'' specification with a non-zero value which is not ''error'' is rewritten to value ~~''1''~~ (i.e. ''error'').~~ ~~Below a list of driver functions, and which of the ~~''Tcl~~_SetChannelError'''~~ functions they are allowed to use. ~~'''~~Tcl_DriverCloseProc~~'''~~ ~~> May use ~~''Tcl~~_SetChannelErrorInterp'', and only this function.~~ * ~~'''~~Tcl_DriverInputProc~~'''~~ ~~> May use ~~''Tcl~~_SetChannelError'', and only this function.~~ * ~~'''~~Tcl_DriverOutputProc~~'''~~ ~~> May use ~~''Tcl~~_SetChannelError'', and only this function.~~ * ~~'''~~Tcl_DriverSeekProc~~'''~~, and ~~'''~~Tcl_DriverWideSeekProc~~'''~~ ~~> May use ~~''Tcl~~_SetChannelError'', and only this function.~~ * ~~'''~~Tcl_DriverSetOptionProc~~'''~~ ~~> Has already the ability to pass arbitrary error messages. Must ~~'''~~not~~'''~~~~ use any of the new functions. * ~~'''~~Tcl_DriverGetOptionProc~~'''~~ ~~> Has already the ability to pass arbitrary error messages. Must ~~'''~~not~~'''~~~~ use any of the new functions. * ~~'''~~Tcl_DriverWatchProc~~'''~~ ~~> Must ~~'''~~not~~'''~~ use any of the new functions. Is internally called and has~~ no ability to return any type of error whatsoever. * ~~'''~~Tcl_DriverBlockModeProc~~'''~~ ~~> May use ~~''Tcl~~_SetChannelError'', and only this function.~~ * ~~'''~~Tcl_DriverGetHandleProc~~'''~~ ~~> Must ~~'''~~not~~'''~~ use any of the new functions. It is only a low-level~~ function, and not used by Tcl commands. * ~~'''~~Tcl_DriverHandlerProc~~'''~~ ~~> Must ~~'''~~not~~'''~~ use any of the new functions. Is internally called and has~~ no ability to return any type of error whatsoever. Given the information above the following public functions of the Tcl C API are affected by these changes. I.e. when these functions are called the channel may now contain a stored arbitrary error message requiring processing by the caller. * ~~'''~~Tcl_StackChannel~~'''~~ * ~~'''~~Tcl_Seek~~'''~~ * ~~'''~~Tcl_Tell~~'''~~ * ~~'''~~Tcl_ReadRaw~~'''~~ * ~~'''~~Tcl_Read~~'''~~ * ~~'''~~Tcl_ReadChars~~'''~~ * ~~'''~~Tcl_Gets~~'''~~ * ~~'''~~Tcl_GetsObj~~'''~~ * ~~'''~~Tcl_Flush~~'''~~ * ~~'''~~Tcl_WriteRaw~~'''~~ * ~~'''~~Tcl_WriteObj~~'''~~ * ~~'''~~Tcl_Write~~'''~~ * ~~'''~~Tcl_WriteChars~~'''~~ All other API functions are unchanged. Especially the functions below leave all their error information in the interpreter result. * ~~'''~~Tcl_Close~~'''~~ * ~~'''~~Tcl_UnregisterChannel~~'''~~ * ~~'''~~Tcl_UnstackChannel~~'''~~ A previous revision of this TIP specified only two functions, storing the data only in channels. This however proved to be inadequate. It allows the transfer ~~of messages for most driver functions, but not ''close''. Storing an error~~ message in the channel structure which is destroyed is not helpful. So we need the functions for storing data in interpreters. Conversely, providing only two functions storing the information in an interpreter, is inadequate as well. The circumstances for that to happen are actually very limited, but they can happen. First, most driver functions are not given an interpreter reference when called, and actually do not know which interpreter caused their invocation. The only remedy we have is that the channel structure has to have an interpreter reference to the interpreter of the command handler, for the calls into the Tcl level. This could be used in most circumstances, except when threads are enabled and the channel was transfered out of the thread containing that interpreter. We are not allowed to use this interpreter from the channel thread, and again have no other reference available. So for this the code/message pair has to be stored in a channel as the sole place available. A previous revision of this TIP not only stored an error message, but also a result code in the channel or interpreter, and used it as the return code of the Tcl command which invoked the driver function returning the exception. This feature has been discarded as a possible security hazard. It would allow ~~a malicious Tcl driver to cause ''break'' and ''continue'' exceptions at~~ arbitrary locations in the overall application, controlling its behaviour as it sees fit. I wish to thank Joe English and Vince Darley for their input with regard to the limitations of error propagation in the I/O core and possible ideas for solving it. Joe's discourse on the problems with the use of POSIX error codes in an earlier revision of this TIP made me realize that I should not use them anywhere in the API for reflected channels and rather concentrate on extending the I/O system to properly receive Tcl error messages. And while I rejected ~~the ~~'''~~TclSetPosixError~~'''~~ function Vince proposed I hopefully kept the spirit~~ of that proposal in my solution as well. The main reason against setting an ~~arbitrary ''posix error string'' was that it invented another way of passing~~ error information around, whereas the specification above is based on the ~~existing Tcl_InterpState and attendant functionality.~~ ~~~~ Interaction with Threads and Other Interpreters.~~ ~~A channel created with the ~~'''~~chan create~~'''~~ command knows the interpreter it~~ was created in and executes its handler command only in that interpreter, even if the channel is shared with and/or has been moved into a different interpreter. This is easy to accomplish, by evaluating the handler command only in the context of the original interpreter. The channel also knows the thread it was created in and executes its handler command only in that thread, even if the channel has been moved into a	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591	give users of the core the freedom to experiment with their own ideas, instead of constraining them to what we managed to envision. Another use for reflected channels was found when creating the reference implementation: As helper for testing the generic I/O system of Tcl, by creating channels which forcibly return errors, bogus data, and the like. # Specification ## Introduction This specification has to address two questions to make the reflection work. * How are the driver functions reflected into the Tcl level? * How are file events generated in the Tcl level communicated back to the C level? This includes routing to the correct channel. ## C Level API Four functions are added to the public C API. See section "_Error Handling_" for their detailed specification. ## Tcl Level API The Tcl Level API consists of two new subcommands added to the ensemble command chan specified by [[208]](208.md). The new subcommands are: * chan create _mode cmdprefix_ > This subcommand creates a new script level channel using the command prefix _cmdprefix_ as its handler. The _cmdprefix_ has to be a list. The API this handler has to provide is specified below, in the section "Command Handler API". The handle of the new channel is returned as the result of the command, and the channel is open. Use the regular close command to remove the channel. > The argument _mode_ specifies if the channel is opened for reading, writing, or both. It is a list containing any of the strings read or write. The list has to have at least one element, as a channel you can neither write to nor read from makes no sense. The handler command for the new channel has to support the chosen mode. An error is thrown if that is not the case. > We have chosen to use _late binding_ of the handler command. See the section "_Early versus Late Binding of the Handler Command_" for more detailed explanations. * chan postevent _channel eventspec_ > This subcommand is for use by command handlers, it notifies the channel represented by _channel_ that the event\(s\) listed in the _eventspec_ have occurred. The argument _eventspec_ is a list containing any of read and write. At least one element is required \(It does not make sense to invoke the command if there are no events to post\). > Note that this subcommand can be used only on channel handles which were created/opened with the subcommand create. Application to channels like files, sockets, etc. is not possible and will cause the generation of an error. > As only the Tcl level of a channel, i.e. its command handler, should post events to it we also restrict the usage of this command to the interpreter the handler command is in. In other words, posting events to a reflected channel from a different interpreter than its implementation is in is not allowed. > Another restriction is that it is not possible to post events the I/O core has not registered interest in. Trying to do so will cause the method to throw an error. See the method watch in section "Command Handler API" as well. ## Command Handler API The Tcl-level handler command for a reflected channel is an ensemble that has to support the following subcommands, as listed below. Note that the term _ensemble_ is used to generically describe all command \(prefixes\) which are able to process subcommands. This TIP is _not_ tied to the recently introduced 'namespace ensemble's. Of the available methods the handler has to support initialize, finalize, and watch, always. The other methods are optional. * _handlerCmd_ initialize _channel mode_ > This is the first call the command handler will receive for the given new _channel_. It is his responsibility to set up any internal data structures it needs to keep track of the channel and its state. > The return value of the method has to be a list containing the names of all methods which are supported by this handler. This implicitly tells the C level the version of the API used by the command handler making a separate version number redundant. Hence our decision to leave such a number out of the API. Any changes to the API will be either the elimination of methods, or the introduction of new ones. An existing method cannot change its signature \(arguments, and result\), a new method has to be introduced for this. All of this implies that this method, initialize, is unchangeable after the TIP has been committed, as it is the entry point through which the C level will determine the API version before it knows anything else. > Any error thrown by the method will abort the creation of the channel and no channel will be created. The thrown error will appear as error thrown by chan create. > Any exception beyond _error_, like _break_, etc. is treated as and converted to an error. > Important - If the creation of the channel was aborted due to failures in initialize then the method finalize will _not_ be called. > This method has no equivalent at the C level. > It was considered to return only the list of optional methods supported by the handler. The chosen method however should make the code in the C layer more regular. Another advantage of this is that it allows the C level to better check if the API it expects is matching the API provided by the handler. > The argument _mode_ tells the handler if the channel was opened for reading, writing, or both. It is a list containing any of the strings read or write. The C-level doing the call will never generate abbreviations of these strings. The list will always contain at least one element, as a channel you can neither write to nor read from makes no sense. > The method has to throw an error if the chosen mode is not supported by the handler command. * _handlerCmd_ finalize _channel_ > The method is called when the channel was closed, and is the last call a handler can receive for the given _channel_. This happens just before the destruction of the C level data structures. Still, the command handler must not access the channel anymore in no way. It is now his responsibility to clean up any internal resources it allocated to this channel. > The return value of the method is ignored. > If the method throws an error the command which caused its invocation \(usually close\) will appear to have thrown this error. > Any exception beyond _error_, like _break_, etc. is treated as and converted to an error. > The equivalent C-level function is _Tcl\_DriverCloseProc_. > This method is not invoked if the creation of the channel was aborted during initialize. * _handlerCmd_ read _channel count_ > This method is _optional_. It is called when the user requests data from a channel. _count_ specifies how many _bytes_ have been requested. If the method is not supported then it is not possible to read from the channel handled by the command. > The return value of the method is taken as the requested data. If the returned data contains more bytes than requested an error will be signaled and later thrown by the command which performed the read \(usually gets or read\). Returning less bytes than requested is acceptable however. > If the method throws an error the command which caused its invocation \(usually gets, or read\) will appear to have thrown this error. > Any exception beyond _error_, like _break_, etc. is treated as and converted to an error. > The equivalent C-level function is _Tcl\_DriverInputProc_. * _handlerCmd_ write _channel data_ > This method is _optional_. It is called when the user writes data to the channel. Note that the _data_ are bytes, not characters \(The underlying Tcl\_ObjType is _ByteArray_\). Any type of transformation \(EOL, encoding\) configured for the channel has already been applied at this point. If the method is not supported then it is not possible to write to the channel handled by the command. > The return value of the method is taken as the number of bytes written by the channel. Anything non-numeric will cause an error to be signaled and later thrown by the command which performed the write. A negative value implies that the write failed. Returning a value greater than the number of bytes given to the handler, or zero, is forbidden and will cause the C level to throw errors. > If the method throws an error the command which caused its invocation \(usually puts\) will appear to have thrown this error. > Any exception beyond _error_, like _break_, etc. is treated as and converted to an error. > The equivalent C-level function is _Tcl\_DriverOutputProc_. * _handlerCmd_ seek _channel offset base_ > This method is _optional_. It is responsible for the handling of seek and tell requests on the channel. If it is not supported then seeking will not be possible for the channel. > _base_ is one of > \* start - Seeking is relative to the beginning of the channel. > \* current - Seeking is relative to the current seek position. > \* end - Seeking is relative to the end of the channel. > The base argument of the builtin seek command takes the same names. > The _offset_ is an integer number specifying the amount of _bytes_ to seek forward or backward. A positive number will seek forward, and a negative number will seek backward. > A channel may provide only limited seeking. For example sockets can seek forward, but not backward. > The return value of the method is taken as the \(new\) location of the channel, counted from the start. This has to be an integer number greater than or equal to zero. > If the method throws an error the command which caused its invocation \(usually seek\) will appear to have thrown this error. > Any exception beyond _error_, like _break_, etc. is treated as and converted to an error. > The offset/base combination of 0/"current" signals a tell request, i.e. seek nothing relative to the current location, making the new location identical to the current one, which is then returned. > The equivalent C-level functions are _Tcl\_DriverSeekProc_, and _Tcl\_DriverWideSeekProc_ \(where possible\). * _handlerCmd_ configure _channel option value_ > This method is _optional_. It is for writing the type specific options. > Per call one option has to be written. > The return value of the method is ignored. > If the method throws an error the command which performed the \(re\)configuration or query \(usually fconfigure\) will appear to have thrown this error. > Any exception beyond _error_, like _break_, etc. is treated as and converted to an error. > The equivalent C-level function is _Tcl\_DriverSetOptionProc_. * _handlerCmd_ cget _channel option_ > This method is _optional_. It is used when reading a single type specific option. If this method is supported then the method cgetall has to be supported as well. > The call has to return the value of the specified option. > If the method throws an error the command which performed the \(re\)configuration or query \(usually fconfigure\) will appear to have thrown this error. > The equivalent C-level function is _Tcl\_DriverGetOptionProc_. * _handlerCmd_ cgetall _channel_ > This method is _optional_. It is used for reading all type specific options. If this method is supported then the method cget has to be supported as well. > It has to return a list of all options and their values. This list has to have an even number of elements. > If the method throws an error the command which performed the \(re\)configuration or query \(usually fconfigure\) will appear to have thrown this error. > Any exception beyond _error_, like _break_, etc. is treated as and converted to an error. > The equivalent C-level function is _Tcl\_DriverGetOptionProc_. * _handlerCmd_ watch _channel eventspec_ > This methods notifies the Tcl level that the specified channel is interesting in the events listed in the _eventspec_. This is a list containing any of read and write. The C-level doing the call will never generate abbreviations of these strings. The empty list is allowed as well and signals that the channel does not wish to be notified of any events. In other words, it has to disable event generation at the Tcl level. > Any return value of the method is ignored. This includes errors thrown by the method, break, continue, and custom return codes. > The equivalent C-level function is _Tcl\_DriverWatchProc_. > This method interacts with chan postevent. Trying to post an event not listed in the last call to this method will cause an error. * _handlerCmd_ blocking _channel mode_ > This method is _optional_. It handles changes to the blocking mode of the channel. The _mode_ is a boolean flag. True means that the channel has to be set to blocking. False means that the channel should be non-blocking. > The return value of the method is ignored. > If the method throws an error the command which caused its invocation \(usually fconfigure\) will appear to have thrown this error. > Any exception beyond _error_, like _break_, etc. is treated as and converted to an error. > The equivalent C-level function is _Tcl\_DriverBlockModeProc_. Notes: * The function _Tcl\_DriverGetHandleProc_ is not supported. There is no equivalent handler method at the Tcl level. * The function _Tcl\_DriverHandlerProc_ is not supported. There is no equivalent handler method at the Tcl level. The function has no relevance to base channels, which we work with here, only for channel transformations. See [[230]](230.md) \('Tcl Channel Transformation Reflection API'\) for more information on the issue. * The function _Tcl\_DriverFlushProc_ is not supported. The reason for this: The current generic I/O layer of Tcl does not use this function at all, nowhere. Therefore support at the Tcl level makes no sense either. We can always extend the API defined here \(and change its version number\) should the function be used at some time in the future. ## Error handling The current I/O core's ability to handle arbitrary Tcl error messages is very limited. _Tcl\_DriverGetOptionProc_ and _Tcl\_DriverSetOptionProc_ are the only driver functions for which this is possible directly. Everywhere else the API is restricted to returning POSIX error codes. This limitation makes the debugging of problems in a channel command handler at least very difficult. As such it is considered not acceptable. It is proposed to solve this problem through the addition of four new functions to Tcl's public stub table. > void Tcl\_SetChannelError\(Tcl\_Channel _chan_, Tcl\_Obj\* _msg_\) > void Tcl\_SetChannelErrorInterp\(Tcl\_Interp\* _ip_, Tcl\_Obj\* _msg_\) > > These functions store error information in a channel or interpreter. Previously stored information will be discarded. They have to be used by channel drivers wishing to pass regular Tcl error information to the generic layer of the I/O core. > > The refCount of _msg_ is unchanged when the functions had to rewrite _msg_ per the safety precautions explained below, as a properly modified copy of _msg_ is stored, and not _msg_ itself. Otherwise the refCount of _msg_ is incremented by one. > void Tcl\_GetChannelError\(Tcl\_Channel _chan_, Tcl\_Obj\\ _msg_\) > void Tcl\_GetChannelErrorInterp\(Tcl\_Interp\* _ip_, Tcl\_Obj\\ _msg_\) > > These function retrieve error information stored in a channel or interpreter O, and also resets O to have no information stored in it. They will return NULL if no information was stored to begin with. > > i.e. After an invocation of Tcl\_GetChannelError\* for a channel/interpreter object O, all following invocations will return NULL for that object, until an intervening invocation of Tcl\_SetChannelError\* again stored information in O. > > The _msg_ argument is not allowed to be NULL. Nor are the _chan_ and _ip_ arguments. > > The refCount of the returned information is not touched. The reference previously held by the channel or interpreter is now held by the caller of the function and it is its responsibility to release that reference when it is done with the object. This solution is not very elegant, but anything else will require an incompatible redefinition of the whole channel driver structure and of the driver functions. It should also be noted that usage of Tcl\_Objects for the information storage binds the information to a single thread. I.e. a transfer across thread boundaries is not possible. This however is not required here and thus no limitation. The four functions have been made public as I can imagine that even C level drivers might wish to use this facility to generate more explicit and readable error messages than is provided through POSIX error codes and the errno API. The information talked about in the API specifications above is not a plain string, but has to be a list of uneven length. The last element will be interpreted as the actual error message in question, and the preceding elements are considered as option/value pairs containing additional information about the error, like the _errorCode_, etc. I.e. they are an extensible dictionary containing the details of the error beyond the basic message. As a safety precaution any _-level_ specification submitted by the driver and a non-zero value will be rewritten to a value of _0_ to prevent the driver from being able to force the user application into the execution of arbitrary multi-level returns, i.e. from arbitrarily changing the control-flow of the application itself. Analogously any _-code_ specification with a non-zero value which is not _error_ is rewritten to value _1_ \(i.e. _error_\). Below a list of driver functions, and which of the _Tcl\_SetChannelError\*** functions they are allowed to use. * Tcl\_DriverCloseProc > May use _Tcl\_SetChannelErrorInterp_, and only this function. * Tcl\_DriverInputProc > May use _Tcl\_SetChannelError_, and only this function. * Tcl\_DriverOutputProc > May use _Tcl\_SetChannelError_, and only this function. * Tcl\_DriverSeekProc, and Tcl\_DriverWideSeekProc > May use _Tcl\_SetChannelError_, and only this function. * Tcl\_DriverSetOptionProc > Has already the ability to pass arbitrary error messages. Must not use any of the new functions. * Tcl\_DriverGetOptionProc > Has already the ability to pass arbitrary error messages. Must not use any of the new functions. * Tcl\_DriverWatchProc > Must not use any of the new functions. Is internally called and has no ability to return any type of error whatsoever. * Tcl\_DriverBlockModeProc > May use _Tcl\_SetChannelError_, and only this function. * Tcl\_DriverGetHandleProc > Must not use any of the new functions. It is only a low-level function, and not used by Tcl commands. * Tcl\_DriverHandlerProc > Must not use any of the new functions. Is internally called and has no ability to return any type of error whatsoever. Given the information above the following public functions of the Tcl C API are affected by these changes. I.e. when these functions are called the channel may now contain a stored arbitrary error message requiring processing by the caller. * Tcl\_StackChannel * Tcl\_Seek * Tcl\_Tell * Tcl\_ReadRaw * Tcl\_Read * Tcl\_ReadChars * Tcl\_Gets * Tcl\_GetsObj * Tcl\_Flush * Tcl\_WriteRaw * Tcl\_WriteObj * Tcl\_Write * Tcl\_WriteChars All other API functions are unchanged. Especially the functions below leave all their error information in the interpreter result. * Tcl\_Close * Tcl\_UnregisterChannel * Tcl\_UnstackChannel A previous revision of this TIP specified only two functions, storing the data only in channels. This however proved to be inadequate. It allows the transfer of messages for most driver functions, but not _close_. Storing an error message in the channel structure which is destroyed is not helpful. So we need the functions for storing data in interpreters. Conversely, providing only two functions storing the information in an interpreter, is inadequate as well. The circumstances for that to happen are actually very limited, but they can happen. First, most driver functions are not given an interpreter reference when called, and actually do not know which interpreter caused their invocation. The only remedy we have is that the channel structure has to have an interpreter reference to the interpreter of the command handler, for the calls into the Tcl level. This could be used in most circumstances, except when threads are enabled and the channel was transfered out of the thread containing that interpreter. We are not allowed to use this interpreter from the channel thread, and again have no other reference available. So for this the code/message pair has to be stored in a channel as the sole place available. A previous revision of this TIP not only stored an error message, but also a result code in the channel or interpreter, and used it as the return code of the Tcl command which invoked the driver function returning the exception. This feature has been discarded as a possible security hazard. It would allow a malicious Tcl driver to cause _break_ and _continue_ exceptions at arbitrary locations in the overall application, controlling its behaviour as it sees fit. I wish to thank Joe English and Vince Darley for their input with regard to the limitations of error propagation in the I/O core and possible ideas for solving it. Joe's discourse on the problems with the use of POSIX error codes in an earlier revision of this TIP made me realize that I should not use them anywhere in the API for reflected channels and rather concentrate on extending the I/O system to properly receive Tcl error messages. And while I rejected the TclSetPosixError function Vince proposed I hopefully kept the spirit of that proposal in my solution as well. The main reason against setting an arbitrary _posix error string_ was that it invented another way of passing error information around, whereas the specification above is based on the existing Tcl\_InterpState and attendant functionality. ## Interaction with Threads and Other Interpreters. A channel created with the chan create command knows the interpreter it was created in and executes its handler command only in that interpreter, even if the channel is shared with and/or has been moved into a different interpreter. This is easy to accomplish, by evaluating the handler command only in the context of the original interpreter. The channel also knows the thread it was created in and executes its handler command only in that thread, even if the channel has been moved into a
︙			︙
600 601 602 603 604 605 606 ~~607 608~~ 609 610 ~~611~~ 612 613 614 ~~615~~ 616 ~~617~~ 618 619 ~~620 621 622~~ 623 ~~624~~ 625 ~~626~~ 627 ~~628~~ 629 ~~630~~ 631 632 633 634 635 ~~636~~ 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 ~~653 654 655~~ 656 657 658 ~~659~~ 660 661 662 663 664 665 666 667 668 669 670 ~~671~~ 672 673 674 675 ~~676~~ 677 ~~678~~ 679 680 681 682 683 684 685 686 ~~687~~ 688 689 690 ~~691~~ 692 693 694 ~~695 696 697 698 699 700 701~~ ~~702 703 704 705 706 707~~ ~~708 709 710 711 712 713 714 715 716 717~~ ~~718 719 720 721 722 723 724 725~~ ~~726 727 728 729 730~~ ~~731~~ 732 733 734 735 736 ~~737~~ 738 739 740 741 742 743 744 ~~745~~ 746 747 ~~748~~ 749 750 751 752 ~~753~~ 754 755 ~~756~~ 757 758 ~~759~~ 760 761 ~~762 763~~ 764 765 ~~766~~ 767 768 ~~769~~ 770 ~~771~~ 772 ~~773~~ 774 ~~775~~ 776 777	original thread, able to process these events. Note that this also allows the creation of a channel whose two endpoints live in two different threads and provide a stream-oriented bridge between these threads. In other words we can provide a way for regular stream communication between threads instead of having to send commands. ~~When a thread or interpreter is deleted all channels created with the ~~'''~~chan create~~'''~~ command using this thread/interpreter as their computing base will~~ be deleted as well, in all interpreters they have been shared with or moved into, and in whatever thread they have been moved to. This pulls the rug out ~~under the other thread(s) and/or interpreter(s), this however cannot be~~ avoided. Trying to use such a channel will cause the generation of the regular error about unknown channel handles. ~~~~ Interaction with Safe Interpreters~~ ~~The new subcommands ~~'''~~create~~'''~~ and ~~'''~~postevent~~'''~~ of ~~'''~~chan~~'''~~ are safe~~ and therefore made accessible to safe interpreters. ~~While ~~'''~~create~~'''~~ arranges for the execution of code this code is always executed within the safe interpreter, even if the channel was moved (See previous section).~~ ~~The subcommand ~~'''~~postevent~~'''~~ can trigger the execution of fileevent~~ handlers, however if they are executed in trusted interpreters then they were ~~registered by these interpreters as well. (Moving channels between threads~~ strips fileevent handlers, and just between interpreters keeps them, and ~~executes them where they were added).~~ ~~~~ Early versus Late Binding of the Handler Command~~ We have two principal methods for using the handler command. These are called early and late binding. Early binding means that the command implementation to use is determined at ~~the time of the creation of the channel, i.e. when ~~'''~~chan create~~'''~~ is~~ executed, before any methods are called. Afterward it cannot change. The result of the command resolution is stored internally and used until the channel is destroyed. Renaming the handler command has no effect. In other words, the system will automatically call the command under the new name. The destruction of the handler command is intercepted and causes the channel to close as well. Late binding means that the handler command is stored internally essentially as a string, and this string is mapped to the implementation to use for each and every call to a method of the handler. Renaming the command, or destroying it means that the next call of a handler method will fail, causing the higher level channel command to fail as well. Depending on the method the error message may not be able to explain the reason of that failure. Another problem with this approach is that the context for the resolution of the command name has to be specified explicitly to avoid problems with relative names. Early binding resolves once, in the context of the ~~'''~~chan create~~'''~~ call. Late binding performs resolution anywhere where channel commands like ~~'''~~puts~~'''~~, ~~'''~~gets~~'''~~, etc. are called, i.e. in a random context. To prevent problems with different commands of the same name in several namespaces it becomes necessary to force the usage of a specific fixed context for the resolution. The only context suitable for such is the global ~~context (per ''uplevel ~~#0''~~, not ''namespace eval ::'').~~ Note that moving a different command into place after renaming the original handler allows the Tcl level to change the implementation dynamically at runtime. This however is not really an advantage over early binding as the early bound command can be written such that it delegates to the actual implementation, and that can then be changed dynamically as well. However, despite all this late binding is so far the method of choice for the implementation of callbacks, be they in Tcl, or Tk; and has been chosen for the reflection as well. ~~~~ Miscellanea~~ The channel reflection API reserves the driver type "tclrchannel" for itself. Usage of this driver type by other channel types is not allowed. ~~~ Examples~~ ~~~~ Driver Implementations~~ A simple way of implementing new types of channels is to use any of the various object systems for Tcl. Create a class for the channel type. Create the new channel in the constructor for new objects and store the channel handle. Make the new object the command handler for the channel. This automatically translates the sub commands for the command handler into object methods. Implement the various methods required. when the object is deleted close the channel, and delete the object when the channel announces that it ~~has been ~~'''~~close~~'''~~d. This part is a bit tricky, flags have to be used to~~ break the potential cycle. Another possibility is to implement the command handler as a regular command, ~~together with a creation command wrapping around ~~'''~~chan create~~'''~~ and a~~ backend which keeps track of all handles created by it and their state, associated data, etc. ~~\| object based example ... \| \| snit::type new_channel { \| constructor {mode args} { \| # Handle args ... \| set chan [chan create $mode $self] ~~\| }~~~~ ~~\| destructor { \| # ... delete internal state ... \| if {$dead} return \| set dead 1 \| close $chan ~~\| }~~~~ \| \| method handle {} {return $chan} \| variable chan \| variable dead 0 \| \| method finalize {dummy} { \| if {$dead} return \| set dead 1 \| $self destroy ~~\| }~~ \| method initialize {dummy mode} {} \| method read {dummy count} {} \| method write {dummy data} {} \| method seek {dummy offset base} {} \| method configure {dummy args} {} \| method watch {dummy events} {} \| method blocking {dummy isblocking} {} ~~\| }~~ ~~\| \| proc newchannel_open {args} { \| return [[new_channel %AUTO% {expand}$args] handle] ~~\| }~~~~ ~~~~ Other Possible Drivers~~ * Memory channel based on a string. Block and/or FIFO oriented. * Null device. Writable, not writable. WOM device. Data sink. * Random data (Writing to it may re-seed the PRNG). * Zero channel. Readable, returns a stream of binary 0s. Not writable. * FIFO channel between different threads. * Optimized virtual filesystem implementations. ~~> Current VFS implementations have to use the package ''memchan'' to provide~~ the channels when a file in them is opened, which necessitates that for all open files all of their data is in memory, possibly even more than once ~~(when several channels are open on the same file). A reflected driver~~ however allows implementations which keep only part of the data in memory. Or nearly none at all if the VFS provides computed information / is based on some data structure. ~~> A more concrete example would be a driver which provides access to files~~ stored in some archive file. Using a reflect driver the archive file can be memory mapped and the driver will then read whatever data is needed when ~~requested. Currently it will have to copy the data into a ''memchan''~~ channel, i.e duplicate it in memory. ~~> Note that of course the internals of the archive file may limit the amount~~ of memory savings we can achieve. If for example the file we wish to access is stored in a compressed form we will have to decompress it in memory at ~~least to the highest location requested so far. And any write operation (if allowed) will have to keep the data in memory until it has been compressed~~ and committed. ~~~ Reference Implementation~~ A reference implementation is provided at SourceForge ~~[http://sourceforge.net/support/tracker.php?aid=1025294].~~ ~~~ Comments~~ ~~''[~~[ Add comments on the document here ]~~]''~~ ~~~ Copyright~~ This document has been placed in the public domain.	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| < > \| \| \| \| \| < > \| \| \| \| \| \| \| \| \| < > \| \| \| \| \| \| \| < > \| \| \| < \| > \| \| \| \| \| \| \| \| \| \| \| \| \| \| >	599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777	original thread, able to process these events. Note that this also allows the creation of a channel whose two endpoints live in two different threads and provide a stream-oriented bridge between these threads. In other words we can provide a way for regular stream communication between threads instead of having to send commands. When a thread or interpreter is deleted all channels created with the chan create command using this thread/interpreter as their computing base will be deleted as well, in all interpreters they have been shared with or moved into, and in whatever thread they have been moved to. This pulls the rug out under the other thread\(s\) and/or interpreter\(s\), this however cannot be avoided. Trying to use such a channel will cause the generation of the regular error about unknown channel handles. ## Interaction with Safe Interpreters The new subcommands create and postevent of chan are safe and therefore made accessible to safe interpreters. While create arranges for the execution of code this code is always executed within the safe interpreter, even if the channel was moved \(See previous section\). The subcommand postevent can trigger the execution of fileevent handlers, however if they are executed in trusted interpreters then they were registered by these interpreters as well. \(Moving channels between threads strips fileevent handlers, and just between interpreters keeps them, and executes them where they were added\). ## Early versus Late Binding of the Handler Command We have two principal methods for using the handler command. These are called early and late binding. Early binding means that the command implementation to use is determined at the time of the creation of the channel, i.e. when chan create is executed, before any methods are called. Afterward it cannot change. The result of the command resolution is stored internally and used until the channel is destroyed. Renaming the handler command has no effect. In other words, the system will automatically call the command under the new name. The destruction of the handler command is intercepted and causes the channel to close as well. Late binding means that the handler command is stored internally essentially as a string, and this string is mapped to the implementation to use for each and every call to a method of the handler. Renaming the command, or destroying it means that the next call of a handler method will fail, causing the higher level channel command to fail as well. Depending on the method the error message may not be able to explain the reason of that failure. Another problem with this approach is that the context for the resolution of the command name has to be specified explicitly to avoid problems with relative names. Early binding resolves once, in the context of the chan create call. Late binding performs resolution anywhere where channel commands like puts, gets, etc. are called, i.e. in a random context. To prevent problems with different commands of the same name in several namespaces it becomes necessary to force the usage of a specific fixed context for the resolution. The only context suitable for such is the global context \(per _uplevel \#0_, not _namespace eval ::_\). Note that moving a different command into place after renaming the original handler allows the Tcl level to change the implementation dynamically at runtime. This however is not really an advantage over early binding as the early bound command can be written such that it delegates to the actual implementation, and that can then be changed dynamically as well. However, despite all this late binding is so far the method of choice for the implementation of callbacks, be they in Tcl, or Tk; and has been chosen for the reflection as well. ## Miscellanea The channel reflection API reserves the driver type "tclrchannel" for itself. Usage of this driver type by other channel types is not allowed. # Examples ## Driver Implementations A simple way of implementing new types of channels is to use any of the various object systems for Tcl. Create a class for the channel type. Create the new channel in the constructor for new objects and store the channel handle. Make the new object the command handler for the channel. This automatically translates the sub commands for the command handler into object methods. Implement the various methods required. when the object is deleted close the channel, and delete the object when the channel announces that it has been closed. This part is a bit tricky, flags have to be used to break the potential cycle. Another possibility is to implement the command handler as a regular command, together with a creation command wrapping around chan create and a backend which keeps track of all handles created by it and their state, associated data, etc. object based example ... snit::type new_channel { constructor {mode args} { # Handle args ... set chan [chan create $mode $self] } destructor { # ... delete internal state ... if {$dead} return set dead 1 close $chan } method handle {} {return $chan} variable chan variable dead 0 method finalize {dummy} { if {$dead} return set dead 1 $self destroy } method initialize {dummy mode} {} method read {dummy count} {} method write {dummy data} {} method seek {dummy offset base} {} method configure {dummy args} {} method watch {dummy events} {} method blocking {dummy isblocking} {} } proc newchannel_open {args} { return [[new_channel %AUTO% {expand}$args] handle] } ## Other Possible Drivers * Memory channel based on a string. Block and/or FIFO oriented. * Null device. Writable, not writable. WOM device. Data sink. * Random data \(Writing to it may re-seed the PRNG\). * Zero channel. Readable, returns a stream of binary 0s. Not writable. * FIFO channel between different threads. * Optimized virtual filesystem implementations. > Current VFS implementations have to use the package _memchan_ to provide the channels when a file in them is opened, which necessitates that for all open files all of their data is in memory, possibly even more than once \(when several channels are open on the same file\). A reflected driver however allows implementations which keep only part of the data in memory. Or nearly none at all if the VFS provides computed information / is based on some data structure. > A more concrete example would be a driver which provides access to files stored in some archive file. Using a reflect driver the archive file can be memory mapped and the driver will then read whatever data is needed when requested. Currently it will have to copy the data into a _memchan_ channel, i.e duplicate it in memory. > Note that of course the internals of the archive file may limit the amount of memory savings we can achieve. If for example the file we wish to access is stored in a compressed form we will have to decompress it in memory at least to the highest location requested so far. And any write operation \(if allowed\) will have to keep the data in memory until it has been compressed and committed. # Reference Implementation A reference implementation is provided at SourceForge <http://sourceforge.net/support/tracker.php?aid=1025294> . # Comments _[ Add comments on the document here ]_ # Copyright This document has been placed in the public domain.

~~1 2 3 4 5 6 7 8 9 10~~ 11 12 13 14 15 16 17 18 19 ~~20 21 22~~ 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53	~~TIP: 247~~ T~~itle~~: Tcl/Tk Engineering Manual State: Draft Type: Informational Vote: Pending Post-History: ~~Version: $Revision: 1.9 $~~ Author: John K. Ousterhout <[email protected]> Author: Donal K. Fellows <[email protected]> Created: 01-Jun-2005 ~~~ Abstract~~ This document describes the set of conventions used for writing C code to go into the Tcl and Tk core. It is also recommended that extensions be written in the same style for clarity. ~~~~NOTE~~ ~~''A transcription of the original version (dated September 1, 1994) of this file into PDF is available online at http://tcl.sourceforge.net/engManual.pdf - Donal K. Fellows''~~ ~~''Also note that the figures might lag the text. We'll fix them eventually.''~~ ~~~ Introduction~~ This is a manual for people who are developing C code for Tcl, Tk, and their extensions and applications. It describes a set of conventions for writing code and the associated test scripts. There are two reasons for the conventions. First, the conventions ensure that certain important things get done; for example, every procedure must have documentation that describes each of its arguments and its result, and there must exist test scripts that exercise every line of code. Second, the conventions guarantee that all of the Tcl and Tk code has a uniform style. This makes it easier for us to use, read, and maintain each other's code. Most of the conventions originated in the Sprite operating system project at U.C. Berkeley. At the beginning of the Sprite project my students and I decided that we wanted a uniform style for our code and documentation, so we held a series of meetings to choose the rules. The result of these meetings ~~was a document called ''The Sprite Engineering Manual''. None of us was~~ completely happy with all the rules, but we all managed to live by them during the project and I think everyone was happy with the results. When I started work on Tcl and Tk, I decided to stick with the Sprite conventions. This ~~document is based heavily on ''The Sprite Engineering Manual''.~~ There are few things that I consider non-negotiable, but the contents of this manual are one of them. I don't claim that these conventions are the best possible ones, but the exact conventions don't really make that much difference. The most important thing is that we all do things the same way. Given that the core Tcl and Tk code follows the conventions, changing the rules now would cause more harm than good.	< \| \| \| \| \| < \| \| \| > \| \| \| \| \| \| \| \| \|	1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52	# TIP 247: Tcl/Tk Engineering Manual State: Draft Type: Informational Vote: Pending Post-History: Author: John K. Ousterhout <[email protected]> Author: Donal K. Fellows <[email protected]> Created: 01-Jun-2005 ----- # Abstract This document describes the set of conventions used for writing C code to go into the Tcl and Tk core. It is also recommended that extensions be written in the same style for clarity. ## NOTE _A transcription of the original version \(dated September 1, 1994\) of this file into PDF is available online at <http://tcl.sourceforge.net/engManual.pdf> - Donal K. Fellows_ _Also note that the figures might lag the text. We'll fix them eventually._ # Introduction This is a manual for people who are developing C code for Tcl, Tk, and their extensions and applications. It describes a set of conventions for writing code and the associated test scripts. There are two reasons for the conventions. First, the conventions ensure that certain important things get done; for example, every procedure must have documentation that describes each of its arguments and its result, and there must exist test scripts that exercise every line of code. Second, the conventions guarantee that all of the Tcl and Tk code has a uniform style. This makes it easier for us to use, read, and maintain each other's code. Most of the conventions originated in the Sprite operating system project at U.C. Berkeley. At the beginning of the Sprite project my students and I decided that we wanted a uniform style for our code and documentation, so we held a series of meetings to choose the rules. The result of these meetings was a document called _The Sprite Engineering Manual_. None of us was completely happy with all the rules, but we all managed to live by them during the project and I think everyone was happy with the results. When I started work on Tcl and Tk, I decided to stick with the Sprite conventions. This document is based heavily on _The Sprite Engineering Manual_. There are few things that I consider non-negotiable, but the contents of this manual are one of them. I don't claim that these conventions are the best possible ones, but the exact conventions don't really make that much difference. The most important thing is that we all do things the same way. Given that the core Tcl and Tk code follows the conventions, changing the rules now would cause more harm than good.
︙			︙
70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 ~~100~~ 101 ~~102 103~~ 104 105 106 107 ~~108 109~~ 110 111 ~~112~~ 113 ~~114~~ 115 116 117 118 119 120 ~~121~~ 122 123 124 125 ~~126~~ 127 128 129 130 131 132 ~~133~~ 134 135 136 137 138 139 ~~140~~ 141 142 143 144 145 ~~146~~ 147 148 149 150 151 152 153 154 155 156 157 158 ~~159~~ 160 ~~161~~ 162 163 164 ~~165~~ 166 167 ~~168~~ 169 ~~170 171 172 173~~ 174 ~~175~~ 176 177 178 179 180 181 182 183 184 185 186 ~~187~~ 188 189 190 191 192 193 194 195 196 197 ~~198~~ 199 200 201 ~~202 203~~ 204 205 ~~206 207~~ 208 209 210 211 ~~212 213 214~~ 215 216 ~~217~~ 218 219 220 221 222 223 224 225 226 227 228 229 ~~230 231 232 233~~ 234 235 ~~236~~ 237 ~~238 239~~ 240 ~~241 242 243 244 245~~ 246 247 248 ~~249~~ 250 251 ~~252~~ 253 ~~254~~ 255 256 257 258 ~~259~~ 260 ~~261~~ 262 ~~263 264~~ 265 266 267 ~~268 269~~ 270 271 272 273 274 275 ~~276~~ 277 ~~278~~ 279 280 281 282 283 284 285 286 287 288 ~~289~~ 290 291 292 293 294 295 ~~296~~ 297 298 299 300 301 302 ~~303~~ 304 305 306 307 308 309 310 311 312 313 314 315 316 ~~317~~ 318 ~~319~~ 320 ~~321~~ 322 323 324 ~~325~~ 326 327 328 329 330 331 332 ~~333 334~~ 335 336 337 338 339 340 341 ~~342~~ 343 344 345 346 347 348 ~~349~~ 350 351 352 353 354 355 356 357 ~~358~~ 359 360 ~~361~~ 362 363 364 365 366 367 368	Section 4 desribes the Tcl/Tk naming conventions. Section 5 presents low-level coding conventions, such as how to indent and where to put curly braces. Section 6 contains a collection of rules and suggestions for writing comments. Section 7 describes how to write and maintain test suites. Section 8 describes how to make code portable without making it unreadable too. Section 9 contains a few miscellaneous topics, such as keeping a change log. ~~~ Packages and Header Files~~ ~~Tcl applications consist of collections of ''packages''. Each package provides~~ code to implement a related set of features. For example, Tcl itself is a package, as is Tk; various extensions such as Tcl-DP, TclX, Expect, and BLT are also packages. Packages are the units in which code is developed and distributed: a single package is typically developed by a single person or group and distributed as a unit. One of the best things about Tcl is that it is possible to combine many independently-developed packages into a single application; packages should be designed with this in mind. This section describes the file structure of packages with an emphasis on header files; later sections discuss conventions for code files. You may also wish to review Chapter 31 of the Tcl book for additional information on packages, such as how to interface them to the rest of an application. ~~~~Package Prefixes~~ ~~Each package has a unique short ''prefix''. The prefix is used in file names,~~ procedure names, and variable names in order to prevent name conflicts with other packages. For example, the prefix for Tcl is tcl; Tcl's exported header file is called tcl.h and exported procedures and variables have names like ~~Tcl_Eval.~~ ~~~~Version Numbers~~ ~~Each package has a two-part version number such as 7.4. The first number (7) is called the major version number and the second (4) is called the minor~~ version number. The version number changes with each public release of the package. If a new release contains only bug fixes, new features, and other upwardly compatible changes, so that code and scripts that worked with the old version will also work with the new version, then the minor version number ~~increments and the major version number stays the same (e.g., from 7.4 to 7.5). If the new release contains substantial incompatibilities, so that~~ existing code and scripts will have to be modified to run with the new version, then the major version number increments and the minor version number ~~resets to zero (e.g., from 7.4 to 8.0).~~ ~~~~Overall Structure~~ A package typically consists of several code files, plus at least two header files, plus additional files for building and configuring the package, such as a Makefile and a configure.in file for the autoconf program. The header files for a package generally fall into the following categories: * A ''package header file'', which is named after the package, such as tcl.h or tk.h. This header file describes all of the externally-visible features of the package, such as procedures, global variables, and structure declarations. The package header file is eventually installed in a system directory such as /usr/local/include; it is what clients of the package ~~#include in their C code. As a general rule of thumb, the package header~~ file should define as few things as possible: it's very hard to change an exported feature since it breaks client code that uses the package, so the less you export, the easier it will be to make changes to the package. Thus, for example, try not to make the internal fields of structures visible in package header files. * An ''internal header file'', which is typically #included by all of the C files in the package. The internal header file has a name like tclInt.h or tkInt.h, consisting of the the package prefix followed by Int.h. The internal header file describes features that are used in multiple files within the package but aren't exported out of the package. For example, key package structures and internal utility procedures are defined in the internal header file. The internal header file should also contain ~~#includes for other headers that are used widely within the package, so~~ they don't have to be included over and over in each code file. As with the package header, the internal header file should be as small as possible: structures and procedures that are only used in a single C file in the package should not appear in it. * A ''porting header file'', which contains definitions that hide the differences between the systems on which the package can be used. The name of the porting header should consist of the package prefix follwed by Port.h, such as tclPort.h. * Other internal header files for various subpackages within the package. For example, there is a file tkText.h in Tk that is shared among all the files that implement text widgets and another file tkCanvas.h that is shared among all the widgets implementing canvases. I recommend having as few header files as possible in each package. In almost all cases a package header file, a single internal header file, and a porting header file will be sufficient, and in many cases the porting header file may ~~not be necessary. The internal header file should automatically #include the~~ package header file and perhaps even the porting header file, so each C file ~~in the package only needs to #include one or at most two header files. I~~ recommend keeping the porting header separate from the internal header file in order to maintain a clean separation between porting code and the rest of the module. Other internal headers should only be necessary in unusual cases, such ~~as the Tk text and canvas widgets (each of tkText.h and tkCanvas.h is many~~ hundred lines long, due to the complexity of the widgets, and they are needed only in the source files that implement the particular widgets, so I thought ~~it would be easier to manage these headers separately from tkInt.h). If you~~ have lots of internal header files, such as one for each source file, then you will end up with lots of #include statements in each C file and you'll find that either ''(a~~)''~~ you #include every header in every C file (in which case there's not much advantage to having the separate .h files) or ''(b~~)''~~ you are constantly adding and deleting #include statements as you modify source files. ~~~~Header File Structure~~ Figure 1 illustrates the format of a header file. Your header files should follow this structure exactly: same indentation, same order of information, and so on. To make this as easy as possible, the directory engManual in the Tcl source tree contains templates for various pieces of source files. For example, the file proto.h contains a template for a header file; there are also templates for code files and procedure headers. You should be able to set up your editor to incorporate the templates when needed, then you can modify them for the particular situation in which they are used. This should make it easy for you to conform to the conventions without a lot of typing overhead. ~~~~#image:247fig1~~ Figure 1. An example of a header file. The file~~ engManual/proto.h contains a template for header files. Each header file contains the following parts, which are labelled in Figure 1: Abstract: the first few lines give the name of the file plus a short description of its overall purpose. Copyright notice: this protects the ownership of the file and controls distribution; different notices may be used on different files, depending on whether the file is to be released freely or restricted. The wording in ~~copyright notices is sensitive (e.g. the use of upper case is important)~~ so don't make changes in notices without checking with a legal authority. Revision string: the contents of this string are managed automatically by the ~~source code control system for the file, such as RCS or SCCS (RCS is used in the example in the figure). It identifies the file's current revision,~~ date of last modification, and so on. ~~Multiple include #ifdef: when a large application is developed with many related packages, it is hard to arrange the #include statements so that~~ each include file is included exactly once For example, files a.h and b.h might both include c.h, and a particular code file might include both a.h and b.h. This will cause c.h to be processed twice, and could potentially result in compiler errors such as multiply-defined symbols. With the recursion #ifdef, plus the matching #endif at the end of the file, the header file can be #included multiple times without problems. The symbol _TCL is defined the first time the header file is included; if the header is included again the presence of the symbol causes the body of the header file to be skipped. The symbol used in any given header file should be the ~~same as the name of the header file except with the .h stripped off, a _~~ prepended, and everything else capitalized. Version defines: for each package, three symbols related to the current version number should be defined. The first gives the full version number as a string, and the second and third give the major and minor numbers separately as integers. The names for these symbols should be derived from the package prefix as in Figure 1. Declarations: the rest of the header file consists of declarations for the things that are exported from the package to its clients. Most of the conventions for coding these declarations will be discussed later. When declaring variables and procedures, use EXTERN instead of extern to declare them external. The symbol EXTERN can then be #defined to either extern or extern "C" to allow the header file to be used in both C and C++ programs. The header file tcl.h contains code to #define the EXTERN symbol; if your header file doesn't #include tcl.h, you can copy the code from tcl.h to your header file. ~~~~_ANSI~~_ARGS~~_ Prototypes~~ ~~Procedure prototypes ~~''may''~~ use the _ANSI~~_ARGS~~_ macro as shown in Figure 1. _ANSI~~_ARGS~~_ makes it possible to write full procedure prototypes for the~~ normal case where an ANSI C compiler will be used, yet it also allows the file to be used with older non-ANSI compilers. To use _ANSI~~_ARGS~~_, specify the entire argument list, including parentheses, as an argument to the _ANSI~~_ARGS~~_ macro; _ANSI~~_ARGS~~_ will evaluate to either this argument list or (), depending on whether or not an ANSI C compiler is being used. The _ANSI~~_ARGS~~_ macro is defined in ''tcl.h''. In the argument lists in procedure prototypes, be sure to specify names for the arguments as well as their types. The names aren't required for ~~compilation (for example, the declaration for Tcl_Eval could have been written~~ as ~~\| EXTERN int Tcl_Eval _ANSI_ARGS_((Tcl_Interp , const char ));~~ ~~in Figure 1) but the names provide additional information about the arguments.~~ Note that for modern code, it is usually preferred to omit this macro, resulting in the above example looking like: ~~\| EXTERN int Tcl_Eval(Tcl_Interp interp, const char scriptPtr);~~ ~~~~MODULE_LOCAL Prototypes~~ ~~''(Not yet shown in any figure, to be used from Tcl 8.5 onwards for the Tcl and Tk core only.~~)''~~~~ Where a function is only exported so that it may be accessed from a file other than the file that declares it, that function should be declared as being ~~MODULE_LOCAL. While this does not have an effect with all toolchains, some (such as the ones used for MS Windows and MacOS X) can use this information~~ during the linking stage to ensure that the symbol in the resulting library cannot be linked against by external code. This is useful for keeping the internal implementation of library code away from casual misuse. Example of usage: ~~\| MODULE_SCOPE int TclIsLocalScalar(const char *src, int len);~~ ~~~How to Organize a Code File~~ Each source code file should contain a related set of procedures, such as the implementation of a widget or canvas item type, or a set of procedures to implement hash tables. Before writing any code you should think carefully about what functions are to be provided and divide them up into files in a logical way. In my experience, the most manageable size for files is usually in the range of 500-2000 lines. If a file gets much larger than this, it will be hard to remember everything that the file does. If a file is much shorter than this, then you may end up with too many files in a directory, which is also hard to manage. Code files are divided into pages separated by formfeed ~~(control-L) characters. The first page of the file is a header page containing~~ information that is used throughout the file. Each additional page of the file contains one procedure. This approach has two advantages. First, when you print a code file each procedure header will start at the top of the page, which makes for easier reading. Second, you can browse through all of the procedures in a file by searching for the formfeed characters. ~~~~The File Header Page~~ The first page of a code file is a header page. It contains overall information that is relevant throughout the file, which consists of everything but the definitions of the file's procedures. The header page typically has six parts, as shown in Figure 2: ~~~~#image:247fig2~~ Figure 2. An example of a header page. Part of the text of the~~ copyright notice has been omitted. The file engManual/proto.c contains a template for a header page. Abstract: the first few lines give the name of the file and a brief description of the overall functions provided by the file, just as in header files. Copyright notice: protects ownership of the file, just as in header files. Revision string: similar to the revision strings in header files, except that its value is used to initialize a string variable. This allows the revision information to be checked in the executable object file. ~~Include statements: all of the #include statements for the file should appear~~ on the header file just after the version string. In general there should ~~be very few #include statements in a given code file, typically just for~~ the package's internal header file and porting header file. If additional ~~#includes are needed they should appear in the package's internal header~~ file or porting header file. Declarations: any structures used only in this file should be declared on the ~~header page (exported structures must be declared in header files). In~~ addition, if the file defines any static or global variables then they should be declared on the header page. This makes it easy to tell whether or not a file has static variables, which is important if the file is ever used in a multi-threaded environment. Static variables are generally undesirable and should be avoided as much as possible. Prototypes: procedure prototypes for procedures referenced only in this file ~~should appear at the very end of the header page (prototypes for exported procedures must appear in the package header file). Use the _ANSI~~_ARGS~~_~~ macro described in Section 2.5. Please structure your header pages in exactly the order given above and follow the syntax of Figure 2 as closely as possible. The file engManual/proto.c provides a template for a header page. Source files should never contain extern statements. Instead, create header ~~files to hold the extern statements and #include the header files. This makes~~ code files easier to read and makes it easier to manage the extern statements, since they're centralized in .h files instead of spread around dozens of code files. For example, the internal header file for a package has extern statements for all of the procedures that are used by multiple files within the package but aren't exported outside it. ~~~~Procedure Headers~~ Each page after the first one in a file should contain exactly one procedure. The page should begin with a procedure header that gives overall documentation for the procedure, followed by the declaration and body for the procedure. See Figures 3 and 4 for examples. The header should contain everything that a caller of the procedure needs to know in order to use the procedure, and nothing else. It consists of three parts: ~~~~#image:247fig3~~ Figure 3. The header comments and declaration for a procedure.~~ The file engManual/prochead contains a template for this information. ~~~~#image:247fig4~~ Figure 4. The header for a procedure with side effects.~~ Abstract: the first lines in the header give the procedure's name, followed by a brief description of what the procedure does. This should not be a detailed description of how the procedure is implemented, but rather a high-level summary of its overall function. In some cases, such as callback procedures, I recommend also describing the conditions under which the procedure is invoked and who calls the procedure, as in Figure 4.	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367	Section 4 desribes the Tcl/Tk naming conventions. Section 5 presents low-level coding conventions, such as how to indent and where to put curly braces. Section 6 contains a collection of rules and suggestions for writing comments. Section 7 describes how to write and maintain test suites. Section 8 describes how to make code portable without making it unreadable too. Section 9 contains a few miscellaneous topics, such as keeping a change log. # Packages and Header Files Tcl applications consist of collections of _packages_. Each package provides code to implement a related set of features. For example, Tcl itself is a package, as is Tk; various extensions such as Tcl-DP, TclX, Expect, and BLT are also packages. Packages are the units in which code is developed and distributed: a single package is typically developed by a single person or group and distributed as a unit. One of the best things about Tcl is that it is possible to combine many independently-developed packages into a single application; packages should be designed with this in mind. This section describes the file structure of packages with an emphasis on header files; later sections discuss conventions for code files. You may also wish to review Chapter 31 of the Tcl book for additional information on packages, such as how to interface them to the rest of an application. ## Package Prefixes Each package has a unique short _prefix_. The prefix is used in file names, procedure names, and variable names in order to prevent name conflicts with other packages. For example, the prefix for Tcl is tcl; Tcl's exported header file is called tcl.h and exported procedures and variables have names like Tcl\_Eval. ## Version Numbers Each package has a two-part version number such as 7.4. The first number \(7\) is called the major version number and the second \(4\) is called the minor version number. The version number changes with each public release of the package. If a new release contains only bug fixes, new features, and other upwardly compatible changes, so that code and scripts that worked with the old version will also work with the new version, then the minor version number increments and the major version number stays the same \(e.g., from 7.4 to 7.5\). If the new release contains substantial incompatibilities, so that existing code and scripts will have to be modified to run with the new version, then the major version number increments and the minor version number resets to zero \(e.g., from 7.4 to 8.0\). ## Overall Structure A package typically consists of several code files, plus at least two header files, plus additional files for building and configuring the package, such as a Makefile and a configure.in file for the autoconf program. The header files for a package generally fall into the following categories: * A _package header file_, which is named after the package, such as tcl.h or tk.h. This header file describes all of the externally-visible features of the package, such as procedures, global variables, and structure declarations. The package header file is eventually installed in a system directory such as /usr/local/include; it is what clients of the package \#include in their C code. As a general rule of thumb, the package header file should define as few things as possible: it's very hard to change an exported feature since it breaks client code that uses the package, so the less you export, the easier it will be to make changes to the package. Thus, for example, try not to make the internal fields of structures visible in package header files. * An _internal header file_, which is typically \#included by all of the C files in the package. The internal header file has a name like tclInt.h or tkInt.h, consisting of the the package prefix followed by Int.h. The internal header file describes features that are used in multiple files within the package but aren't exported out of the package. For example, key package structures and internal utility procedures are defined in the internal header file. The internal header file should also contain \#includes for other headers that are used widely within the package, so they don't have to be included over and over in each code file. As with the package header, the internal header file should be as small as possible: structures and procedures that are only used in a single C file in the package should not appear in it. * A _porting header file_, which contains definitions that hide the differences between the systems on which the package can be used. The name of the porting header should consist of the package prefix follwed by Port.h, such as tclPort.h. * Other internal header files for various subpackages within the package. For example, there is a file tkText.h in Tk that is shared among all the files that implement text widgets and another file tkCanvas.h that is shared among all the widgets implementing canvases. I recommend having as few header files as possible in each package. In almost all cases a package header file, a single internal header file, and a porting header file will be sufficient, and in many cases the porting header file may not be necessary. The internal header file should automatically \#include the package header file and perhaps even the porting header file, so each C file in the package only needs to \#include one or at most two header files. I recommend keeping the porting header separate from the internal header file in order to maintain a clean separation between porting code and the rest of the module. Other internal headers should only be necessary in unusual cases, such as the Tk text and canvas widgets \(each of tkText.h and tkCanvas.h is many hundred lines long, due to the complexity of the widgets, and they are needed only in the source files that implement the particular widgets, so I thought it would be easier to manage these headers separately from tkInt.h\). If you have lots of internal header files, such as one for each source file, then you will end up with lots of \#include statements in each C file and you'll find that either _\(a\)_ you \#include every header in every C file \(in which case there's not much advantage to having the separate .h files\) or _\(b\)_ you are constantly adding and deleting \#include statements as you modify source files. ## Header File Structure Figure 1 illustrates the format of a header file. Your header files should follow this structure exactly: same indentation, same order of information, and so on. To make this as easy as possible, the directory engManual in the Tcl source tree contains templates for various pieces of source files. For example, the file proto.h contains a template for a header file; there are also templates for code files and procedure headers. You should be able to set up your editor to incorporate the templates when needed, then you can modify them for the particular situation in which they are used. This should make it easy for you to conform to the conventions without a lot of typing overhead. ![Figure 1. An example of a header file. The file](../assets/247fig1.png) engManual/proto.h contains a template for header files. Each header file contains the following parts, which are labelled in Figure 1: Abstract: the first few lines give the name of the file plus a short description of its overall purpose. Copyright notice: this protects the ownership of the file and controls distribution; different notices may be used on different files, depending on whether the file is to be released freely or restricted. The wording in copyright notices is sensitive \(e.g. the use of upper case is important\) so don't make changes in notices without checking with a legal authority. Revision string: the contents of this string are managed automatically by the source code control system for the file, such as RCS or SCCS \(RCS is used in the example in the figure\). It identifies the file's current revision, date of last modification, and so on. Multiple include \#ifdef: when a large application is developed with many related packages, it is hard to arrange the \#include statements so that each include file is included exactly once For example, files a.h and b.h might both include c.h, and a particular code file might include both a.h and b.h. This will cause c.h to be processed twice, and could potentially result in compiler errors such as multiply-defined symbols. With the recursion \#ifdef, plus the matching \#endif at the end of the file, the header file can be \#included multiple times without problems. The symbol \_TCL is defined the first time the header file is included; if the header is included again the presence of the symbol causes the body of the header file to be skipped. The symbol used in any given header file should be the same as the name of the header file except with the .h stripped off, a \_ prepended, and everything else capitalized. Version defines: for each package, three symbols related to the current version number should be defined. The first gives the full version number as a string, and the second and third give the major and minor numbers separately as integers. The names for these symbols should be derived from the package prefix as in Figure 1. Declarations: the rest of the header file consists of declarations for the things that are exported from the package to its clients. Most of the conventions for coding these declarations will be discussed later. When declaring variables and procedures, use EXTERN instead of extern to declare them external. The symbol EXTERN can then be \#defined to either extern or extern "C" to allow the header file to be used in both C and C\+\+ programs. The header file tcl.h contains code to \#define the EXTERN symbol; if your header file doesn't \#include tcl.h, you can copy the code from tcl.h to your header file. ## \_ANSI\_ARGS\_ Prototypes Procedure prototypes _may_ use the \_ANSI\_ARGS\_ macro as shown in Figure 1. \_ANSI\_ARGS\_ makes it possible to write full procedure prototypes for the normal case where an ANSI C compiler will be used, yet it also allows the file to be used with older non-ANSI compilers. To use \_ANSI\_ARGS\_, specify the entire argument list, including parentheses, as an argument to the \_ANSI\_ARGS\_ macro; \_ANSI\_ARGS\_ will evaluate to either this argument list or \(\), depending on whether or not an ANSI C compiler is being used. The \_ANSI\_ARGS\_ macro is defined in _tcl.h_. In the argument lists in procedure prototypes, be sure to specify names for the arguments as well as their types. The names aren't required for compilation \(for example, the declaration for Tcl\_Eval could have been written as EXTERN int Tcl_Eval _ANSI_ARGS_((Tcl_Interp , const char )); in Figure 1\) but the names provide additional information about the arguments. Note that for modern code, it is usually preferred to omit this macro, resulting in the above example looking like: EXTERN int Tcl_Eval(Tcl_Interp interp, const char scriptPtr); ## MODULE\_LOCAL Prototypes _\(Not yet shown in any figure, to be used from Tcl 8.5 onwards for the Tcl and Tk core only.\)_ Where a function is only exported so that it may be accessed from a file other than the file that declares it, that function should be declared as being MODULE\_LOCAL. While this does not have an effect with all toolchains, some \(such as the ones used for MS Windows and MacOS X\) can use this information during the linking stage to ensure that the symbol in the resulting library cannot be linked against by external code. This is useful for keeping the internal implementation of library code away from casual misuse. Example of usage: MODULE_SCOPE int TclIsLocalScalar(const char *src, int len); # How to Organize a Code File Each source code file should contain a related set of procedures, such as the implementation of a widget or canvas item type, or a set of procedures to implement hash tables. Before writing any code you should think carefully about what functions are to be provided and divide them up into files in a logical way. In my experience, the most manageable size for files is usually in the range of 500-2000 lines. If a file gets much larger than this, it will be hard to remember everything that the file does. If a file is much shorter than this, then you may end up with too many files in a directory, which is also hard to manage. Code files are divided into pages separated by formfeed \(control-L\) characters. The first page of the file is a header page containing information that is used throughout the file. Each additional page of the file contains one procedure. This approach has two advantages. First, when you print a code file each procedure header will start at the top of the page, which makes for easier reading. Second, you can browse through all of the procedures in a file by searching for the formfeed characters. ## The File Header Page The first page of a code file is a header page. It contains overall information that is relevant throughout the file, which consists of everything but the definitions of the file's procedures. The header page typically has six parts, as shown in Figure 2: ![Figure 2. An example of a header page. Part of the text of the](../assets/247fig2.png) copyright notice has been omitted. The file engManual/proto.c contains a template for a header page. Abstract: the first few lines give the name of the file and a brief description of the overall functions provided by the file, just as in header files. Copyright notice: protects ownership of the file, just as in header files. Revision string: similar to the revision strings in header files, except that its value is used to initialize a string variable. This allows the revision information to be checked in the executable object file. Include statements: all of the \#include statements for the file should appear on the header file just after the version string. In general there should be very few \#include statements in a given code file, typically just for the package's internal header file and porting header file. If additional \#includes are needed they should appear in the package's internal header file or porting header file. Declarations: any structures used only in this file should be declared on the header page \(exported structures must be declared in header files\). In addition, if the file defines any static or global variables then they should be declared on the header page. This makes it easy to tell whether or not a file has static variables, which is important if the file is ever used in a multi-threaded environment. Static variables are generally undesirable and should be avoided as much as possible. Prototypes: procedure prototypes for procedures referenced only in this file should appear at the very end of the header page \(prototypes for exported procedures must appear in the package header file\). Use the \_ANSI\_ARGS\_ macro described in Section 2.5. Please structure your header pages in exactly the order given above and follow the syntax of Figure 2 as closely as possible. The file engManual/proto.c provides a template for a header page. Source files should never contain extern statements. Instead, create header files to hold the extern statements and \#include the header files. This makes code files easier to read and makes it easier to manage the extern statements, since they're centralized in .h files instead of spread around dozens of code files. For example, the internal header file for a package has extern statements for all of the procedures that are used by multiple files within the package but aren't exported outside it. ## Procedure Headers Each page after the first one in a file should contain exactly one procedure. The page should begin with a procedure header that gives overall documentation for the procedure, followed by the declaration and body for the procedure. See Figures 3 and 4 for examples. The header should contain everything that a caller of the procedure needs to know in order to use the procedure, and nothing else. It consists of three parts: ![Figure 3. The header comments and declaration for a procedure.](../assets/247fig3.png) The file engManual/prochead contains a template for this information. ![Figure 4. The header for a procedure with side effects.](../assets/247fig4.png) Abstract: the first lines in the header give the procedure's name, followed by a brief description of what the procedure does. This should not be a detailed description of how the procedure is implemented, but rather a high-level summary of its overall function. In some cases, such as callback procedures, I recommend also describing the conditions under which the procedure is invoked and who calls the procedure, as in Figure 4.
︙			︙
378 379 380 381 382 383 384 ~~385 386~~ 387 ~~388~~ 389 390 ~~391~~ 392 393 394 395 396 ~~397 398~~ 399 400 401 402 403 404 405 406 407 408 409 410 ~~411~~ 412 413 ~~414~~ 415 416 ~~417 418~~ 419 420 421 422 423 424 425 426 427 428 429 430 431 432 ~~433 434~~ 435 436 437 ~~438~~ 439 440 441 442 443 ~~444~~ 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 ~~463~~ 464 465 466 467 468 469 470 ~~471 472~~ 473 ~~474 475~~ 476 477 478 479 ~~480~~ 481 ~~482~~ 483 484 ~~485~~ 486 487 488 ~~489~~ 490 491 ~~492 493~~ 494 495 ~~496~~ 497 498 499 500 501 502 503 504 ~~505 506 507~~ 508 509 510 511 512 ~~513~~ 514 515 516 517 ~~518~~ 519 ~~520 521~~ 522 523 524 ~~525 526~~ 527 ~~528 529 530~~ 531 532 ~~533~~ 534 535 ~~536~~ 537 ~~538~~ 539 540 541 542 ~~543 544 545~~ 546 547 ~~548~~ 549 550 551 552 553 554 ~~555~~ 556 557 558 559 560 561 562 563 564 565 566 567 568 569 ~~570 571~~ 572 573 574 575 576 577 578 ~~579~~ 580 581 582 583 584 585 ~~586~~ 587 ~~588~~ 589 590 591 592 593 594 595 596 ~~597~~ 598 599 600 601 ~~602~~ 603 604 605 606 607 608 609 610 ~~611~~ 612 613 ~~614~~ 615 616 617 618 ~~619~~ 620 621 622 623 624 ~~625~~ 626 627 628 629 630 631 632 ~~633~~ 634 ~~635~~ 636 ~~637~~ 638 639 640 641 ~~642~~ 643 644 645 646 647 648 ~~649 650 651 652~~ 653 ~~654~~ 655 656 657 658 659 660 661 ~~662~~ 663 664 665 666 ~~667~~ 668 669 ~~670 671~~ 672 ~~673~~ 674 ~~675~~ 676 677 678 679 680 ~~681~~ 682 683 684 ~~685 686 687~~ 688 689 690 691 ~~692~~ 693 694 695 ~~696 697 698~~ 699 700 701 702 703 704 ~~705~~ 706 707 708 ~~709 710 711 712 713 714 715 716 717 718~~ ~~719~~ 720 721 722 ~~723 724~~ 725 726 727 728 ~~729~~ 730 731 732 ~~733~~ 734 ~~735~~ 736 ~~737~~ 738 739 ~~740 741 742~~ 743 ~~744 745~~ 746 747 748 ~~749~~ 750 ~~751 752~~ 753 754 ~~755~~ 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 ~~771~~ 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 ~~792~~ 793 ~~794 795 796~~ 797 ~~798 799 800 801~~ 802 803 804 805 806 ~~807 808 809~~ 810 811 812 ~~813 814 815 816 817~~ 818 819 820 821 822 823 ~~824 825 826~~ ~~827 828~~ 829 830 831 832 833 ~~834 835 836~~ ~~837 838 839 840~~ 841 842 843 844 ~~845~~ 846 847 848 849 850 851 852 853 854 855 856 ~~857 858~~ 859 860 861 862 863 864 865 866 867 868 869 870 871 ~~872~~ 873 874 875 876 877 878 879 880 881 ~~882~~ 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 ~~903~~ 904 905 906 907 908 909 910 911 912 ~~913~~ 914 915 916 917 918 919 920 921 922 923 924 925 926 ~~927~~ 928 929 930 931 932 ~~933 934 935 936 937 938~~ 939 940 ~~941~~ 942 943 944 945 ~~946 947 948 949 950 951 952 953 954 955~~ 956 957 958 959 960 961 962 963 964 965 ~~966~~ 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 ~~984~~ 985 986 987 ~~988 989 990 991~~ 992 993 994 995 996 997 998 999 1000 1001 1002 ~~1003~~ 1004 1005 1006 1007 1008 1009 ~~1010~~ 1011 ~~1012~~ 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 ~~1032~~ 1033 1034 1035 1036 1037 1038 1039 1040 ~~1041 1042 1043 1044~~ 1045 ~~1046~~ 1047 ~~1048 1049~~ 1050 1051 1052 1053 1054 1055 ~~1056 1057 1058~~ 1059 ~~1060~~ 1061 ~~1062 1063~~ 1064 1065 ~~1066~~ 1067 ~~1068~~ 1069 ~~1070~~ 1071 1072 1073 1074 1075 1076 ~~1077~~ 1078 ~~1079~~ 1080 ~~1081~~ 1082 ~~1083~~ 1084 ~~1085 1086~~ 1087 1088 ~~1089~~ 1090 ~~1091 1092~~ 1093 ~~1094~~ 1095 ~~1096 1097 1098 1099~~ 1100 1101 1102 1103 1104 ~~1105~~ 1106 1107 1108 ~~1109~~ 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 ~~1120~~ 1121 1122 1123 1124 1125 1126 1127 ~~1128~~ 1129 1130 1131 1132 ~~1133~~ 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 ~~1152~~ 1153 1154 1155 1156 1157 1158 1159 1160 ~~1161~~ 1162 1163 ~~1164~~ 1165 1166 1167 1168 1169 ~~1170~~ 1171 1172 ~~1173 1174 1175 1176 1177~~ 1178 1179 1180 ~~1181~~ 1182 1183 1184 1185 1186 1187 1188 1189 1190 ~~1191 1192~~ 1193 ~~1194 1195~~ 1196 ~~1197~~ 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 ~~1210 1211~~ 1212 1213 1214 1215 1216 1217 1218 ~~1219~~ 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 ~~1236~~ 1237 1238 1239 1240 ~~1241~~ 1242 1243 1244 ~~1245~~ 1246 1247 1248 ~~1249 1250 1251 1252 1253~~ 1254 1255 ~~1256~~ 1257 1258 1259 ~~1260~~ 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 ~~1280~~ 1281 1282 1283 ~~1284~~ 1285 ~~1286~~ 1287 1288 1289 1290 1291 ~~1292~~ 1293 ~~1294~~ 1295 1296 1297 1298 1299 1300 1301 1302 ~~1303 1304 1305 1306 1307 1308 1309 1310 1311 1312~~ 1313 1314 1315 1316 1317 1318 1319 ~~1320~~ 1321 1322 ~~1323~~ 1324 ~~1325~~ 1326 1327	section should not describe every internal variable modified by the procedure. It should simply provide the sort of information that users of the procedure need in order to use the procedure correctly. See Figure 4 for an example. The file engManual/prochead contains a template for a procedure header, which you can include from your editor to save typing. Follow the syntax of Figures ~~3 and 4 exactly (same indentation, double-dash after the procedure name, etc.).~~ ~~The Results and Side Effects parts of the header may be omitted ''only'' if~~ the function has no results or side effects respectively. ~~~~Procedure Declarations~~ The procedure declaration should also follow exactly the syntax in Figures 3 and 4. The first line gives the type of the procedure's result. All procedures must be typed: use void if the procedure returns no result. The second line gives the procedure's name and its argument list. If there are many arguments, ~~they may spill onto additional lines (see Sections 5.1 and 5.5 for information about indentation). After this come the declarations of argument types, one~~ argument per line, indented, with a comment after each argument giving a brief description of the argument. Every argument must be explicitly declared, and every argument must have a comment. This form for argument declarations is the old form that predates ANSI C. It's important to use the old form so that your code will compile on older pre-ANSI compilers. Hopefully there aren't too many of these compilers left, and perhaps in a few years we can switch to the ANSI form, but for now let's be safe. Every procedure should also have an ANSI-style prototype either on the file's header page or in a header file, so this approach still allows full argument checking. ~~''Note that'' for new code it is preferred to use ANSI declarations,~~ especially if the code will not build on non-ANSI compilers. ~~~~Parameter Order~~ Procedure parameters may be divided into three categories. In parameters only ~~pass information into the procedure (either directly or by pointing to information that the procedure reads). Out parameters point to things in the~~ caller's memory that the procedure modifies. In-out parameters do both. Below is a set of rules for deciding on the order of parameters to a procedure: 1. Parameters should normally appear in the order in, in/out, out, except where overridden by the rules below. 2. If there is a group of procedures, all of which operate on structures of a particular type, such as a hash table, the token for the structure should be the first argument to each of the procedures. 3. When two parameters are the address of a callback procedure and a ClientData value to pass to that procedure, the procedure address should appear in the argument list immediately before the ClientData. ~~4. If a callback procedure takes a ClientData argument (and all callbacks should), the ClientData argument should be the first argument to the~~ procedure. Typically the ClientData is a pointer to the structure managed by the callback, so this is really the same as rule 2. ~~~~Procedure Bodies~~ The body of a procedure follows the declaration. See Section 5 for the coding conventions that govern procedure bodies. The curly braces enclosing the body should be on separate lines as shown in Figures 3 and 4. ~~~Naming Conventions~~ Choosing names is one of the most important aspects of programming. Good names clarify the function of a program and reduce the need for other documentation. Poor names result in ambiguity, confusion, and error. For example, in the Sprite operating system we spent four months tracking down a subtle problem with the file system that caused seemingly random blocks on disk to be overwritten from time to time. It turned out that the same variable name was used in some places to refer to physical blocks on disk, and in other places to logical blocks in a file; unfortunately, in one place the variable was accidentally used for the wrong purpose. The bug probably would not have occurred if different variable names had been used for the two kinds of block identifiers. This section gives some general principles to follow when choosing names, then lists specific rules for name syntax, such as capitalization, and finally describes how to use package prefixes to clarify the module structure of your code. ~~~~General Considerations~~ The ideal variable name is one that instantly conveys as much information as possible about the purpose of the variable it refers to. When choosing names, play devil's advocate with yourself to see if there are ways that a name might be misinterpreted or confused. Here are some things to consider: 1. Are you consistent? Use the same name to refer to the same thing ~~everywhere. For example, in the Tcl implementation the name ''interp'' is used consistently for pointers to the uservisible Tcl_Interp structure.~~ Within the code for each widget, a standard name is always used for a ~~pointer to the widget record, such as ''butPtr'' in the button widget code and ''menuPtr'' in the menu widget code.~~ 2. If someone sees the name out of context, will they realize what it stands for, or could they confuse it with something else? For example, in Sprite the procedure for doing byte-swapping and other format conversion was ~~originally called Swap_Buffer. When I first saw that name I assumed it had~~ something to do with I/O buffer management, not reformatting. We ~~subsequently changed the name to Fmt_Convert.~~ 3. Could this name be confused with some other name? For example, it's ~~probably a mistake to have two variables ~~''s''~~ and ''string'' in the same~~ procedure, both referring to strings: it will be hard for anyone to remember which is which. Instead, change the names to reflect their functions. For example, if the strings are used as source and destination ~~for a copy operation, name them ~~''src''~~ and ~~''dst''~~.~~ 4. Is the name so generic that it doesn't convey any information? The ~~variable ~~''s''~~ from the previous paragraph is an example of this; changing its name to ~~''src''~~ makes the name less generic and hence conveys more~~ information. ~~~~Basic Syntax Rules~~ Below are some specific rules governing the syntax of names. Please follow the rules exactly, since they make it possible to determine certain properties of a variable just from its name. 1. Variable names always start with a lower-case letter. Procedure and type names always start with an upper-case letter. ~~\| int counter; \| extern char FindElement(); \| typedef int Boolean;~~ 2. In multi-word names, the first letter of each trailing word is capitalized. Do not use underscores as separators between the words of a name, except as described in rule 5 below and in Section 4.3. ~~\| int numWindows;~~ 3. Any name that refers to a pointer ends in Ptr. If the name refers to a pointer to a pointer, then it ends in PtrPtr, and so on. There are two exceptions to this rule. The first is for variables that are opaque ~~handles for structures, such as variables of type Tk_Window. These~~ variables are actually pointers, but they are never dereferenced outside ~~Tk (clients can never look at the structure they point to except by invoking Tk macros and procedures). In this case the Ptr is omitted in~~ variable names. The second exception to the rule is for strings. We decided in Sprite not to require Ptr suffixes for strings, since they are always referenced with pointers. However, if a variable holds a pointer to ~~a string pointer, then it must have the Ptr suffix (there's just one less level of Ptr for strings than for other structures).~~ ~~\| TkWindow winPtr; \| char name; \| char namePtr;~~ 4. Variables that hold the addresses of functions should have names ending in ~~Proc (for "procedure"). Typedefs for these variables should also have~~ names ending in Proc. ~~\| typedef void (Tk_ImageDeleteProc)(ClientData clientData);~~ ~~5. #defined constants and macros have names that are all capital letters,~~ except for macros that are used as replacements for procedures, in which case you should follow the naming conventions for procedures. If names in all caps contain multiple words, use underscores to separate the words. ~~\| #define NULL 0 \| #define BUFFER_SIZE 1024 \| #define Min(a,b) (((a) < (b)) ? (a) : (b))~~ 6. Names of programs, Tcl commands, and keyword arguments to Tcl commands ~~(such as Tk configuration options) are usually entirely in lower case, in~~ spite of the rules above. The reason for this rule is that these names are likely to typed interactively, and I thought that using all lower case would make it easier to type them. In retrospect I'm not sure this was a good idea; in any case, Tcl procedure and variable names should follow the same rules as C procedures and variables. ~~~~Names Reflect Package Structure~~ Names that are exported outside a single file must include the package prefix in order to make sure that they don't conflict with global names defined in other packages. The following rules define how to use package prefixes in names: 1. If a variable or procedure or type is exported by its package, the first letters of its name must consist of the package prefix followed by an underscore. Only the first letter of the prefix is ever capitalized, and it is subject to the capitalization rules from Section 4.2. The first letter after the prefix is always capitalized. The first example below shows an exported variable, and the second shows an exported type and exported procedure. ~~\| extern int tk_numMainWindows; \| extern Tcl_Interp Tcl_CreateInterp(void);~~ 2. If a module contains several files, and if a name is used in several of those files but isn't used outside the package, then the name must have the package prefix but no underscore. The prefix guarantees that the name won't conflict with a similar name from a different package; the missing underscore indicates that the name is private to the package. ~~\| extern void TkEventDeadWindow(TkWindow winPtr);~~ 3. If a name is only used within a single procedure or file, then it need not have the module prefix. To avoid conflicts with similar names in other files, variables and procedures declared outside procedures must always be declared static if they have no module prefix. ~~\| static int initialized;~~ ~~~~Standard Names~~ The following variable names are used consistently throughout Tcl and Tk. Please use these names for the given purposes in any code you write, and don't use the names for other purposes. clientData: Used for variables of type ClientData, which are associated with callback procedures. ~~interp: Used for variables of type Tcl_Interp. These are the (mostly) opaque~~ handles for interpreters that are given to Tcl clients. These variables should really have a Ptr extension, but the name was chosen at a time when interpreters were totally opaque to clients. ~~iPtr: Used for variables of type Interp , which are pointers to Tcl's~~ internal structures for interpreters. Tcl procedures often have an argument named interp, which is copied into a local variable named iPtr in order to access the contents of the interpreter. nextPtr: A field with this name is used in structures to point to the next structure in a linked list. This is usally the last field of the structure. ~~tkwin: Used for variables of type Tk_Window, which are opaque handles for the~~ window structures managed by Tk. ~~winPtr: Used for variables of type TkWindow , which are pointers to Tk's~~ internal structures for windows. Tk procedures often take an argument named tkwin and immediately copy the argument into a local variable named winPtr in order to access the contents of the window structure. ~~~Low-Level Coding Conventions~~ This section describes several low-level syntactic rules for writing C code. The reason for having these rules is not because they're better than all other ways of structuring code, but in order to make all our code look the same. ~~~~Indents are 4 Spaces~~ Each level of indentation should be four spaces. There are ways to set 4-space indents in all editors that I know of. Be sure that your editor really uses four spaces for the indent, rather than just displaying tabs as four spaces wide; if you use the latter approach then the indents will appear eight spaces wide in other editors. ~~If you use tabs, they ''must'' be to 8-space indents.~~ ~~~~Code Comments Occupy Full Lines~~ ~~Comments that document code (as opposed to declarations) should occupy full~~ lines, rather than being tacked onto the ends of lines containing code. The reason for this is that side-byside comments are hard to see, particularly if neighboring statements are long enough to overlap the side-by-side comments. Comments must have exactly the structure shown in Figure 5, including a ~~leading / line, a trailing / line, and additional blank lines above and~~ below. The leading blank line can be omitted if the comment is at the beginning of a block, as is the case in the second comment in Figure 5. Each comment should be indented to the same level as the surrounding code. Use proper English in comments: write complete sentences, capitalize the first word of each sentence, and so on. ~~#image:247fig5~~ Figure 5. Comments in code have the form shown above, using full lines, with lined-up stars, the / and / symbols on separate lines, and blank separator lines around each comment (except that the leading blank line can be omitted if the comment is at the beginning of a code block). ~~~~Declaration Comments are Side-By-Side~~ When documenting the arguments for procedures and the members of structures, place the comments on the same lines as the declarations. Figures 3 and 4 show comments for procedure arguments and Figure 6 shows a simple structure declaration. The format for comments is the same in both cases. Place the comments to the right of the declarations, with all the left edges of all the comments lined up. When a comment requires more than one line, indent the ~~additional lines to the same level as the first line, with the closing / on~~ the same line as the end of the text. For structure declarations it is usually useful to have a block of comments preceding the declaration, as in Figure 6. This comments before the declaration use the format given in Section 5.2. ~~~~#image:247fig6~~ Figure 6. Use side-by-side comments when declaring structure~~ members and procedure arguments. ~~Declaration comments should normally begin in the 33rd column (i.e. where you would be after 32 spaces or 4 tabs).~~ ~~~~Curly Braces: { Goes at the End of a Line~~ ~~Open curly braces should not (normally) appear on lines by themselves.~~ Instead, they should be placed at the end of the preceding line. Close curly braces always appear as the first non-blank character on a line. Figure 5 shows how to use curly braces in statements such as if and while, and Figure 6 shows how curly braces should be used in structure declarations. If an if statement has an else clause then else appears on the same line as the ~~preceding } and the following {. Close curly braces are indented to the same~~ level as the outer code, i.e., four spaces less than the statements they enclose. The only cases where a { appears on a line by itself are the initial { for the body of a procedure (see Figures 3 and 4) or where a block is being started ''without'' being the body of an if, do, for, while or switch construct. Always use curly braces around compound statements, even if there is only one statement in the block. Thus you shouldn't write code like ~~\| if (filePtr->numLines == 0) return -1;~~ but rather ~~\| if (filePtr->numLines == 0) { \| return -1; ~~\| }~~~~ This approach makes code less dense, but it avoids potential mistakes when adding additional lines to an existing single-statement block. It also makes it easier to set breakpoints in a debugger, since it guarantees that each statement on is on a separate line and can be named individually. ~~There is one exception to the rule about enclosing blocks in {}. For if~~ statements with cascaded else if clauses, you may use a form like the following: \| if (strcmp(argv[1], "delete") == 0) { \| ... \| } else if (strcmp(argv[1], "get") == 0) { \| ... \| } else if (strcmp(argv[1], "set") == 0) { \| ... \| } else { \| ... ~~\| }~~ ~~~~Continuation Lines are Indented 8 Spaces~~ You should use continuation lines to make sure that no single line exceeds 80 characters in length. Continuation lines should be indented 8 spaces so that ~~they won't be confused with an immediately-following nested block (see Figure 7). Pick clean places to break your lines for continuation, so that the~~ continuation doesn't obscure the structure of the statement. For example, if a procedure call requires continuation lines, make sure that each argument is on a single line. If the test for an if or while command spans lines, try to make each line have the same nesting level of parentheses if possible. I try to ~~start each continuation line with an operator such as , &&, or \|\|; this makes~~ it clear that the line is a continuation, since a new statement would never start with such an operator. ~~~~#image:247fig7~~ Figure 7. Continuation lines are indented 8 spaces.~~ ~~~~Avoid Macros Except for Simple Things~~ ~~#define statements provide a fine mechanism for specifying constants~~ symbolically, and you should always use them instead of embedding specific numbers in your code. However, it is generally a bad idea to use macros for complex operations; procedures are almost always better (for example, you can set breakpoints inside procedures but not in the middle of macros). The only time that it is OK to use #define's for complex operations is if the operations are critical to performance and there is no other way to get the ~~performance (have you measured the performance before and after to be sure it matters?).~~ When defining macros, remember always to enclose the arguments in parentheses: ~~\| #define Min(a,b) (((a) < (b)) ? (a) : (b))~~ ~~Otherwise, if the macro is invoked with a complex argument such as ab or small\|\|red it may result in a parse error or, even worse, an unintended result~~ that is difficult to debug. ~~~Documenting Code~~ The purpose of documentation is to save time and reduce errors. Documentation is typically used for two purposes. First, people will read the documentation to find out how to use your code. For example, they will read procedure headers to learn how to call the procedures. Ideally, people should have to learn as little as possible about your code in order to use it correctly. Second, people will read the documentation to find out how your code works internally, so they can fix bugs or add new features; again, good documentation will allow them to make their fixes or enhancements while learning the minimum possible about your code. More documentation isn't necessarily better: wading through pages of documentation may not be any easier than deciphering the code. Try to pick out the most important things that will help people to understand your code and focus on these in your documentation. ~~~~Document Things with Wide Impact~~ The most important things to document are those that affect many different pieces of a program. Thus it is essential that every procedure interface, every structure declaration, and every global variable be documented clearly. If you haven't documented one of these things it will be necessary to look at all the uses of the thing to figure out how it's supposed to work; this will be time-consuming and error-prone. On the other hand, things with only local impact may not need much documentation. For example, in short procedures I don't usually have comments explaining the local variables. If the overall function of the procedure has been explained, and if there isn't much code in the procedure, and if the variables have meaningful names, then it will be easy to figure out how they are used. On the other hand, for long procedures with many variables I usually document the key variables. Similarly, when I write short procedures I don't usually have any comments in the procedure's code: the procedure header provides enough information to figure out what is going on. For long procedures I place a comment block before each major piece of the procedure to clarify the overall flow through the procedure. ~~~~Don't Just Repeat What's in the Code~~ ~~The most common mistake I see in documentation (besides it not being there at all) is that it repeats what is already obvious from the code, such as this trivial (but exasperatingly common) example:~~ ~~\| /* \| * Increment i. \| / \| i += 1;~~ Documentation should provide higher-level information about the overall function of the code, helping readers to understand what a complex collection of statements really means. For example, the comment ~~\| / \| * Probe into the hash table to see if the symbol exists. \| /~~ is likely to be much more helpful than ~~\| / \| * Mask off all but the lower 8 bits of x, then index into table \| * t, then traverse the list looking for a character string \| * identical to s. \| /~~ Everything in this second comment is probably obvious from the code that follows it. Another thing to consider in your comments is word choice. Use different words in the comments than the words that appear in variable or procedure names. For example, the comment \| / \| * VmMapPage -- \| * ~~\| * Map a page. \| * ...~~ which appears in the header for the Sprite procedure VmMapPage, doesn't provide any new information. Everything in the comment is already obvious from the procedure's name. Here is a much more useful comment: \| /* \| * VmMapPage -- \| * ~~\| * Make the given physical page addressable in the kernel's \| * virtual address space. This procedure is used when the \| * kernel needs to access a user's page. \| * ...~~ This comment tells why you might want to use the procedure, in addition to what it does, which makes the comment much more useful. ~~~~Document Each Thing in Exactly One Place~~ Systems evolve over time. If something is documented in several places, it will be hard to keep the documentation up to date as the system changes. Instead, try to document each major design decision in exactly one place, as near as possible to the code that implements the design decision. For example, put the documentation for each structure right next to the declaration for the structure, including the general rules for how the structure is used. You need not explain the fields of the structure again in the code that uses the structure; people can always refer back to the structure declaration for this. The principal documentation for each procedure goes in the procedure header. There's no need to repeat this information again in the body of the procedure ~~(but you might have additional comments in the procedure body to fill in details not described in the procedure header). If a library procedure is~~ documented thoroughly in a manual entry, then I may make the header for the procedure very terse, simply referring to the manual entry. For example, I use this terse form in the headers for all Tcl command procedures, since there is a separate manual entry describing each command. The other side of this coin is that every major design decision needs to be documented at least once. If a design decision is used in many places, it may be hard to pick a central place to document it. Try to find a data structure or key procedure where you can place the main body of comments; then reference this body in the other places where the decision is used. If all else fails, add a block of comments to the header page of one of the files implementing the decision. ~~~~Write Clean Code~~ The best way to produce a well-documented system is to write clean and simple code. This way there won't be much to document. If code is clean, it means that there are a few simple ideas that explain its operation; all you have to do is to document those key ideas. When writing code, ask yourself if there is a simple concept behind the code. If not, perhaps you should rethink the code. If it takes a lot of documentation to explain a piece of code, it is a sign that you haven't found an elegant solution to the problem. ~~~~Document As You Go~~ It is extremely important to write the documentation as you write the code. It's very tempting to put off the documentation until the end; after all, the code will change, so why waste time writing documentation now when you'll have to change it later? The problem is that the end never comes - there is always more code to write. Also, the more undocumented code that you accumulate, the harder it is to work up the energy to document it. So, you just write more undocumented code. I've seen many people start a project fully intending to go back at the end and write all the documentation, but I've never seen anyone actually do it. If you do the documentation as you go, it won't add much to your coding time and you won't have to worry about doing it later. Also, the best time to document code is when the key ideas are fresh in your mind, which is when you're first writing the code. When I write new code, I write all of the header comments for a group of procedures before I fill in any of the bodies of the procedures. This way I can think about the overall structure and how the pieces fit together before getting bogged down in the details of individual procedures. ~~~~Document Tricky Situations~~ If code is non-obvious, meaning that its structure and correctness depend on information that won't be obvious to someone reading it for the first time, be sure to document the non-obvious information. One good indicator of a tricky situation is a bug. If you discover a subtle property of your program while fixing a bug, be sure to add a comment explaining the problem and its solution. Of course, it's even better if you can fix the bug in a way that eliminates the subtle behavior, but this isn't always possible. ~~~Testing~~ One of the environments where Tcl works best is for testing. If all the functionality of an application is available as Tcl commands, you should be able to write Tcl scripts that exercise the application and verify that it behaves correctly. For example, Tcl contains a large suite of tests that exercise nearly all of the Tcl functionality. Whenever you write new code you should write Tcl test scripts to go with that code and save the tests in files so that they can be re-run later. Writing test scripts isn't as tedious as it may sound. If you're developing your code carefully you're already doing a lot of testing; all you need to do is type your test cases into a script file where they can be re-used, rather than typing them interactively where they vanish into the void after they're run. ~~~~Basics~~ Tests should be organized into script files, where each file contains a collection of related tests. Individual tests should be based on the procedure test, just like in the Tcl and Tk test suites. Here are two examples: ~~\| test expr-3.1 {floating-point operators} { \| expr 2.3.6 \| } 1.38 \| test expr-3.2 {floating-point operators} { \| list [catch {expr 2.3/0} msg] $msg \| } {1 {divide by zero}}~~ test is a procedure defined in a script file named defs, which is sourced by ~~each test file. The ~~'''~~test~~'''~~ command takes four arguments: a test~~ identifier, a string describing the test, a test script, and the expected result of the script. test evaluates the script and checks to be sure that it produces the expected result. If not, it prints a message like the following: ~~\| ==== expr-3.1 floating-point operators \| ==== Contents of test case: \| \| expr 2.3.6 \| \| ==== Result was: \| 1.39 \| ---- Result should have been: \| 1.38 \| ---- expr-2.1 FAILED~~ To run a set of tests, you start up the application and source a test file. If all goes well no messages appear; if errors are detected, a message is printed for each one. The test identifier, such as expr-3.1, is printed when errors occur. It can be used to search a test script to locate the source for a failed test. The first part of the identifier, such as expr, should be the same as the name of the test file, except that the test file should have a .test extension, such as expr.test. The two numbers allow you to divide your tests into groups. The ~~tests in a particular group (e.g., all the expr-3.n tests) relate to a single~~ sub-feature, such as a single C procedure or a single option of a Tcl command. The tests should appear in the test file in the same order as their numbers. The test name, such as floating-point operators, is printed when errors occur. It provides human-readable information about the general nature of the test. Before writing tests I suggest that you look over some of the test files for Tcl and Tk to see how they are structured. You may also want to look at the README files in the Tcl and Tk test directories to learn about additional features that provide more verbose output or restrict the set of tests that are run. Although it is possible to automatically generate names for tests, this is not recommended because it makes it difficult to search for the specific test in the test suite if all you have to go on is the test name. ~~~~Organizing Tests~~ Organize your tests to match the code being tested. The best way to do this is to have one test file for each source code file, with the name of the test file derived from the name of the source file in an obvious way (e.g. textWind.test contains tests for the code in tkTextWind.c). Within the test file, have one group of tests for each procedure (for example, all the textWind-2.n tests in textWind.test are for the procedure TkTextWindowCmd). The order of the tests within a group should be the same as the order of the code within the procedure. This approach makes it easy to find the tests for a particular piece of code and add new tests as the code changes. The Tcl test suite was written a long time ago and uses a different style where there is one file for each Tcl command or group of related commands, and the tests are grouped within the file by sub-command or features. In this approach the relationship between tests and particular pieces of code is much less obvious, so it is harder to maintain the tests as the code evolves. I don't recommend using this approach for new tests. ~~~~Coverage~~ When writing tests, you should attempt to exercise every line of source code at least once. There will be occasionally be code that you can't exercise, such as code that exits the application, but situations like this are rare. You may find it hard to exercise some pieces of code because existing Tcl commands don't provide fine enough control to generate all the possible ~~execution paths (for example, at the time I wrote the test suite for Tcl's~~ dynamic string facility there were very few Tcl commands using the facility; ~~some of the procedures were not called at all). In situations like this, write~~ one or more new Tcl commands just for testing purposes. For example, the file tclTest.c in the Tcl source directory contains a command testdstring, which provides a number of options that allow all of the dynamic string code to be exercised. tclTest.c is only included in a special testing version of tclsh, so the testdstring command isn't present in normal Tcl applications. Use a similar approach in your own code, where you have an extra file with additional commands for testing. It's not sufficient just to make sure each line of code is executed by your tests. In addition, your tests must discriminate between code that executes correctly and code that isn't correct. For example, write tests to make sure that the then and else branches of each if statement are taken under the correct conditions. For loops, run different tests to make the loop execute zero times, one time, and two or more times. If a piece of code removes an element from a list, try cases where the element to be removed is the first element, last element, only element, and neither first element nor last. Try to find all the places where different pieces of code interact in unusual ways, and exercise the different possible interactions. ~~~~Memory Allocation~~ Tcl and Tk use a modified memory allocator that checks for several kinds of memory allocation errors, such as freeing a block twice, failing to free a block, or writing past the end of a block. In order to use this allocator, don't call malloc, free, or realloc directly. Call ckalloc instead of malloc, ckfree instead of free, and ckrealloc instead of realloc. These procedures behave identically to malloc, free, and realloc except that they monitor memory usage. Ckalloc, ckfree, and ckrealloc are actually macros that can be configured with a compiler switch: if TCL~~_MEM~~_DEBUG is defined, they perform the checks but run more slowly and use more memory; if TCL~~_MEM~~_DEBUG is not defined, then the macros are just #defined to malloc, free, and realloc so there is no penalty in efficiency. I always run with TCL~~_MEM~~_DEBUG in my development environment and you should too. Official releases typically do not ~~have TCL~~_MEM~~_DEBUG set.~~ ~~If you set TCL~~_MEM~~_DEBUG anywhere in your code then you must set it everywhere (including the Tcl and Tk libraries); the memory allocator will get hopelessly~~ confused if a block of memory is allocated with malloc and freed with ckfree, or allocated with ckalloc and freed with free. There is nothing equivalent to calloc in the debugging memory allocator. If you need a new block to be zeroed, call memset to clear its contents. If you compile with TCL~~_MEM~~_DEBUG, then an additional Tcl command named memory will appear in your application (assuming that you're using the standard Tcl or Tk main program). The memory command has the following options: ~~~~'''~~memory active~~''' ''~~file''~~ ~~> Dumps a list of all allocated blocks (and where they were allocated) to ''file''. Memory leaks can be tracked down by comparing dumps made at~~ different times. ~~~~'''~~memory break~~_on~~_malloc~~''' ''~~number''~~ ~~> Enter the debugger after ''number'' calls to ~~'''~~ckalloc~~'''~~.~~ ~~'''~~memory info~~'''~~ > Prints a report containing the total allocations and frees since Tcl began, the number of blocks currently allocated, the number of bytes currently allocated, and the maximum number of blocks and bytes allocated at any one time. ~~~~'''~~memory init~~''' ''~~onoff''~~ ~~> If ''onoff'' is on, new blocks of memory are initialized with a strange~~ value to help locate uninitialized uses of the block. Any other value for ~~''onoff'' turns initialization off. Initialization is on by default.~~ ~~~~'''~~memory trace~~''' ''~~onoff''~~ ~~> If ''onoff'' is on, one line will be printed to stderr for each call to ~~'''~~ckalloc~~'''~~. Any other value for ''onoff'' turns tracing off. Tracing is~~ off by default. ~~~~'''~~memory trace_on~~_at~~_malloc~~''' ''~~number''~~ ~~> Arranges for tracing to be turned on after ''number'' calls to ~~'''~~ckalloc~~'''~~.~~ ~~~~'''~~memory validate~~''' ''~~onoff''~~ > If ''onoff'' is on, guard zones around every allocated block are checked on every call to ~~'''~~ckalloc~~'''~~ or ~~'''~~ckfree~~'''~~ in order to detect memory overruns as soon as possible. If ''onoff'' is anything other than on, checks are made only during ~~'''~~ckfree~~'''~~ calls and only for the block being freed. Memory validation has a very large performance impact, so it is off by default. The debugging memory allocator is inferior in many ways to commercial products like Purify, so its worth using one of the commercial products if possible. ~~Even so, please use ~~'''~~ckalloc~~'''~~ and ~~'''~~ckfree~~'''~~ everywhere in your code, so~~ that other people without access to the commercial checkers can still use the Tcl debugging allocator. ~~~~Fixing Bugs~~ Whenever you find a bug in your code it means that the test suite wasn't complete. As part of fixing the bug, you should add new tests that detect the presence of the bug. I recommend writing the tests after you've located the bug but before you fix it. That way you can verify that the bug happens before you implement the fix and goes away afterwards, so you'll know you've really fixed something. Use bugs to refine your testing approach: think about what you might be able to do differently when you write tests in the future to keep bugs like this one from going undetected. ~~~~Tricky Features~~ I also use tests as a way of illustrating the need for tricky code. If a piece of code has an unusual structure, and particularly if the code is hard to explain, I try to write additional tests that will fail if the code is implemented in the obvious manner instead of using the tricky approach. This way, if someone comes along later, doesn't understand the documentation for the code, decides the complex structure is unnecessary, and changes the code ~~back to the simple (but incorrect) form, the test will fail and the person~~ will be able to use the test to understand why the code needs to be the way it is. Illustrative tests are not a substitute for good documentation, but they provide a useful addition. ~~~~Test Independence~~ Try to make tests independent of each other, so that each test can be understood in isolation. For example, one test shouldn't depend on commands executed in a previous test. This is important because the test suite allows tests to be run selectively: if the tests depend on each other, then false errors will be reported when someone runs a few of the tests without the others. For convenience, you may execute a few statements in the test file to set up a test configuration and then run several tests based on that configuration. If you do this, put the setup code outside the calls to the test procedure so it will always run even if the individual tests aren't run. I suggest keeping a very simple structure consisting of setup followed by a group of tests. Don't perform some setup, run a few tests, modify the setup slightly, run a few more tests, modify the setup again, and so on. If you do this, it will be hard for people to figure out what the setup is at any given point and when they add tests later they are likely to break the setup. ~~~Porting Issues~~ The X Window System, ANSI C, and POSIX provide a standard set of interfaces that make it possible to write highly portable code. However, some additional work will still be needed if code is to port among all of the UNIX platforms. As Tcl and Tk move from the UNIX world onto PCs and Macintoshes, porting issues will become even more important. This section contains a few tips on how to write code that can run on many different platforms. ~~~~Stick to Standards~~ The easiest way to make your code portable is to use only library interfaces ~~that are available everywhere (or nearly everywhere). For example, the ANSI C~~ library procedures, POSIX system calls, and Xlib windowing calls are available on many platforms; if you code to these standards your packages will be quite portable. Avoid using system-specific library procedures, since they will introduce porting problems. ~~~~Minimize #ifdefs~~ Although there will be situations where you have to do things differently on different machines, #ifdefs are rarely the best way to deal with these problems. If you load up your code with #ifdef statements based on various machines and operating systems, the code will turn into spaghetti. #ifdefs make code unreadable: it is hard to look at #ifdef-ed code and figure out exactly what will happen on any one machine. Furthermore, #ifdefs encourage a style where lots of machine dependencies creep all through the code; it is much better to isolate machine dependencies in a few well-defined places. ~~Thus you should almost never use #ifdefs. Instead, think carefully about the~~ ways in which systems differ and define procedural interfaces to the machine-dependent code. Then provide a different implementation of the machine-dependent procedures for each machine. When linking, choose the version appropriate for the current machine. This way all of the machine dependencies for a particular system are located in one or a few files that are totally separate from the machine-dependent code for other systems and from the main body of your code. The only "conditional" code left will be the code that selects which version to link with. ~~You won't be able to eliminate #ifdefs completely, but please avoid them as much as possible. If you end up with code that has a lot of #ifdefs, this~~ should be a warning to you that something is wrong. See if you can find a way ~~to re-organize the code (perhaps using the techniques described later in this section) to reduce the number of #ifdefs.~~ ~~~~Organize by Feature, Not by System~~ Don't think about porting issues in terms of specific systems. Instead, think in terms of specific features that are present or absent in the systems. For example, don't divide your code up according to what is needed in HP-UX versus Solaris versus Windows. Instead, consider what features are present in the different systems; for example, some systems have a waitpid procedure, while others don't yet provide one, and some systems have ANSI C compilers that support procedure prototypes, while some systems do not. The feature-based approach has a number of advantages over the system-based approach. First, many systems have features in common, so you can share feature-based porting code among different systems. Second, if you think in ~~terms of features then you can consider each feature separately ("what do I do if there is no waitpid?"); this replaces one large problem with several~~ smaller problems that can be dealt with individually. Lastly, the autoconf program can be used to check for the presence or absence of particular features and configure your code automatically. Once you've gotten your code running on several different systems, you'll find that many new systems can be handled with no additional work: their features are similar to those in systems you've already considered, so autoconf can handle them automatically. ~~~~Use Emulation~~ One of the cleanest ways to handle porting problems is with emulation: assume the existence of certain procedures, such as those in the POSIX standard, and if they don't exist on a given system then write procedures to emulate the desired functionality with the facilities that are present on the system. For example, when Tcl first started being used widely I discovered that many systems did not support the waitpid kernel call, even though it was part of the POSIX standard. So, I wrote a waitpid procedure myself, which emulated the functionality of waitpid using the wait and wait3 kernel calls. The best way to emulate waitpid was with wait3, but unfortunately wait3 wasn't available everywhere either, so the emulation worked differently on systems that had wait3 and those that supported only wait. The autoconf program checks to see which of the kernel calls are available, includes the emulation for waitpid if it isn't available, and sets a compiler flag that indicates to the emulation code whether or not wait3 is available. ~~You can also emulate using #defines in a header file. For example, not all~~ systems support symbolic links, and those that don't support symbolic links don't support the lstat kernel call either. For these systems Tcl uses stat to emulate lstat with the following statement in tclUnix.h: ~~\| #define lstat stat~~ If a header file is missing on a particular system, write your own version of the header file to supply the definitions needed by your code. Then you can ~~#include your version in your code if the system doesn't have a version of its~~ own. For example, here is the code in tclUnix.h that handles unistd.h, which isn't yet available on all UNIX systems: ~~\| #ifdef HAVE_UNISTD_H \| #include <unistd.h> \| #else \| #include "compat/unistd.h" \| #endif~~ The configure script generated by autoconf checks for the existence of ~~unistd.h in the system include directories and sets HAVE_UNISTD_H if it is~~ present. If it isn't present, tclUnix.h includes a version from the Tcl source tree. ~~~~Use Autoconf~~ The GNU autoconf program provides a powerful way to configure your code for different systems. With autoconf you write a script called configure.in that describes the porting issues for your software in terms of particular features that are needed and what to do if they aren't present. Before creating a release of your software you run autoconf, which processes configure.in and generates a shell script called configure. You then include configure with your distribution. When it is time to install the distribution on a particular system, the installer runs the configure script. configure pokes around in the system to find out what features are present, then it modifies the Makefile accordingly. The modifications typically consist of compiling additional files to substitute for missing procedures, or setting compiler flags that can be used for conditional compilation in the code. Use of libtool is not recommended; it tends to inhibit porting to anything other than fairly conventional UNIX platforms. ~~~~Porting Header File~~ In spite of all the above advice, you will still end up needing some conditional compilation, for example to include alternate header files where ~~standard ones are missing or to #define symbols that aren't defined on the~~ system. Put all of this code in the porting header file for the package, then ~~#include this header file in each of the source files of the package. With~~ this approach you only need to change a single place if you have to modify your approach to portability, and you can see all of the porting issues in one place. You can look at tclPort.h and tkPort.h for examples of porting header files. ~~~Miscellaneous~~ ~~~~Changes Files~~ Each package should contain a file named changes that keeps a log of all significant changes made to the package. The changes file provides a way for users to find out what's new in each new release, what bugs have been fixed, and what compatibility problems might be introduced by the new release. The changes file should be in chronological order. Just add short blurbs to it each time you make a change. Here is a sample from the Tk changes file: \| 5/19/94 (bug fix) Canvases didn't generate proper Postscript for \| stippled text. \| \| 5/20/94 (new feature) Added "bell" command to ring the display's bell. \| \| 5/26/94 (feature removed) Removed support for "fill" justify mode \| from Tk_GetJustify and from the TK_CONFIG_JUSTIFY configuration \| option. None of the built-in widgets ever supported this mode \| anyway. \| * POTENTIAL INCOMPATIBILITY * The entries in the changes file can be relatively terse; once someone finds a change that is relevant, they can always go to the manual entries or code to find out more about it. Be sure to highlight changes that cause compatibility problems, so people can scan the changes file quickly to locate the incompatibilities. ~~''(The Tcl and Tk core additionally uses a ChangeLog file that has a much~~ higher detail within it. This has the advantage of having more tooling support, but tends to be so verbose that the shorter summaries in the changes ~~file are still written up by the core maintainers before each release.~~)''~~~~ ~~~ Copyright~~ This document has been placed in the public domain.	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| < > \| \| \| \| \| \| \| \| \| < \| > \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| < > \| \| \| \| < > \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| >	377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250 1251 1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 1280 1281 1282 1283 1284 1285 1286 1287 1288 1289 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 1310 1311 1312 1313 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327	section should not describe every internal variable modified by the procedure. It should simply provide the sort of information that users of the procedure need in order to use the procedure correctly. See Figure 4 for an example. The file engManual/prochead contains a template for a procedure header, which you can include from your editor to save typing. Follow the syntax of Figures 3 and 4 exactly \(same indentation, double-dash after the procedure name, etc.\). The Results and Side Effects parts of the header may be omitted _only_ if the function has no results or side effects respectively. ## Procedure Declarations The procedure declaration should also follow exactly the syntax in Figures 3 and 4. The first line gives the type of the procedure's result. All procedures must be typed: use void if the procedure returns no result. The second line gives the procedure's name and its argument list. If there are many arguments, they may spill onto additional lines \(see Sections 5.1 and 5.5 for information about indentation\). After this come the declarations of argument types, one argument per line, indented, with a comment after each argument giving a brief description of the argument. Every argument must be explicitly declared, and every argument must have a comment. This form for argument declarations is the old form that predates ANSI C. It's important to use the old form so that your code will compile on older pre-ANSI compilers. Hopefully there aren't too many of these compilers left, and perhaps in a few years we can switch to the ANSI form, but for now let's be safe. Every procedure should also have an ANSI-style prototype either on the file's header page or in a header file, so this approach still allows full argument checking. _Note that_ for new code it is preferred to use ANSI declarations, especially if the code will not build on non-ANSI compilers. ## Parameter Order Procedure parameters may be divided into three categories. In parameters only pass information into the procedure \(either directly or by pointing to information that the procedure reads\). Out parameters point to things in the caller's memory that the procedure modifies. In-out parameters do both. Below is a set of rules for deciding on the order of parameters to a procedure: 1. Parameters should normally appear in the order in, in/out, out, except where overridden by the rules below. 2. If there is a group of procedures, all of which operate on structures of a particular type, such as a hash table, the token for the structure should be the first argument to each of the procedures. 3. When two parameters are the address of a callback procedure and a ClientData value to pass to that procedure, the procedure address should appear in the argument list immediately before the ClientData. 4. If a callback procedure takes a ClientData argument \(and all callbacks should\), the ClientData argument should be the first argument to the procedure. Typically the ClientData is a pointer to the structure managed by the callback, so this is really the same as rule 2. ## Procedure Bodies The body of a procedure follows the declaration. See Section 5 for the coding conventions that govern procedure bodies. The curly braces enclosing the body should be on separate lines as shown in Figures 3 and 4. # Naming Conventions Choosing names is one of the most important aspects of programming. Good names clarify the function of a program and reduce the need for other documentation. Poor names result in ambiguity, confusion, and error. For example, in the Sprite operating system we spent four months tracking down a subtle problem with the file system that caused seemingly random blocks on disk to be overwritten from time to time. It turned out that the same variable name was used in some places to refer to physical blocks on disk, and in other places to logical blocks in a file; unfortunately, in one place the variable was accidentally used for the wrong purpose. The bug probably would not have occurred if different variable names had been used for the two kinds of block identifiers. This section gives some general principles to follow when choosing names, then lists specific rules for name syntax, such as capitalization, and finally describes how to use package prefixes to clarify the module structure of your code. ## General Considerations The ideal variable name is one that instantly conveys as much information as possible about the purpose of the variable it refers to. When choosing names, play devil's advocate with yourself to see if there are ways that a name might be misinterpreted or confused. Here are some things to consider: 1. Are you consistent? Use the same name to refer to the same thing everywhere. For example, in the Tcl implementation the name _interp_ is used consistently for pointers to the uservisible Tcl\_Interp structure. Within the code for each widget, a standard name is always used for a pointer to the widget record, such as _butPtr_ in the button widget code and _menuPtr_ in the menu widget code. 2. If someone sees the name out of context, will they realize what it stands for, or could they confuse it with something else? For example, in Sprite the procedure for doing byte-swapping and other format conversion was originally called Swap\_Buffer. When I first saw that name I assumed it had something to do with I/O buffer management, not reformatting. We subsequently changed the name to Fmt\_Convert. 3. Could this name be confused with some other name? For example, it's probably a mistake to have two variables _s_ and _string_ in the same procedure, both referring to strings: it will be hard for anyone to remember which is which. Instead, change the names to reflect their functions. For example, if the strings are used as source and destination for a copy operation, name them _src_ and _dst_. 4. Is the name so generic that it doesn't convey any information? The variable _s_ from the previous paragraph is an example of this; changing its name to _src_ makes the name less generic and hence conveys more information. ## Basic Syntax Rules Below are some specific rules governing the syntax of names. Please follow the rules exactly, since they make it possible to determine certain properties of a variable just from its name. 1. Variable names always start with a lower-case letter. Procedure and type names always start with an upper-case letter. int counter; extern char FindElement(); typedef int Boolean; 2. In multi-word names, the first letter of each trailing word is capitalized. Do not use underscores as separators between the words of a name, except as described in rule 5 below and in Section 4.3. int numWindows; 3. Any name that refers to a pointer ends in Ptr. If the name refers to a pointer to a pointer, then it ends in PtrPtr, and so on. There are two exceptions to this rule. The first is for variables that are opaque handles for structures, such as variables of type Tk\_Window. These variables are actually pointers, but they are never dereferenced outside Tk \(clients can never look at the structure they point to except by invoking Tk macros and procedures\). In this case the Ptr is omitted in variable names. The second exception to the rule is for strings. We decided in Sprite not to require Ptr suffixes for strings, since they are always referenced with pointers. However, if a variable holds a pointer to a string pointer, then it must have the Ptr suffix \(there's just one less level of Ptr for strings than for other structures\). TkWindow winPtr; char name; char namePtr; 4. Variables that hold the addresses of functions should have names ending in Proc \(for "procedure"\). Typedefs for these variables should also have names ending in Proc. typedef void (Tk_ImageDeleteProc)(ClientData clientData); 5. \#defined constants and macros have names that are all capital letters, except for macros that are used as replacements for procedures, in which case you should follow the naming conventions for procedures. If names in all caps contain multiple words, use underscores to separate the words. #define NULL 0 #define BUFFER_SIZE 1024 #define Min(a,b) (((a) < (b)) ? (a) : (b)) 6. Names of programs, Tcl commands, and keyword arguments to Tcl commands \(such as Tk configuration options\) are usually entirely in lower case, in spite of the rules above. The reason for this rule is that these names are likely to typed interactively, and I thought that using all lower case would make it easier to type them. In retrospect I'm not sure this was a good idea; in any case, Tcl procedure and variable names should follow the same rules as C procedures and variables. ## Names Reflect Package Structure Names that are exported outside a single file must include the package prefix in order to make sure that they don't conflict with global names defined in other packages. The following rules define how to use package prefixes in names: 1. If a variable or procedure or type is exported by its package, the first letters of its name must consist of the package prefix followed by an underscore. Only the first letter of the prefix is ever capitalized, and it is subject to the capitalization rules from Section 4.2. The first letter after the prefix is always capitalized. The first example below shows an exported variable, and the second shows an exported type and exported procedure. extern int tk_numMainWindows; extern Tcl_Interp Tcl_CreateInterp(void); 2. If a module contains several files, and if a name is used in several of those files but isn't used outside the package, then the name must have the package prefix but no underscore. The prefix guarantees that the name won't conflict with a similar name from a different package; the missing underscore indicates that the name is private to the package. extern void TkEventDeadWindow(TkWindow winPtr); 3. If a name is only used within a single procedure or file, then it need not have the module prefix. To avoid conflicts with similar names in other files, variables and procedures declared outside procedures must always be declared static if they have no module prefix. static int initialized; ## Standard Names The following variable names are used consistently throughout Tcl and Tk. Please use these names for the given purposes in any code you write, and don't use the names for other purposes. clientData: Used for variables of type ClientData, which are associated with callback procedures. interp: Used for variables of type Tcl\_Interp. These are the \(mostly\) opaque handles for interpreters that are given to Tcl clients. These variables should really have a Ptr extension, but the name was chosen at a time when interpreters were totally opaque to clients. iPtr: Used for variables of type Interp \, which are pointers to Tcl's internal structures for interpreters. Tcl procedures often have an argument named interp, which is copied into a local variable named iPtr in order to access the contents of the interpreter. nextPtr: A field with this name is used in structures to point to the next structure in a linked list. This is usally the last field of the structure. tkwin: Used for variables of type Tk\_Window, which are opaque handles for the window structures managed by Tk. winPtr: Used for variables of type TkWindow \, which are pointers to Tk's internal structures for windows. Tk procedures often take an argument named tkwin and immediately copy the argument into a local variable named winPtr in order to access the contents of the window structure. # Low-Level Coding Conventions This section describes several low-level syntactic rules for writing C code. The reason for having these rules is not because they're better than all other ways of structuring code, but in order to make all our code look the same. ## Indents are 4 Spaces Each level of indentation should be four spaces. There are ways to set 4-space indents in all editors that I know of. Be sure that your editor really uses four spaces for the indent, rather than just displaying tabs as four spaces wide; if you use the latter approach then the indents will appear eight spaces wide in other editors. If you use tabs, they _must_ be to 8-space indents. ## Code Comments Occupy Full Lines Comments that document code \(as opposed to declarations\) should occupy full lines, rather than being tacked onto the ends of lines containing code. The reason for this is that side-byside comments are hard to see, particularly if neighboring statements are long enough to overlap the side-by-side comments. Comments must have exactly the structure shown in Figure 5, including a leading /\ line, a trailing \/ line, and additional blank lines above and below. The leading blank line can be omitted if the comment is at the beginning of a block, as is the case in the second comment in Figure 5. Each comment should be indented to the same level as the surrounding code. Use proper English in comments: write complete sentences, capitalize the first word of each sentence, and so on. ![Figure 5. Comments in code have the form shown above, using](../assets/247fig5.png) full lines, with lined-up stars, the /\ and \/ symbols on separate lines, and blank separator lines around each comment \(except that the leading blank line can be omitted if the comment is at the beginning of a code block\). ## Declaration Comments are Side-By-Side When documenting the arguments for procedures and the members of structures, place the comments on the same lines as the declarations. Figures 3 and 4 show comments for procedure arguments and Figure 6 shows a simple structure declaration. The format for comments is the same in both cases. Place the comments to the right of the declarations, with all the left edges of all the comments lined up. When a comment requires more than one line, indent the additional lines to the same level as the first line, with the closing \/ on the same line as the end of the text. For structure declarations it is usually useful to have a block of comments preceding the declaration, as in Figure 6. This comments before the declaration use the format given in Section 5.2. ![Figure 6. Use side-by-side comments when declaring structure](../assets/247fig6.png) members and procedure arguments. Declaration comments should normally begin in the 33rd column \(i.e. where you would be after 32 spaces or 4 tabs\). ## Curly Braces: \{ Goes at the End of a Line Open curly braces should not \(normally\) appear on lines by themselves. Instead, they should be placed at the end of the preceding line. Close curly braces always appear as the first non-blank character on a line. Figure 5 shows how to use curly braces in statements such as if and while, and Figure 6 shows how curly braces should be used in structure declarations. If an if statement has an else clause then else appears on the same line as the preceding \} and the following \{. Close curly braces are indented to the same level as the outer code, i.e., four spaces less than the statements they enclose. The only cases where a \{ appears on a line by itself are the initial \{ for the body of a procedure \(see Figures 3 and 4\) or where a block is being started _without_ being the body of an if, do, for, while or switch construct. Always use curly braces around compound statements, even if there is only one statement in the block. Thus you shouldn't write code like if (filePtr->numLines == 0) return -1; but rather if (filePtr->numLines == 0) { return -1; } This approach makes code less dense, but it avoids potential mistakes when adding additional lines to an existing single-statement block. It also makes it easier to set breakpoints in a debugger, since it guarantees that each statement on is on a separate line and can be named individually. There is one exception to the rule about enclosing blocks in \{\}. For if statements with cascaded else if clauses, you may use a form like the following: if (strcmp(argv[1], "delete") == 0) { ... } else if (strcmp(argv[1], "get") == 0) { ... } else if (strcmp(argv[1], "set") == 0) { ... } else { ... } ## Continuation Lines are Indented 8 Spaces You should use continuation lines to make sure that no single line exceeds 80 characters in length. Continuation lines should be indented 8 spaces so that they won't be confused with an immediately-following nested block \(see Figure 7\). Pick clean places to break your lines for continuation, so that the continuation doesn't obscure the structure of the statement. For example, if a procedure call requires continuation lines, make sure that each argument is on a single line. If the test for an if or while command spans lines, try to make each line have the same nesting level of parentheses if possible. I try to start each continuation line with an operator such as \, &&, or \\|\\|; this makes it clear that the line is a continuation, since a new statement would never start with such an operator. ![Figure 7. Continuation lines are indented 8 spaces.](../assets/247fig7.png) ## Avoid Macros Except for Simple Things \#define statements provide a fine mechanism for specifying constants symbolically, and you should always use them instead of embedding specific numbers in your code. However, it is generally a bad idea to use macros for complex operations; procedures are almost always better \(for example, you can set breakpoints inside procedures but not in the middle of macros\). The only time that it is OK to use \#define's for complex operations is if the operations are critical to performance and there is no other way to get the performance \(have you measured the performance before and after to be sure it matters?\). When defining macros, remember always to enclose the arguments in parentheses: #define Min(a,b) (((a) < (b)) ? (a) : (b)) Otherwise, if the macro is invoked with a complex argument such as a\b or small\\|\\|red it may result in a parse error or, even worse, an unintended result that is difficult to debug. # Documenting Code The purpose of documentation is to save time and reduce errors. Documentation is typically used for two purposes. First, people will read the documentation to find out how to use your code. For example, they will read procedure headers to learn how to call the procedures. Ideally, people should have to learn as little as possible about your code in order to use it correctly. Second, people will read the documentation to find out how your code works internally, so they can fix bugs or add new features; again, good documentation will allow them to make their fixes or enhancements while learning the minimum possible about your code. More documentation isn't necessarily better: wading through pages of documentation may not be any easier than deciphering the code. Try to pick out the most important things that will help people to understand your code and focus on these in your documentation. ## Document Things with Wide Impact The most important things to document are those that affect many different pieces of a program. Thus it is essential that every procedure interface, every structure declaration, and every global variable be documented clearly. If you haven't documented one of these things it will be necessary to look at all the uses of the thing to figure out how it's supposed to work; this will be time-consuming and error-prone. On the other hand, things with only local impact may not need much documentation. For example, in short procedures I don't usually have comments explaining the local variables. If the overall function of the procedure has been explained, and if there isn't much code in the procedure, and if the variables have meaningful names, then it will be easy to figure out how they are used. On the other hand, for long procedures with many variables I usually document the key variables. Similarly, when I write short procedures I don't usually have any comments in the procedure's code: the procedure header provides enough information to figure out what is going on. For long procedures I place a comment block before each major piece of the procedure to clarify the overall flow through the procedure. ## Don't Just Repeat What's in the Code The most common mistake I see in documentation \(besides it not being there at all\) is that it repeats what is already obvious from the code, such as this trivial \(but exasperatingly common\) example: /* * Increment i. / i += 1; Documentation should provide higher-level information about the overall function of the code, helping readers to understand what a complex collection of statements really means. For example, the comment / * Probe into the hash table to see if the symbol exists. / is likely to be much more helpful than / * Mask off all but the lower 8 bits of x, then index into table * t, then traverse the list looking for a character string * identical to s. / Everything in this second comment is probably obvious from the code that follows it. Another thing to consider in your comments is word choice. Use different words in the comments than the words that appear in variable or procedure names. For example, the comment / * VmMapPage -- * * Map a page. * ... which appears in the header for the Sprite procedure VmMapPage, doesn't provide any new information. Everything in the comment is already obvious from the procedure's name. Here is a much more useful comment: /* * VmMapPage -- * * Make the given physical page addressable in the kernel's * virtual address space. This procedure is used when the * kernel needs to access a user's page. * ... This comment tells why you might want to use the procedure, in addition to what it does, which makes the comment much more useful. ## Document Each Thing in Exactly One Place Systems evolve over time. If something is documented in several places, it will be hard to keep the documentation up to date as the system changes. Instead, try to document each major design decision in exactly one place, as near as possible to the code that implements the design decision. For example, put the documentation for each structure right next to the declaration for the structure, including the general rules for how the structure is used. You need not explain the fields of the structure again in the code that uses the structure; people can always refer back to the structure declaration for this. The principal documentation for each procedure goes in the procedure header. There's no need to repeat this information again in the body of the procedure \(but you might have additional comments in the procedure body to fill in details not described in the procedure header\). If a library procedure is documented thoroughly in a manual entry, then I may make the header for the procedure very terse, simply referring to the manual entry. For example, I use this terse form in the headers for all Tcl command procedures, since there is a separate manual entry describing each command. The other side of this coin is that every major design decision needs to be documented at least once. If a design decision is used in many places, it may be hard to pick a central place to document it. Try to find a data structure or key procedure where you can place the main body of comments; then reference this body in the other places where the decision is used. If all else fails, add a block of comments to the header page of one of the files implementing the decision. ## Write Clean Code The best way to produce a well-documented system is to write clean and simple code. This way there won't be much to document. If code is clean, it means that there are a few simple ideas that explain its operation; all you have to do is to document those key ideas. When writing code, ask yourself if there is a simple concept behind the code. If not, perhaps you should rethink the code. If it takes a lot of documentation to explain a piece of code, it is a sign that you haven't found an elegant solution to the problem. ## Document As You Go It is extremely important to write the documentation as you write the code. It's very tempting to put off the documentation until the end; after all, the code will change, so why waste time writing documentation now when you'll have to change it later? The problem is that the end never comes - there is always more code to write. Also, the more undocumented code that you accumulate, the harder it is to work up the energy to document it. So, you just write more undocumented code. I've seen many people start a project fully intending to go back at the end and write all the documentation, but I've never seen anyone actually do it. If you do the documentation as you go, it won't add much to your coding time and you won't have to worry about doing it later. Also, the best time to document code is when the key ideas are fresh in your mind, which is when you're first writing the code. When I write new code, I write all of the header comments for a group of procedures before I fill in any of the bodies of the procedures. This way I can think about the overall structure and how the pieces fit together before getting bogged down in the details of individual procedures. ## Document Tricky Situations If code is non-obvious, meaning that its structure and correctness depend on information that won't be obvious to someone reading it for the first time, be sure to document the non-obvious information. One good indicator of a tricky situation is a bug. If you discover a subtle property of your program while fixing a bug, be sure to add a comment explaining the problem and its solution. Of course, it's even better if you can fix the bug in a way that eliminates the subtle behavior, but this isn't always possible. # Testing One of the environments where Tcl works best is for testing. If all the functionality of an application is available as Tcl commands, you should be able to write Tcl scripts that exercise the application and verify that it behaves correctly. For example, Tcl contains a large suite of tests that exercise nearly all of the Tcl functionality. Whenever you write new code you should write Tcl test scripts to go with that code and save the tests in files so that they can be re-run later. Writing test scripts isn't as tedious as it may sound. If you're developing your code carefully you're already doing a lot of testing; all you need to do is type your test cases into a script file where they can be re-used, rather than typing them interactively where they vanish into the void after they're run. ## Basics Tests should be organized into script files, where each file contains a collection of related tests. Individual tests should be based on the procedure test, just like in the Tcl and Tk test suites. Here are two examples: test expr-3.1 {floating-point operators} { expr 2.3.6 } 1.38 test expr-3.2 {floating-point operators} { list [catch {expr 2.3/0} msg] $msg } {1 {divide by zero}} test is a procedure defined in a script file named defs, which is sourced by each test file. The test* command takes four arguments: a test identifier, a string describing the test, a test script, and the expected result of the script. test evaluates the script and checks to be sure that it produces the expected result. If not, it prints a message like the following: ==== expr-3.1 floating-point operators ==== Contents of test case: expr 2.3.6 ==== Result was: 1.39 ---- Result should have been: 1.38 ---- expr-2.1 FAILED To run a set of tests, you start up the application and source a test file. If all goes well no messages appear; if errors are detected, a message is printed for each one. The test identifier, such as expr-3.1, is printed when errors occur. It can be used to search a test script to locate the source for a failed test. The first part of the identifier, such as expr, should be the same as the name of the test file, except that the test file should have a .test extension, such as expr.test. The two numbers allow you to divide your tests into groups. The tests in a particular group \(e.g., all the expr-3.n tests\) relate to a single sub-feature, such as a single C procedure or a single option of a Tcl command. The tests should appear in the test file in the same order as their numbers. The test name, such as floating-point operators, is printed when errors occur. It provides human-readable information about the general nature of the test. Before writing tests I suggest that you look over some of the test files for Tcl and Tk to see how they are structured. You may also want to look at the README files in the Tcl and Tk test directories to learn about additional features that provide more verbose output or restrict the set of tests that are run. Although it is possible to automatically generate names for tests, this is not recommended because it makes it difficult to search for the specific test in the test suite if all you have to go on is the test name. ## Organizing Tests Organize your tests to match the code being tested. The best way to do this is to have one test file for each source code file, with the name of the test file derived from the name of the source file in an obvious way \(e.g. textWind.test contains tests for the code in tkTextWind.c\). Within the test file, have one group of tests for each procedure \(for example, all the textWind-2.n tests in textWind.test are for the procedure TkTextWindowCmd\). The order of the tests within a group should be the same as the order of the code within the procedure. This approach makes it easy to find the tests for a particular piece of code and add new tests as the code changes. The Tcl test suite was written a long time ago and uses a different style where there is one file for each Tcl command or group of related commands, and the tests are grouped within the file by sub-command or features. In this approach the relationship between tests and particular pieces of code is much less obvious, so it is harder to maintain the tests as the code evolves. I don't recommend using this approach for new tests. ## Coverage When writing tests, you should attempt to exercise every line of source code at least once. There will be occasionally be code that you can't exercise, such as code that exits the application, but situations like this are rare. You may find it hard to exercise some pieces of code because existing Tcl commands don't provide fine enough control to generate all the possible execution paths \(for example, at the time I wrote the test suite for Tcl's dynamic string facility there were very few Tcl commands using the facility; some of the procedures were not called at all\). In situations like this, write one or more new Tcl commands just for testing purposes. For example, the file tclTest.c in the Tcl source directory contains a command testdstring, which provides a number of options that allow all of the dynamic string code to be exercised. tclTest.c is only included in a special testing version of tclsh, so the testdstring command isn't present in normal Tcl applications. Use a similar approach in your own code, where you have an extra file with additional commands for testing. It's not sufficient just to make sure each line of code is executed by your tests. In addition, your tests must discriminate between code that executes correctly and code that isn't correct. For example, write tests to make sure that the then and else branches of each if statement are taken under the correct conditions. For loops, run different tests to make the loop execute zero times, one time, and two or more times. If a piece of code removes an element from a list, try cases where the element to be removed is the first element, last element, only element, and neither first element nor last. Try to find all the places where different pieces of code interact in unusual ways, and exercise the different possible interactions. ## Memory Allocation Tcl and Tk use a modified memory allocator that checks for several kinds of memory allocation errors, such as freeing a block twice, failing to free a block, or writing past the end of a block. In order to use this allocator, don't call malloc, free, or realloc directly. Call ckalloc instead of malloc, ckfree instead of free, and ckrealloc instead of realloc. These procedures behave identically to malloc, free, and realloc except that they monitor memory usage. Ckalloc, ckfree, and ckrealloc are actually macros that can be configured with a compiler switch: if TCL\_MEM\_DEBUG is defined, they perform the checks but run more slowly and use more memory; if TCL\_MEM\_DEBUG is not defined, then the macros are just \#defined to malloc, free, and realloc so there is no penalty in efficiency. I always run with TCL\_MEM\_DEBUG in my development environment and you should too. Official releases typically do not have TCL\_MEM\_DEBUG set. If you set TCL\_MEM\_DEBUG anywhere in your code then you must set it everywhere \(including the Tcl and Tk libraries\); the memory allocator will get hopelessly confused if a block of memory is allocated with malloc and freed with ckfree, or allocated with ckalloc and freed with free. There is nothing equivalent to calloc in the debugging memory allocator. If you need a new block to be zeroed, call memset to clear its contents. If you compile with TCL\_MEM\_DEBUG, then an additional Tcl command named memory will appear in your application \(assuming that you're using the standard Tcl or Tk main program\). The memory command has the following options: memory active* _file_ > Dumps a list of all allocated blocks \(and where they were allocated\) to _file_. Memory leaks can be tracked down by comparing dumps made at different times. memory break\_on\_malloc _number_ > Enter the debugger after _number_ calls to ckalloc. memory info > Prints a report containing the total allocations and frees since Tcl began, the number of blocks currently allocated, the number of bytes currently allocated, and the maximum number of blocks and bytes allocated at any one time. memory init _onoff_ > If _onoff_ is on, new blocks of memory are initialized with a strange value to help locate uninitialized uses of the block. Any other value for _onoff_ turns initialization off. Initialization is on by default. memory trace _onoff_ > If _onoff_ is on, one line will be printed to stderr for each call to ckalloc. Any other value for _onoff_ turns tracing off. Tracing is off by default. memory trace\_on\_at\_malloc _number_ > Arranges for tracing to be turned on after _number_ calls to ckalloc. memory validate _onoff_ > If _onoff_ is on, guard zones around every allocated block are checked on every call to ckalloc or ckfree in order to detect memory overruns as soon as possible. If _onoff_ is anything other than on, checks are made only during ckfree calls and only for the block being freed. Memory validation has a very large performance impact, so it is off by default. The debugging memory allocator is inferior in many ways to commercial products like Purify, so its worth using one of the commercial products if possible. Even so, please use ckalloc and ckfree everywhere in your code, so that other people without access to the commercial checkers can still use the Tcl debugging allocator. ## Fixing Bugs Whenever you find a bug in your code it means that the test suite wasn't complete. As part of fixing the bug, you should add new tests that detect the presence of the bug. I recommend writing the tests after you've located the bug but before you fix it. That way you can verify that the bug happens before you implement the fix and goes away afterwards, so you'll know you've really fixed something. Use bugs to refine your testing approach: think about what you might be able to do differently when you write tests in the future to keep bugs like this one from going undetected. ## Tricky Features I also use tests as a way of illustrating the need for tricky code. If a piece of code has an unusual structure, and particularly if the code is hard to explain, I try to write additional tests that will fail if the code is implemented in the obvious manner instead of using the tricky approach. This way, if someone comes along later, doesn't understand the documentation for the code, decides the complex structure is unnecessary, and changes the code back to the simple \(but incorrect\) form, the test will fail and the person will be able to use the test to understand why the code needs to be the way it is. Illustrative tests are not a substitute for good documentation, but they provide a useful addition. ## Test Independence Try to make tests independent of each other, so that each test can be understood in isolation. For example, one test shouldn't depend on commands executed in a previous test. This is important because the test suite allows tests to be run selectively: if the tests depend on each other, then false errors will be reported when someone runs a few of the tests without the others. For convenience, you may execute a few statements in the test file to set up a test configuration and then run several tests based on that configuration. If you do this, put the setup code outside the calls to the test procedure so it will always run even if the individual tests aren't run. I suggest keeping a very simple structure consisting of setup followed by a group of tests. Don't perform some setup, run a few tests, modify the setup slightly, run a few more tests, modify the setup again, and so on. If you do this, it will be hard for people to figure out what the setup is at any given point and when they add tests later they are likely to break the setup. # Porting Issues The X Window System, ANSI C, and POSIX provide a standard set of interfaces that make it possible to write highly portable code. However, some additional work will still be needed if code is to port among all of the UNIX platforms. As Tcl and Tk move from the UNIX world onto PCs and Macintoshes, porting issues will become even more important. This section contains a few tips on how to write code that can run on many different platforms. ## Stick to Standards The easiest way to make your code portable is to use only library interfaces that are available everywhere \(or nearly everywhere\). For example, the ANSI C library procedures, POSIX system calls, and Xlib windowing calls are available on many platforms; if you code to these standards your packages will be quite portable. Avoid using system-specific library procedures, since they will introduce porting problems. ## Minimize \#ifdefs Although there will be situations where you have to do things differently on different machines, \#ifdefs are rarely the best way to deal with these problems. If you load up your code with \#ifdef statements based on various machines and operating systems, the code will turn into spaghetti. \#ifdefs make code unreadable: it is hard to look at \#ifdef-ed code and figure out exactly what will happen on any one machine. Furthermore, \#ifdefs encourage a style where lots of machine dependencies creep all through the code; it is much better to isolate machine dependencies in a few well-defined places. Thus you should almost never use \#ifdefs. Instead, think carefully about the ways in which systems differ and define procedural interfaces to the machine-dependent code. Then provide a different implementation of the machine-dependent procedures for each machine. When linking, choose the version appropriate for the current machine. This way all of the machine dependencies for a particular system are located in one or a few files that are totally separate from the machine-dependent code for other systems and from the main body of your code. The only "conditional" code left will be the code that selects which version to link with. You won't be able to eliminate \#ifdefs completely, but please avoid them as much as possible. If you end up with code that has a lot of \#ifdefs, this should be a warning to you that something is wrong. See if you can find a way to re-organize the code \(perhaps using the techniques described later in this section\) to reduce the number of \#ifdefs. ## Organize by Feature, Not by System Don't think about porting issues in terms of specific systems. Instead, think in terms of specific features that are present or absent in the systems. For example, don't divide your code up according to what is needed in HP-UX versus Solaris versus Windows. Instead, consider what features are present in the different systems; for example, some systems have a waitpid procedure, while others don't yet provide one, and some systems have ANSI C compilers that support procedure prototypes, while some systems do not. The feature-based approach has a number of advantages over the system-based approach. First, many systems have features in common, so you can share feature-based porting code among different systems. Second, if you think in terms of features then you can consider each feature separately \("what do I do if there is no waitpid?"\); this replaces one large problem with several smaller problems that can be dealt with individually. Lastly, the autoconf program can be used to check for the presence or absence of particular features and configure your code automatically. Once you've gotten your code running on several different systems, you'll find that many new systems can be handled with no additional work: their features are similar to those in systems you've already considered, so autoconf can handle them automatically. ## Use Emulation One of the cleanest ways to handle porting problems is with emulation: assume the existence of certain procedures, such as those in the POSIX standard, and if they don't exist on a given system then write procedures to emulate the desired functionality with the facilities that are present on the system. For example, when Tcl first started being used widely I discovered that many systems did not support the waitpid kernel call, even though it was part of the POSIX standard. So, I wrote a waitpid procedure myself, which emulated the functionality of waitpid using the wait and wait3 kernel calls. The best way to emulate waitpid was with wait3, but unfortunately wait3 wasn't available everywhere either, so the emulation worked differently on systems that had wait3 and those that supported only wait. The autoconf program checks to see which of the kernel calls are available, includes the emulation for waitpid if it isn't available, and sets a compiler flag that indicates to the emulation code whether or not wait3 is available. You can also emulate using \#defines in a header file. For example, not all systems support symbolic links, and those that don't support symbolic links don't support the lstat kernel call either. For these systems Tcl uses stat to emulate lstat with the following statement in tclUnix.h: #define lstat stat If a header file is missing on a particular system, write your own version of the header file to supply the definitions needed by your code. Then you can \#include your version in your code if the system doesn't have a version of its own. For example, here is the code in tclUnix.h that handles unistd.h, which isn't yet available on all UNIX systems: #ifdef HAVE_UNISTD_H #include <unistd.h> #else #include "compat/unistd.h" #endif The configure script generated by autoconf checks for the existence of unistd.h in the system include directories and sets HAVE\_UNISTD\_H if it is present. If it isn't present, tclUnix.h includes a version from the Tcl source tree. ## Use Autoconf The GNU autoconf program provides a powerful way to configure your code for different systems. With autoconf you write a script called configure.in that describes the porting issues for your software in terms of particular features that are needed and what to do if they aren't present. Before creating a release of your software you run autoconf, which processes configure.in and generates a shell script called configure. You then include configure with your distribution. When it is time to install the distribution on a particular system, the installer runs the configure script. configure pokes around in the system to find out what features are present, then it modifies the Makefile accordingly. The modifications typically consist of compiling additional files to substitute for missing procedures, or setting compiler flags that can be used for conditional compilation in the code. Use of libtool is not recommended; it tends to inhibit porting to anything other than fairly conventional UNIX platforms. ## Porting Header File In spite of all the above advice, you will still end up needing some conditional compilation, for example to include alternate header files where standard ones are missing or to \#define symbols that aren't defined on the system. Put all of this code in the porting header file for the package, then \#include this header file in each of the source files of the package. With this approach you only need to change a single place if you have to modify your approach to portability, and you can see all of the porting issues in one place. You can look at tclPort.h and tkPort.h for examples of porting header files. # Miscellaneous ## Changes Files Each package should contain a file named changes that keeps a log of all significant changes made to the package. The changes file provides a way for users to find out what's new in each new release, what bugs have been fixed, and what compatibility problems might be introduced by the new release. The changes file should be in chronological order. Just add short blurbs to it each time you make a change. Here is a sample from the Tk changes file: 5/19/94 (bug fix) Canvases didn't generate proper Postscript for stippled text. 5/20/94 (new feature) Added "bell" command to ring the display's bell. 5/26/94 (feature removed) Removed support for "fill" justify mode from Tk_GetJustify and from the TK_CONFIG_JUSTIFY configuration option. None of the built-in widgets ever supported this mode anyway. * POTENTIAL INCOMPATIBILITY * The entries in the changes file can be relatively terse; once someone finds a change that is relevant, they can always go to the manual entries or code to find out more about it. Be sure to highlight changes that cause compatibility problems, so people can scan the changes file quickly to locate the incompatibilities. _\(The Tcl and Tk core additionally uses a ChangeLog file that has a much higher detail within it. This has the advantage of having more tooling support, but tends to be so verbose that the shorter summaries in the changes file are still written up by the core maintainers before each release.\)_ # Copyright This document has been placed in the public domain.

~~1 2 3 4 5 6 7 8 9~~ 10 11 12 13 14 15 16 17 18 19 20 21 22 ~~23 24~~ 25 26 27 28 ~~29 30~~ 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 ~~55 56~~ 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 ~~97 98~~ 99 100 101 102 ~~103~~ 104 105 106 107 ~~108~~ 109 110 ~~111~~ 112 ~~113~~ 114 ~~115~~ 116 117 ~~118~~ 119 ~~120~~ 121 ~~122~~ 123 124 125 126 127 ~~128 129 130 131 132~~ 133 ~~134~~ 135 136 ~~137~~ 138 139 140 141 142 ~~143~~ 144 145 146 147 148 ~~149~~ 150 ~~151~~ 152 153 154 155 156 157 ~~158~~ 159 160 ~~161~~ 162 163 164 165 166 167 168 169 ~~170~~ 171 172 173 174 175 176 177 ~~178~~ 179 180 181 182 183 ~~184 185 186~~ 187 188 189 190 191 192 193 194 195 196 197 198 ~~199~~ 200 ~~201~~ 202 203 ~~204~~ 205 206 ~~207 208~~ 209 210 211 212 ~~213~~ 214 215 216 ~~217~~ 218 219 220 221 222 223 224 225 226 227 228 229 230 ~~231~~ 232 233 234 235 236 237 238	~~TIP: 28~~ T~~itle~~: How to be a good maintainer for Tcl/Tk ~~Version: $Revision: 1.24 $~~ Author: Don Porter <[email protected]> State: Draft Type: Informative Vote: Pending Created: 23-Feb-2001 Post-History: ~~~ Abstract~~ This document presents information and advice to maintainers in the ~~form of a Frequently Asked Questions ~~(FAQ~~) list.~~ ~~~ Preface~~ Notice in the header above that this is a Draft document. It won't be ~~the ''official'' word of the TCT unless/until it is accepted by the~~ TCT. Meanwhile, it should still be a helpful guide to those serving or considering service as maintainers. At the very least it's a useful straw man to revise into something better. Help us make it ~~even more useful by using the [~~[Edit]~~] link at the bottom of this page (if any) to add/revise the questions and answers, or add your~~ comments. ~~~ Background~~ ~~TCT procedures (see [0]) calls for one or more ''maintainers'' to take responsibility for each functional area of the Tcl ~~([16]~~) or Tk ~~([23]~~)~~ source code. Every source code patch to Tcl or Tk will be committed to the official branches of the appropriate CVS repository only after approval by an appropriate set of maintainers. ~~~ Can I be a Tcl/Tk maintainer?~~ Most likely. To be a maintainer, you should have... * ...an interest in Tcl/Tk. * ...access to the Internet (Web and e-mail). * ...some volunteer time to contribute. * ...the ability and the support software to code in C and/or Tcl, use CVS, use SourceForge facilities, and familiarity with a portion of the Tcl/Tk source code to be maintained, or the willingness to acquire these things. For the most part, if you are reading this document, you probably have what it takes to be a Tcl/Tk maintainer. ~~~ What can I maintain?~~ ~~The Tcl Core Team ~~(TCT~~) has divided up the Tcl/Tk source code into functional areas as described in [~~16]~~ and [~~23]~~. You can volunteer to~~ help maintain as many areas as you think you can handle. Select those you have experience with or an interest in. ~~~ What does a maintainer do?~~ Maintainers are the people who make changes to the files that make up the source code distribution of Tcl or Tk -- code, documentation, and tests. That's what a maintainer does: check in changes to the official source code in the area he/she maintains. The source code can be changed for several reasons: to correct a bug, to add a new feature, or to re-implement an existing feature in a new way. The reason for a change controls how much oversight the maintainer must have while making the change. More on this below. ~~~ How do I prepare to be a maintainer?~~ The official repositories of Tcl and Tk source code are kept at SourceForge, so you need to register for a SourceForge account ~~[https://sourceforge.net/account/register.php]. As part of the~~ registration, you will select a login name. When you volunteer as a maintainer, the administrators of the Tcl or Tk projects will need that name to give you write access to the appropriate repository. Once you have a SourceForge account, get familiar with the tools it provides. Most important is that you get set up to use CVS over SSH to access the repository. This can be difficult. There are some ~~notes [http://tcltk.org/sourceforge] on how other Developers on the~~ Tcl and Tk projects have been able to successfully get this done. This document does not include instructions on how to use CVS. See the following references for assistance with learning CVS. * http://cvsbook.red-bean.com/cvsbook.html ~~''Add more references here please.''~~ ~~~ How do I volunteer to be a maintainer?~~ Send a message to <[email protected]> telling the TCT ~~your SourceForge login name and what area(s) you want to help maintain. Someone will add you to the list of ''Developers'' on the~~ Tcl or Tk projects and enable your access to SourceForge features like the Bug Tracker and Patch Manager. As a Developer, you will have write access to the appropriate repository of official source code. ~~~ Write access! So I can just start changing Tcl/Tk?!~~ For some purposes, yes. For others, you'll need to get approval from the TCT first. Read on... ~~~ What Internet resources does a maintainer use?~~ A maintainer uses the SourceForge Bug Tracker for Tcl or Tk to learn ~~what bugs are reported in his area (browse by Category).~~ * http://sourceforge.net/bugs/?group_id=10894 * http://sourceforge.net/bugs/?group_id=12997 A maintainer uses the SourceForge Patch Manager for Tcl or Tk to learn ~~what patches make changes in his area (browse by Category).~~ * http://sourceforge.net/patch/?group_id=10894 * http://sourceforge.net/patch/?group_id=12997 A maintainer uses CVS via SSH to access, track, and modify the various branches of development in the repository of official Tcl or Tk source code. ~~\|cvs -d :ext:[email protected]:/cvsroot/tcl \ \| checkout -r $BRANCH_TAG -d $LOCAL_DIR tcl \| \|cvs -d :ext:[email protected]:/cvsroot/tktoolkit \ \| checkout -r $BRANCH_TAG -d $LOCAL_DIR tk~~ ~~A maintainer examines the state of Tcl Improvement Proposals ~~(TIPs~~) and~~ adds his comments to them at the TIP Document Collection. * http://purl.org/tcl/home/cgi-bin/tct/tip/ A maintainer may follow and participate in TCT discussions about TIPs and other matters concerning Tcl/Tk development on the TCLCORE mailing list. * http://lists.sourceforge.net/lists/listinfo/tcl-core A maintainer may receive e-mail notification every time any change is made to any entry in Tcl's or Tk's Bug Tracker or Patch Manager by subscribing to the TCLBUGS mailing list. * http://lists.sourceforge.net/lists/listinfo/tcl-bugs ~~~ There are multiple maintainers in my area. What do I do?~~ The maintainer tasks are the same; you just have more hands to get the job done. It is up to the maintainers of an area to decide among themselves how they will divide the tasks. They might each take on a particular subset of files. Or they might let some maintainers fix bugs while others review new features. Or they might appoint one ~~maintainer as the ''lead'' and let him assign tasks to the others.~~ Whatever works for you, and gets the work done. ~~~ I found a bug in my area. What do I do?~~ Bug finding and reporting is a job for the whole community, so when you find a bug, take off your maintainer hat. Report it to the Bug Tracker just like anyone would. If you recognize that the bug is in your area, go ahead and assign it to the Category for your area and to yourself or one of the other maintainers who share responsibility for that area. ~~~ Why do I report the bug to myself?~~ So that the bug appears in the database. Someone else may find it too, and when they go to report it to the Bug Tracker, they should discover that it's an already reported problem. A registered bug report is also the place where progress on fixing the bug can be recorded for all to see. ~~~ There's a bug reported in the Category for the area I maintain. What do I do?~~ First, understand the bug report. The best bug reports are clear and come with a demonstration script, but not all reports are so well crafted. You may need to exchange messages with the person who reported the bug. If the reporter logged in to SourceForge as ~~''username'' before submitting a report, then you can write back to ''[email protected]''. If the bug was reported by ''nobody'', the best you can do is post a followup comment to the bug~~ asking for more information, and hope the reporter comes back to check. Next, confirm that the bug report is valid, original, and that it belongs in your area. Does it correctly assert that some public interface provided by your area behaves differently from its documented behavior? If not, then you should take the appropriate action: 1. If the bug report notes a problem in another project, assign it to a Developer who is an Admin of the other project. Add a comment asking them to reassign to the correct project. Assigned ~~To: ''an Admin of the other project''.~~ ~~> If no Developer is an Admin of the other project, or the other~~ project isn't hosted by SourceForge, note the error in a comment, and mark the report invalid. Resolution: Invalid; Status: ~~Closed; Assigned To: ''yourself''.~~ 1. If the bug report notes a problem due to a bug in another area, ~~reassign it to the appropriate Category. Category: ''correct category''~~ 1. If the reporter's expectations are incorrect, point them to the documentation. You may also want to revise the documentation if it is not clear. Resolution: Invalid; Status: Closed; Assigned ~~To: ''yourself''.~~ 1. If the bug report notes a problem already noted by another bug report, note the duplication. Resolution: Duplicate; Status: ~~Closed; Assigned To: ''yourself''.~~ 1. If the bug report acknowledges that the code is behaving as documented, but argues that the documented behavior should be revised, then the report is a feature request rather than a bug report. More on handling feature requests below. Group: Feature Request. Valid, original bug reports in your area should be assigned to a maintainer of your area. If you are the only maintainer of your area, assign the bug to yourself. If there are multiple maintainers, you should decide among yourselves how to divide up the bug report assignments. ~~~ There's a bug assigned to me. What do I do?~~ Now we get the the heart of what a maintainer does. This is where you unleash the energies and talents you bring to the table. So, the best answer is "Do what works best for you." The rest of this answer should be read as additional guidelines and tips that have worked well for others and might help you, but not as a mandatory checklist you must follow. If some advice below seems more burdensome than helpful, fall	< \| < \| \| \| \| \| \| > \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237	# TIP 28: How to be a good maintainer for Tcl/Tk Author: Don Porter <[email protected]> State: Draft Type: Informative Vote: Pending Created: 23-Feb-2001 Post-History: ----- # Abstract This document presents information and advice to maintainers in the form of a Frequently Asked Questions \(FAQ\) list. # Preface Notice in the header above that this is a Draft document. It won't be the _official_ word of the TCT unless/until it is accepted by the TCT. Meanwhile, it should still be a helpful guide to those serving or considering service as maintainers. At the very least it's a useful straw man to revise into something better. Help us make it even more useful by using the [Edit] link at the bottom of this page \(if any\) to add/revise the questions and answers, or add your comments. # Background TCT procedures \(see [[0]](0.md)\) calls for one or more _maintainers_ to take responsibility for each functional area of the Tcl \([[16]](16.md)\) or Tk \([[23]](23.md)\) source code. Every source code patch to Tcl or Tk will be committed to the official branches of the appropriate CVS repository only after approval by an appropriate set of maintainers. # Can I be a Tcl/Tk maintainer? Most likely. To be a maintainer, you should have... * ...an interest in Tcl/Tk. * ...access to the Internet \(Web and e-mail\). * ...some volunteer time to contribute. * ...the ability and the support software to code in C and/or Tcl, use CVS, use SourceForge facilities, and familiarity with a portion of the Tcl/Tk source code to be maintained, or the willingness to acquire these things. For the most part, if you are reading this document, you probably have what it takes to be a Tcl/Tk maintainer. # What can I maintain? The Tcl Core Team \(TCT\) has divided up the Tcl/Tk source code into functional areas as described in [[16]](16.md) and [[23]](23.md). You can volunteer to help maintain as many areas as you think you can handle. Select those you have experience with or an interest in. # What does a maintainer do? Maintainers are the people who make changes to the files that make up the source code distribution of Tcl or Tk -- code, documentation, and tests. That's what a maintainer does: check in changes to the official source code in the area he/she maintains. The source code can be changed for several reasons: to correct a bug, to add a new feature, or to re-implement an existing feature in a new way. The reason for a change controls how much oversight the maintainer must have while making the change. More on this below. # How do I prepare to be a maintainer? The official repositories of Tcl and Tk source code are kept at SourceForge, so you need to register for a SourceForge account <https://sourceforge.net/account/register.php> . As part of the registration, you will select a login name. When you volunteer as a maintainer, the administrators of the Tcl or Tk projects will need that name to give you write access to the appropriate repository. Once you have a SourceForge account, get familiar with the tools it provides. Most important is that you get set up to use CVS over SSH to access the repository. This can be difficult. There are some notes <http://tcltk.org/sourceforge> on how other Developers on the Tcl and Tk projects have been able to successfully get this done. This document does not include instructions on how to use CVS. See the following references for assistance with learning CVS. * <http://cvsbook.red-bean.com/cvsbook.html> _Add more references here please._ # How do I volunteer to be a maintainer? Send a message to <[email protected]> telling the TCT your SourceForge login name and what area\(s\) you want to help maintain. Someone will add you to the list of _Developers_ on the Tcl or Tk projects and enable your access to SourceForge features like the Bug Tracker and Patch Manager. As a Developer, you will have write access to the appropriate repository of official source code. # Write access! So I can just start changing Tcl/Tk?! For some purposes, yes. For others, you'll need to get approval from the TCT first. Read on... # What Internet resources does a maintainer use? A maintainer uses the SourceForge Bug Tracker for Tcl or Tk to learn what bugs are reported in his area \(browse by Category\). * <http://sourceforge.net/bugs/?group\_id=10894> * <http://sourceforge.net/bugs/?group\_id=12997> A maintainer uses the SourceForge Patch Manager for Tcl or Tk to learn what patches make changes in his area \(browse by Category\). * <http://sourceforge.net/patch/?group\_id=10894> * <http://sourceforge.net/patch/?group\_id=12997> A maintainer uses CVS via SSH to access, track, and modify the various branches of development in the repository of official Tcl or Tk source code. cvs -d :ext:[email protected]:/cvsroot/tcl \ checkout -r $BRANCH_TAG -d $LOCAL_DIR tcl cvs -d :ext:[email protected]:/cvsroot/tktoolkit \ checkout -r $BRANCH_TAG -d $LOCAL_DIR tk A maintainer examines the state of Tcl Improvement Proposals \(TIPs\) and adds his comments to them at the TIP Document Collection. * <http://purl.org/tcl/home/cgi-bin/tct/tip/> A maintainer may follow and participate in TCT discussions about TIPs and other matters concerning Tcl/Tk development on the TCLCORE mailing list. * <http://lists.sourceforge.net/lists/listinfo/tcl-core> A maintainer may receive e-mail notification every time any change is made to any entry in Tcl's or Tk's Bug Tracker or Patch Manager by subscribing to the TCLBUGS mailing list. * <http://lists.sourceforge.net/lists/listinfo/tcl-bugs> # There are multiple maintainers in my area. What do I do? The maintainer tasks are the same; you just have more hands to get the job done. It is up to the maintainers of an area to decide among themselves how they will divide the tasks. They might each take on a particular subset of files. Or they might let some maintainers fix bugs while others review new features. Or they might appoint one maintainer as the _lead_ and let him assign tasks to the others. Whatever works for you, and gets the work done. # I found a bug in my area. What do I do? Bug finding and reporting is a job for the whole community, so when you find a bug, take off your maintainer hat. Report it to the Bug Tracker just like anyone would. If you recognize that the bug is in your area, go ahead and assign it to the Category for your area and to yourself or one of the other maintainers who share responsibility for that area. # Why do I report the bug to myself? So that the bug appears in the database. Someone else may find it too, and when they go to report it to the Bug Tracker, they should discover that it's an already reported problem. A registered bug report is also the place where progress on fixing the bug can be recorded for all to see. # There's a bug reported in the Category for the area I maintain. What do I do? First, understand the bug report. The best bug reports are clear and come with a demonstration script, but not all reports are so well crafted. You may need to exchange messages with the person who reported the bug. If the reporter logged in to SourceForge as _username_ before submitting a report, then you can write back to _[email protected]_. If the bug was reported by _nobody_, the best you can do is post a followup comment to the bug asking for more information, and hope the reporter comes back to check. Next, confirm that the bug report is valid, original, and that it belongs in your area. Does it correctly assert that some public interface provided by your area behaves differently from its documented behavior? If not, then you should take the appropriate action: 1. If the bug report notes a problem in another project, assign it to a Developer who is an Admin of the other project. Add a comment asking them to reassign to the correct project. Assigned To: _an Admin of the other project_. > If no Developer is an Admin of the other project, or the other project isn't hosted by SourceForge, note the error in a comment, and mark the report invalid. Resolution: Invalid; Status: Closed; Assigned To: _yourself_. 1. If the bug report notes a problem due to a bug in another area, reassign it to the appropriate Category. Category: _correct category_ 1. If the reporter's expectations are incorrect, point them to the documentation. You may also want to revise the documentation if it is not clear. Resolution: Invalid; Status: Closed; Assigned To: _yourself_. 1. If the bug report notes a problem already noted by another bug report, note the duplication. Resolution: Duplicate; Status: Closed; Assigned To: _yourself_. 1. If the bug report acknowledges that the code is behaving as documented, but argues that the documented behavior should be revised, then the report is a feature request rather than a bug report. More on handling feature requests below. Group: Feature Request. Valid, original bug reports in your area should be assigned to a maintainer of your area. If you are the only maintainer of your area, assign the bug to yourself. If there are multiple maintainers, you should decide among yourselves how to divide up the bug report assignments. # There's a bug assigned to me. What do I do? Now we get the the heart of what a maintainer does. This is where you unleash the energies and talents you bring to the table. So, the best answer is "Do what works best for you." The rest of this answer should be read as additional guidelines and tips that have worked well for others and might help you, but not as a mandatory checklist you must follow. If some advice below seems more burdensome than helpful, fall
︙			︙
267 268 269 270 271 272 273 ~~274~~ 275 276 277 278 279 280 281	feature is added to Tcl/Tk. In those cases, add a comment to the original bug report so those interested will know what is causing the delay. SourceForge may offer a way to denote these dependencies as well. If you have trouble fixing the bug, ask for help. Try the other maintainers of your area first. Then try posting comments attached to ~~the original bug report. Using ''cvs log'', you can get a list of~~ developers who've recently made changes to the files you maintain. They might be able to offer advice, or explanations about why the code is the way it is. If none of these focused searches for help bears fruit, then try broader requests to the TCLCORE mailing lists, or the news:comp.lang.tcl newsgroup. At any time, you may have several bugs assigned to you. It will help	\|	266 267 268 269 270 271 272 273 274 275 276 277 278 279 280	feature is added to Tcl/Tk. In those cases, add a comment to the original bug report so those interested will know what is causing the delay. SourceForge may offer a way to denote these dependencies as well. If you have trouble fixing the bug, ask for help. Try the other maintainers of your area first. Then try posting comments attached to the original bug report. Using _cvs log_, you can get a list of developers who've recently made changes to the files you maintain. They might be able to offer advice, or explanations about why the code is the way it is. If none of these focused searches for help bears fruit, then try broader requests to the TCLCORE mailing lists, or the news:comp.lang.tcl newsgroup. At any time, you may have several bugs assigned to you. It will help
︙			︙
291 292 293 294 295 296 297 ~~298 299~~ 300 301 302 303 304 ~~305 306~~ 307 308 309 310 ~~311~~ 312 313 314 315 316 ~~317 318~~ 319 320 321 322 ~~323~~ 324 325 326 327 328 329 330 331 332 ~~333~~ 334 335 336 ~~337~~ 338 339 340 341 ~~342 343~~ 344 ~~345~~ 346 347 348 349 350 351 352 353 354 355 356 357 358 ~~359~~ 360 361 362 363 364 ~~365 366 367~~ 368 369 370 371 372 373 374 375 376 377 ~~378~~ 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 ~~399 400~~ 401 402 403 404 405 406 ~~407 408~~ 409 410 411 412 413 414 415	1. Other bug fixes are waiting on this bug fix. 1. Several duplicate reports or "me too" comments about the bug are coming in from the community. Some reasons you might give a bug a lower priority include: ~~1. A workaround is identified (add it as a comment attached to the bug report).~~ 1. Feature requests tend to get lower priority since they should be handled through the TIP process. Once you have crafted a fix for the bug, create a patch to the ~~official source code (including the new tests that test for the fixed bug) and register it with the SourceForge Patch Manager. Note the~~ number of the bug report fixed by the patch somewhere in the summary or comments associated with the patch. Assign the patch to yourself. Assign the Category to the area you maintain. ~~~ There's a patch registered under the Category I maintain. What do I do?~~ The SourceForge Patch Manager is used to review and revise patches before they are committed to the official source code. Your actions depend on what the patch does to your area, and who the patch is assigned to. The patch may change the public interface provided by ~~your area (feature change); or the change may be completely internal (bug fix, or re-implementation) within your area. The patch may be~~ assigned to you, to someone else, or to nobody. The person the patch is assigned to is the person who is leading the effort to integrate the patch into the official source code. ~~~ What if the patch is assigned to nobody?~~ The patch has probably been contributed by someone not on the list of Developers. It may be a contributed bug fix, or a contributed implementation of a TIP. Assign contributed bug fixes to the same maintainer who is assigned the corresponding bug report. If there is no corresponding bug report, add one. Assign TIP implementations to the Developer identified in the TIP as the one responsible for implementation of that TIP, or the TCT member who sponsored the TIP. ~~If the patch changes only your area (and shared or generated files),~~ then leave the Category in your area. If the patch changes other areas as well as yours, change the category to None. ~~~ What if the patch is assigned to me?~~ Presumably you've assigned it to yourself to indicate that you're taking charge of integrating that patch into the official sources. If that's a mistake, treat the patch as if it were assigned to nobody. ~~If you are the one leading the integration effort, see below (How do I integrate a patch into the official sources?).~~ ~~~ What if the patch is assigned to someone else?~~ If the patch is assigned to another maintainer in your area, let him handle it. Leave it alone. If the patch makes no changes in your area, change the Category of the patch to None. If the patch makes changes in your area, and is assigned to a Developer who is not a maintainer of your area, that Developer is asking for review of the patch's changes to your area. You or one of the other maintainers of your area should review the patch and accept or reject it. Read on... ~~~ What special review does a "feature change" patch require?~~ Changes to the public interface of your section must be proposed to and accepted by the TCT through the TIP process before they can be added to the official Tcl source code. If the patch changes the public interface of your section, then there should be an associated ~~TIP describing the new feature(s) that patch implements. Until there is such a TIP, and that TIP has been accepted by the TCT (check the value of the State header), you should not approve the patch.~~ Once there is an approved TIP corresponding to the patch, you should confirm that the patch correctly implements the accepted feature as described by the TIP. If not, you should not approve the patch. After confirming that the patch correctly implements the feature change described in an accepted TIP, you should still review the technical merit of the patch's changes to your area before approving it. ~~~ How do I review the technical merits of a patch?~~ Apply the patch and run the test suites that cover your area. Check that the patch does not add any new test failures. If the patch is a bug fix, check that it actually fixes the bug. Think five times before approving a patch that causes new test failures or incompletely fixes a bug or incompletely implements an approved TIP. Keep in mind that once the patch is integrated into the official sources, you'll be expected to maintain it. It is not in your interest to approve patches that make your job harder. Think four times before approving a patch that you do not understand. Check that the patch keeps the features offered on different platforms consistent. If not, be certain that the documentation properly notes the platform-specific behavior. Think three times before approving a patch that causes the capabilities of Tcl/Tk to further diverge on different platforms. Check that the patch follows Tcl's established coding conventions. See the Tcl/Tk Engineering Manual ~~[http://purl.org/tcl/home/doc/engManual.pdf] and the Tcl Style Guide [http://purl.org/tcl/home/doc/styleGuide.pdf] for details. This is~~ especially important when accepting contributed patches. Think twice before approving a patch that doesn't conform to these conventions. Check the effect of the patch on the performance of Tcl/Tk. Use the tclbench set of benchmarks. ~~\|cvs -d :pserver:[email protected]:/cvsroot/tcllib \ \| checkout tclbench~~ Think carefully before approving a patch that significantly degrades the performance of important operations. Finally, while examining the patch, you may see a better way to accomplish the effect of the changes in your area. If you can provide that alternative implementation reasonably quickly, then propose it as	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414	1. Other bug fixes are waiting on this bug fix. 1. Several duplicate reports or "me too" comments about the bug are coming in from the community. Some reasons you might give a bug a lower priority include: 1. A workaround is identified \(add it as a comment attached to the bug report\). 1. Feature requests tend to get lower priority since they should be handled through the TIP process. Once you have crafted a fix for the bug, create a patch to the official source code \(including the new tests that test for the fixed bug\) and register it with the SourceForge Patch Manager. Note the number of the bug report fixed by the patch somewhere in the summary or comments associated with the patch. Assign the patch to yourself. Assign the Category to the area you maintain. # There's a patch registered under the Category I maintain. What do I do? The SourceForge Patch Manager is used to review and revise patches before they are committed to the official source code. Your actions depend on what the patch does to your area, and who the patch is assigned to. The patch may change the public interface provided by your area \(feature change\); or the change may be completely internal \(bug fix, or re-implementation\) within your area. The patch may be assigned to you, to someone else, or to nobody. The person the patch is assigned to is the person who is leading the effort to integrate the patch into the official source code. # What if the patch is assigned to nobody? The patch has probably been contributed by someone not on the list of Developers. It may be a contributed bug fix, or a contributed implementation of a TIP. Assign contributed bug fixes to the same maintainer who is assigned the corresponding bug report. If there is no corresponding bug report, add one. Assign TIP implementations to the Developer identified in the TIP as the one responsible for implementation of that TIP, or the TCT member who sponsored the TIP. If the patch changes only your area \(and shared or generated files\), then leave the Category in your area. If the patch changes other areas as well as yours, change the category to None. # What if the patch is assigned to me? Presumably you've assigned it to yourself to indicate that you're taking charge of integrating that patch into the official sources. If that's a mistake, treat the patch as if it were assigned to nobody. If you are the one leading the integration effort, see below \(How do I integrate a patch into the official sources?\). # What if the patch is assigned to someone else? If the patch is assigned to another maintainer in your area, let him handle it. Leave it alone. If the patch makes no changes in your area, change the Category of the patch to None. If the patch makes changes in your area, and is assigned to a Developer who is not a maintainer of your area, that Developer is asking for review of the patch's changes to your area. You or one of the other maintainers of your area should review the patch and accept or reject it. Read on... # What special review does a "feature change" patch require? Changes to the public interface of your section must be proposed to and accepted by the TCT through the TIP process before they can be added to the official Tcl source code. If the patch changes the public interface of your section, then there should be an associated TIP describing the new feature\(s\) that patch implements. Until there is such a TIP, and that TIP has been accepted by the TCT \(check the value of the State header\), you should not approve the patch. Once there is an approved TIP corresponding to the patch, you should confirm that the patch correctly implements the accepted feature as described by the TIP. If not, you should not approve the patch. After confirming that the patch correctly implements the feature change described in an accepted TIP, you should still review the technical merit of the patch's changes to your area before approving it. # How do I review the technical merits of a patch? Apply the patch and run the test suites that cover your area. Check that the patch does not add any new test failures. If the patch is a bug fix, check that it actually fixes the bug. Think five times before approving a patch that causes new test failures or incompletely fixes a bug or incompletely implements an approved TIP. Keep in mind that once the patch is integrated into the official sources, you'll be expected to maintain it. It is not in your interest to approve patches that make your job harder. Think four times before approving a patch that you do not understand. Check that the patch keeps the features offered on different platforms consistent. If not, be certain that the documentation properly notes the platform-specific behavior. Think three times before approving a patch that causes the capabilities of Tcl/Tk to further diverge on different platforms. Check that the patch follows Tcl's established coding conventions. See the Tcl/Tk Engineering Manual <http://purl.org/tcl/home/doc/engManual.pdf> and the Tcl Style Guide <http://purl.org/tcl/home/doc/styleGuide.pdf> for details. This is especially important when accepting contributed patches. Think twice before approving a patch that doesn't conform to these conventions. Check the effect of the patch on the performance of Tcl/Tk. Use the tclbench set of benchmarks. cvs -d :pserver:[email protected]:/cvsroot/tcllib \ checkout tclbench Think carefully before approving a patch that significantly degrades the performance of important operations. Finally, while examining the patch, you may see a better way to accomplish the effect of the changes in your area. If you can provide that alternative implementation reasonably quickly, then propose it as
︙			︙
430 431 432 433 434 435 436 ~~437~~ 438 439 440 441 ~~442~~ 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 ~~463~~ 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 ~~482~~ 483 484 485 486 487 488 489 490 491 492 493 494 495 ~~496~~ 497 498 499 ~~500 501~~ 502 503 504 ~~505~~ 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 ~~521~~ 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 ~~541 542 543~~ 544 545 546 547 548 549 550 551 552 553 554 555 556 ~~557~~ 558 559 560 561 562 563 564 565 566 567 568 569 570 571 ~~572~~ 573 574 575 576 ~~577 578~~ 579 ~~580~~ 581 582	you can supply the needed revisions with reasonable effort, do so. If the patch changes multiple areas, set the Category of the patch back to None. Unless the patch is assigned to you, do not change the Status of the patch. Leave that to the Developer assigned to the patch. ~~~ How do I integrate a patch into the official sources?~~ First you need the approval of at least one maintainer of each section changed by the patch. ~~~ How do I get approval for integration?~~ First, assign the patch to yourself to indicate that you are leading the integration effort. Next, determine the list of categories corresponding to the areas changed by the patch. It may help if you list them in a comment attached to the patch. For each category in the list, assign the Category of the patch to that category. Then wait for a maintainer for that area to review the patch. If one approves it, then assign the next Category in the list. If maintainers for all areas on the list approve the same patch, you may integrate the patch into the official sources. If a maintainer rejects the patch, revise the patch to address his concerns. Then start the review again. Start with the maintainer who rejected the first patch to be sure his concerns are addressed first. Note that if the patch changes only the area you maintain, then you may immediately integrate the patch into the official sources once you are satisfied with it and it is registered in the Patch Manager. ~~~ The patch is approved. How should it be integrated?~~ Get a CVS working directory that is up to date with the HEAD branch of the official source repository. Apply the patch to your working directory, and then 'cvs commit' the changes to the HEAD branch. At the same time you commit the patch, be sure to add an entry to the ChangeLog file describing the change. Follow the established format, which is derived from the GNU coding conventions. The description should be brief, but should describe the change reasonably completely. Include the SourceForge Bug and Patch ID numbers in the ChangeLog entry, but do not assume that the reader will have access to the Bug Tracker and Patch Manager to be able to understand the entry. You may assume the reader has access to the documentation. Finally, with the patch integrated, change the Status of the patch in the Patch Manager to Accepted. If any bugs were fixed by the patch, change their Resolution to Fixed, and their Status to Closed. ~~~ I want a patch review even though the patch changes only my area.~~ Keep in mind that integrating a patch into the official sources is not an irreversible act. Commits to the HEAD branch will be checked out and tested by members of the Tcl community who are tracking Tcl/Tk development. Alpha and beta releases of Tcl/Tk that include your patch will also get your changes reviewed in practical settings. That said, if you really want a pre-commit review of your patch, you can add a comment to the patch asking for review. Someone will probably respond. It's up to your judgment how long to wait, keeping in mind that you are the maintainer, so your judgment on the quality of patches in your area is implicitly trusted. ~~~ What about CVS branches?~~ When you integrate a patch into the official source code, you will usually 'cvs commit' the patch onto the HEAD branch. If the patch ~~includes a feature change, it must (except in unusual circumstances approved by the TCT) be committed to the HEAD branch. The HEAD branch~~ is the development branch from which alpha releases of Tcl/Tk are generated. ~~At any time, there is also one or more ''stable'' branches of~~ development. As of February, 2001, the branch 'core-8-3-1-branch' indicates the sequence of revisions from which the 8.3.x releases of Tcl/Tk are generated. Since the Tcl Core Team took over development of Tcl/Tk, no changes have been committed to a stable branch, so we really have not established procedures on how we will decide what bug fixes should and should not be applied to the stable branch. It is possible that maintainers will be involved, though. It is also possible that a special team will be appointed to update the stable branch in preparation for the next stable release. In the case that you as a maintainer are asked to commit to the stable branch, be aware that the only patches that should be committed to a stable branch are those that fix bugs. No new features should be committed here. ~~The other kind of branch is a ''feature'' branch. This is a~~ development branch on which a sequence of several revisions may be committed as work in progress on a new feature, or re-implementation of existing features. Typically a feature branch will be created if the effort... * ...touches on several functional areas; * ...is worked on jointly by several Developers; * ...is complex enough to require several revisions; * ...needs prototyping to determine the best TIP proposal to make; or * ...makes an incompatible change to Tcl/Tk that properly belongs on the next major version of Tcl/Tk before the HEAD branch has been designated for work toward the next major version. As a Developer, feel free to create a feature branch if you have a ~~reason to use one. Make a note of your branch tags in [~~31]~~. Avoid the use of a branch tag matching core-* . Save the core-* branch tags for the tags of official stable branches~~ and releases. To avoid conflict with other Developers, consider using your SourceForge login name as a prefix on the feature branch tags you create. Try to also make the branch tag descriptive of the purpose of the branch. One big advantage of a feature branch is that any Developer may commit changes to a feature branch without all the publication, review, and approval overhead required when committing patches to the HEAD or stable branches. On the feature branches you can go through multiple revisions reasonably quickly and spend the administrative overhead only at the end when it is time to apply the finished product to the official branches. ~~~ What other things does a maintainer do?~~ The tasks of fixing bugs and approving and committing patches to the official source code of Tcl and Tk are the core tasks that maintainers perform. That's all the job actually requires. You will probably want to keep an eye on the TCT's plans for Tcl/Tk development as well. If a TIP proposes a new feature in your area, it is in your interest to know about it, and propose revisions and improvements to it. Ultimately you will be asked to approve the patch that implements the new feature, and then you will be expected to maintain it, so if you have concerns about a proposal, it's best to make them known early. TCT members will probably ask your opinion on TIPs that propose changes to your area for this reason. ~~~ Comments~~ Please add your comments here. > Well, since I drafted this SourceForge has replaced the ~~Bug Tracker and Patch Manager with a ''Tracker''. This TIP ''really'' needs revision now.~~ ~~~ Copyright~~ This document has been placed in the public domain.	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| >	429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582	you can supply the needed revisions with reasonable effort, do so. If the patch changes multiple areas, set the Category of the patch back to None. Unless the patch is assigned to you, do not change the Status of the patch. Leave that to the Developer assigned to the patch. # How do I integrate a patch into the official sources? First you need the approval of at least one maintainer of each section changed by the patch. # How do I get approval for integration? First, assign the patch to yourself to indicate that you are leading the integration effort. Next, determine the list of categories corresponding to the areas changed by the patch. It may help if you list them in a comment attached to the patch. For each category in the list, assign the Category of the patch to that category. Then wait for a maintainer for that area to review the patch. If one approves it, then assign the next Category in the list. If maintainers for all areas on the list approve the same patch, you may integrate the patch into the official sources. If a maintainer rejects the patch, revise the patch to address his concerns. Then start the review again. Start with the maintainer who rejected the first patch to be sure his concerns are addressed first. Note that if the patch changes only the area you maintain, then you may immediately integrate the patch into the official sources once you are satisfied with it and it is registered in the Patch Manager. # The patch is approved. How should it be integrated? Get a CVS working directory that is up to date with the HEAD branch of the official source repository. Apply the patch to your working directory, and then 'cvs commit' the changes to the HEAD branch. At the same time you commit the patch, be sure to add an entry to the ChangeLog file describing the change. Follow the established format, which is derived from the GNU coding conventions. The description should be brief, but should describe the change reasonably completely. Include the SourceForge Bug and Patch ID numbers in the ChangeLog entry, but do not assume that the reader will have access to the Bug Tracker and Patch Manager to be able to understand the entry. You may assume the reader has access to the documentation. Finally, with the patch integrated, change the Status of the patch in the Patch Manager to Accepted. If any bugs were fixed by the patch, change their Resolution to Fixed, and their Status to Closed. # I want a patch review even though the patch changes only my area. Keep in mind that integrating a patch into the official sources is not an irreversible act. Commits to the HEAD branch will be checked out and tested by members of the Tcl community who are tracking Tcl/Tk development. Alpha and beta releases of Tcl/Tk that include your patch will also get your changes reviewed in practical settings. That said, if you really want a pre-commit review of your patch, you can add a comment to the patch asking for review. Someone will probably respond. It's up to your judgment how long to wait, keeping in mind that you are the maintainer, so your judgment on the quality of patches in your area is implicitly trusted. # What about CVS branches? When you integrate a patch into the official source code, you will usually 'cvs commit' the patch onto the HEAD branch. If the patch includes a feature change, it must \(except in unusual circumstances approved by the TCT\) be committed to the HEAD branch. The HEAD branch is the development branch from which alpha releases of Tcl/Tk are generated. At any time, there is also one or more _stable_ branches of development. As of February, 2001, the branch 'core-8-3-1-branch' indicates the sequence of revisions from which the 8.3.x releases of Tcl/Tk are generated. Since the Tcl Core Team took over development of Tcl/Tk, no changes have been committed to a stable branch, so we really have not established procedures on how we will decide what bug fixes should and should not be applied to the stable branch. It is possible that maintainers will be involved, though. It is also possible that a special team will be appointed to update the stable branch in preparation for the next stable release. In the case that you as a maintainer are asked to commit to the stable branch, be aware that the only patches that should be committed to a stable branch are those that fix bugs. No new features should be committed here. The other kind of branch is a _feature_ branch. This is a development branch on which a sequence of several revisions may be committed as work in progress on a new feature, or re-implementation of existing features. Typically a feature branch will be created if the effort... * ...touches on several functional areas; * ...is worked on jointly by several Developers; * ...is complex enough to require several revisions; * ...needs prototyping to determine the best TIP proposal to make; or * ...makes an incompatible change to Tcl/Tk that properly belongs on the next major version of Tcl/Tk before the HEAD branch has been designated for work toward the next major version. As a Developer, feel free to create a feature branch if you have a reason to use one. Make a note of your branch tags in [[31]](31.md). Avoid the use of a branch tag matching core-\* . Save the core-\* branch tags for the tags of official stable branches and releases. To avoid conflict with other Developers, consider using your SourceForge login name as a prefix on the feature branch tags you create. Try to also make the branch tag descriptive of the purpose of the branch. One big advantage of a feature branch is that any Developer may commit changes to a feature branch without all the publication, review, and approval overhead required when committing patches to the HEAD or stable branches. On the feature branches you can go through multiple revisions reasonably quickly and spend the administrative overhead only at the end when it is time to apply the finished product to the official branches. # What other things does a maintainer do? The tasks of fixing bugs and approving and committing patches to the official source code of Tcl and Tk are the core tasks that maintainers perform. That's all the job actually requires. You will probably want to keep an eye on the TCT's plans for Tcl/Tk development as well. If a TIP proposes a new feature in your area, it is in your interest to know about it, and propose revisions and improvements to it. Ultimately you will be asked to approve the patch that implements the new feature, and then you will be expected to maintain it, so if you have concerns about a proposal, it's best to make them known early. TCT members will probably ask your opinion on TIPs that propose changes to your area for this reason. # Comments Please add your comments here. > Well, since I drafted this SourceForge has replaced the Bug Tracker and Patch Manager with a _Tracker_. This TIP _really_ needs revision now. # Copyright This document has been placed in the public domain.

~~1 2 3 4 5 6 7 8 9 10 11~~ 12 13 14 15 16 17 18 19 20 ~~21 22 23~~ 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46	~~TIP: 352~~ T~~itle~~: Tcl Style Guide ~~Version: $Revision: 1.5 $~~ Author: Ray Johnson <[email protected]> Author: Donal K. Fellows <[email protected]> Author: Mark Janssen <[email protected]> State: Draft Type: Informative Vote: Pending Created: 14-Jul-2009 Post-History: ~~~ Abstract~~ This document describes a set of conventions that it is suggested people use when writing Tcl code. It is substantially based on the Tcl/Tk Engineering ~~Manual [~~247]~~.~~ ~~~~NOTE~~ ~~''A transcription of the original version (dated August 22, 1997) of this file into PDF is available online at http://www.tcl.tk/doc/styleGuide.pdf - Donal K. Fellows.''~~ ~~~ Introduction~~ This is a manual for people who are developing Tcl code for Wish or any other Tcl application. It describes a set of conventions for writing code and the associated test scripts. There are three reasons for the conventions. First, the conventions ensure that certain important things get done; for example, every procedure must have documentation that describes each of its arguments and its result, and there must exist test scripts that exercise every line of code. Second, the conventions guarantee that all of the Tcl and Tk code has a uniform style. This makes it easier for us to use, read, and maintain each other's code. Third, the conventions help to avoid some common mistakes by prohibiting error-prone constructs such as building lists by hand instead of using the list building procedures. ~~This document is based heavily on the ''Tcl/Tk Engineering Manual'' written by~~ John Ousterhout. John's engineering manual specified the style of the C code used in the implementation of Tcl/Tk and many of its extensions. The manual is very valuable to the development of Tcl/Tk and is an important reason why Tcl is a relatively easy system to maintain. Deciding any style standard involves making trade-offs that are usually subjective. This standard was created in an iterative process involving the	< \| < \| \| \| \| \| \| \| \| > \| \| \| \| \| \| \| \|	1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45	# TIP 352: Tcl Style Guide Author: Ray Johnson <[email protected]> Author: Donal K. Fellows <[email protected]> Author: Mark Janssen <[email protected]> State: Draft Type: Informative Vote: Pending Created: 14-Jul-2009 Post-History: ----- # Abstract This document describes a set of conventions that it is suggested people use when writing Tcl code. It is substantially based on the Tcl/Tk Engineering Manual [[247]](247.md). ## NOTE _A transcription of the original version \(dated August 22, 1997\) of this file into PDF is available online at <http://www.tcl.tk/doc/styleGuide.pdf> - Donal K. Fellows._ # Introduction This is a manual for people who are developing Tcl code for Wish or any other Tcl application. It describes a set of conventions for writing code and the associated test scripts. There are three reasons for the conventions. First, the conventions ensure that certain important things get done; for example, every procedure must have documentation that describes each of its arguments and its result, and there must exist test scripts that exercise every line of code. Second, the conventions guarantee that all of the Tcl and Tk code has a uniform style. This makes it easier for us to use, read, and maintain each other's code. Third, the conventions help to avoid some common mistakes by prohibiting error-prone constructs such as building lists by hand instead of using the list building procedures. This document is based heavily on the _Tcl/Tk Engineering Manual_ written by John Ousterhout. John's engineering manual specified the style of the C code used in the implementation of Tcl/Tk and many of its extensions. The manual is very valuable to the development of Tcl/Tk and is an important reason why Tcl is a relatively easy system to maintain. Deciding any style standard involves making trade-offs that are usually subjective. This standard was created in an iterative process involving the
︙			︙
64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 ~~101~~ 102 ~~103 104 105~~ 106 ~~107~~ 108 ~~109 110~~ 111 ~~112 113~~ 114 ~~115~~ 116 117 118 ~~119~~ 120 121 122 ~~123 124 125~~ 126 127 128 129 130 131 ~~132~~ 133 ~~134 135~~ 136 ~~137 138~~ 139 140 141 ~~142~~ 143 ~~144~~ 145 146 ~~147~~ 148 149 150 151 ~~152~~ 153 154 155 156 157 158 ~~159~~ 160 ~~161~~ 162 ~~163~~ 164 165 ~~166~~ 167 ~~168~~ 169 170 171 172 173 174 175 ~~176 177 178~~ 179 ~~180~~ 181 ~~182 183~~ 184 185 186 187 ~~188 189~~ 190 191 ~~192~~ 193 ~~194~~ 195 196 197 198 199 200 ~~201~~ 202 203 204 ~~205 206~~ 207 208 ~~209~~ 210 211 ~~212~~ 213 214 215 ~~216 217 218~~ 219 ~~220~~ 221 222 223 224 225 226 227 228 229 230 231 232 ~~233~~ 234 ~~235~~ 236 237 238 239 ~~240 241 242 243 244 245 246 247 248 249 250~~ ~~251 252 253 254 255 256 257 258 259 260~~ 261 262 263 264 265 266 267 ~~268~~ 269 270 271 272 ~~273 274~~ 275 276 ~~277~~ 278 279 280 281 282 283 ~~284 285~~ 286 ~~287~~ 288 289 290 ~~291~~ 292 293 294 ~~295~~ 296 297 298 ~~299~~ 300 301 ~~302~~ 303 304 305 ~~306 307~~ ~~308 309 310~~ ~~311 312 313 314 315 316 317 318 319~~ 320 321 322 323 324 325 326	and how to write procedure headers. Section 5 desribes the Tcl naming conventions. Section 6 presents low-level coding conventions, such as how to indent and where to put curly braces. Section 7 contains a collection of rules and suggestions for writing comments. Section 8 describes how to write and maintain test suites. Section 9 contains a few miscellaneous topics, such as keeping a change log. ~~~ Executable files~~ An executable is a file, collection of files, or some other collection of Tcl code and necessary runtime environment. Often referred to as applications, an executable is simply what you run to start your program. The format and exact make up of an executable is platform-specific. At some point, however, a Tcl ~~''start-up script'' will be evaluated. It is the start-up script that will~~ bootstrap any Tcl based application. ~~The role of the start-up script is to load any needed ''packages'', set up any~~ non-package specific state, and finally start the Tcl application by calling routines inside a Tcl package. If the start-up script is more than a few lines it should probably be a package itself. There are several ways to create executable scripts. Each major platform usually has a unique way of creating an executable application. Here is a brief description of how these applications should be created on each platform: 1. The most common method for creating executable applications on UNIX ~~platforms is the infamous ~~'''~~#!~~'''~~ mechanism built into most shells.~~ Unfortunately, the most common approach of just giving a path to wish is not recommended. Don't do: ~~\| #! /usr/local/tclsh8.0 -f "$0" "$@"~~ ~~> This method will not work if the file ~~'''~~tclsh~~'''~~ is another script that,~~ for example, locates and starts the most recent version of Tcl. It also ~~requires ~~'''~~tclsh~~'''~~ to be in a particular place, which makes the script~~ less portable. Instead, the following method should be used which calls ~~~~'''~~/bin/sh~~'''~~ which will in turn exec the ~~'''~~wish~~'''~~ application.~~ ~~\| #!/bin/sh \| # the next line restarts using wish \ \| exec wish8.0 "$0" "$@"~~ ~~> This example will actually locate the ~~'''~~wish~~'''~~ application in the user's~~ path which can be very useful for developers. The backslash is recognized ~~as part of a comment to ~~''sh''~~, but in Tcl the backslash continues the comment into the next line which keeps the ''exec'' command from executing~~ again. However, more stable sites would probably want to include the full ~~path instead of just ~~'''~~wish~~'''~~. Note that the version number of the ~~'''~~tclsh~~'''~~ or ~~'''~~wish~~'''~~ interpreter is usually added to the end of the~~ program name. This allows you use a specific version of Tcl. In addition, ~~many sites include a link of ~~'''~~wish~~'''~~ to the latest version currently~~ installed. This is useful if you know that your code will work on any version of Tcl. ~~2. On the Windows platform you only need to end a file with the ~~'''~~.tcl~~'''~~~~ extension and the file will be run when the user double clicks on the file. This is, of course, assuming you have installed Tcl/Tk. > Alternatively, you may create a ~~'''~~.bat~~'''~~ file which explicitly executes ~~'''~~tclsh~~'''~~ or ~~'''~~wish~~'''~~ with an absolute path to your start-up script. Please check the Windows documentation for more details about ~~'''~~.bat~~'''~~ files. 3. The Macintosh platform doesn't really have a notion of an executable Tcl file. One of the reasons for this is that, unlike UNIX or Windows, you can only run one instance of an application at a time. So instead of callingwish with a specific script to load, we must create a copy of the ~~~~'''~~wish~~'''~~ application that is tied to our script.~~ ~~> The easiest way to do this is to use the application ''Drag&Drop Tclets'' or the ''SpecTcl'' GUI builder which can do this work for you. You can~~ also do this by hand by putting the start-up script into a TEXT resource ~~and name it ''tclshrc'' - which ensures it gets sourced on start-up. This can be done with ''ResEdit'' (a tool provided by Apple) or other tools~~ that manipulate resources. Additional scripts can also be placed in TEXT resource to make the application completely contained. ~~~ Packages and namespaces~~ ~~Tcl applications consist of collections of ''packages''. Each package provides~~ code to implement a related set of features. For example, Tcl itself is a package, as is Tk; these packages happen to be implemented in both C and Tcl. ~~Other packages are implemented completely in Tcl such as the ~~'''~~http~~'''~~~~ package included in the Tcl distribution. Packages are the units in which code is developed and distributed: a single package is typically developed by a single person or group and distributed as a unit. It is possible to combine many independently-developed packages into a single application; packages ~~should be designed with this in mind. The notion of ''namespaces'' were~~ created to help make this easier. Namespaces help to hide private aspects of packages and avoid name collisions. A package will generally export one public namespace which will include all state and routines that are associated with the package. A package should not contain any global variables or global procedures. Side effects when loading a package should be avoided. This document will focus on packages written entirely in Tcl. For a discussion of ~~packages built in C or C and Tcl see the ''Tcl/Tk Engineering Manual''.~~ ~~~~ Package names~~ ~~Each package should have a unique ''name''. The name of the package is used to~~ identify the package. It is also used as the name of the namespace that the package exports. It is best to have a simple one word name in all lower-case ~~like ~~'''~~http~~'''~~. Multi-word names are ok as well. Additional words should~~ just be concatenated with the first word but start with a capital letter like ~~~~'''~~specMenu~~'''~~.~~ Coming up with a unique name for your package requires a collaborative component. For internal projects this is an easy task and can usually be decided among the management or principal engineers in your organization. For packages you wish to publish, however, you should make an effort to make sure that an existing package isn't already using the same name you are. This can often be done by checking the comp.lang.tcl newsgroup or the standard Tcl ftp ~~sites. It is also suggested (but not required) that you register your name on the NIST Identifier Collaboration Service ~~(NICS~~). It is located at: http://pitch.nist.gov/nics~~ ~~~~ Version numbers~~ ~~Each package has a two-part version number such as 7.4. The first number (7) is called the major version number and the second (4) is called the minor~~ version number. The version number changes with each public release of the package. If a new release contains only bug fixes, new features, and other upwardly compatible changes, so that code and scripts that worked with the old version will also work with the new version, then the minor version number ~~increments and the major version number stays the same (e.g., from 7.4 to 7.5). If the new release contains substantial incompatibilities, so that~~ existing code and scripts will have to be modified to run with the new version, then the major version number increments and the minor version number ~~resets to zero (e.g., from 7.4 to 8.0).~~ ~~~~ Package namespaces~~ As of version 8.0, Tcl supports namespaces to hide the internal structure of a package. This helps avoid name collisions and provides a simpler way to manage packages. All packages written for Tcl 8.0 or newer should use namespaces. The name of the name space should be the same as the package name. ~~~~ Structure~~ There are a couple of ways to deploy a package of Tcl commands. * A ~~'''~~pkgIndex.tcl~~'''~~ file is used to create ''packages'' that can be loaded on demand by any Tcl script. Like a ~~'''~~tclIndex~~'''~~ file, a package specifies a set of Tcl and/or shared libraries that can be loaded when needed. A package, however, must be explicitly requested by using the ~~~~'''~~package require~~'''~~ command. You can use the ~~'''~~pkg_mkIndex~~'''~~ command to~~ create a package index file for your use. In most cases, particularly in code you distribute to others, it is better to use a package instead of ~~the ~~'''~~tclIndex~~'''~~ auto-loading mechanism.~~ * On the Macintosh platform, shared libraries can be made into self contained packages. You simply need to add a TEXT resource with the name of ~~'''~~pkgIndex~~'''~~. It will be treated in the exact same fashion as a ~~'''~~pkgIndex.tcl~~'''~~ file. The ~~'''~~pkgIndex~~'''~~ resource should have the same format as the ~~'''~~pkgIndex.tcl~~'''~~ file. ~~~ How to organize a code file~~ Each source code file should either contain an entire application or a set of related procedures that make up a package or a another type of identifiable module, such as the implementation of the menus for your application, or a set of procedures to implement HTTP access. Before writing any code you should think carefully about what functions are to be provided and divide them into files in a logical way. The most manageable size for files is usually in the range of 500-2000 lines. If a file gets much larger than this, it will be hard to remember everything that the file does. If a file is much shorter than this, then you may end up with too many files in a directory, which is also hard to manage. ~~~~ The file header~~ ~~The first part of a code file is referred to as the ''header''. It contains~~ overall information that is relevant throughout the file. It consists of everything but the definitions of the file's procedures. The header typically has four parts, as shown below: \| / # specMenu.tcl -- \| \| # \|Abstract \| # This file implements the Tcl code for creating and \| \| # managing the menus in the SpecTcl application. \| \ # \| / # Copyright (c) 1994-1997 Sun Microsystems, Inc. \| \| # \|Copyright \| # See the file "license.terms" for information on usage and \| \| # redistribution of this file, and for a DISCLAIMER OF ALL \| \ # WARRANTIES. ~~\| #~~ \|Revision # SCCS: %Z% %M% %I% %E% %U% \|String # RCS: Id \| \| / package require specTable \|Package \| package provide specMenu 1.0 \|Definition \| namespace eval specMenu { \| \| namespace export addMenu \| \| array set menuData {one two three} \| \| ... \| \ } Abstract: The first few lines give the name of the file and a brief description of the overall functions provided by the file, just as in header files. Copyright notice: The notice protects ownership of the file. The copyright shown above is included in the Tcl and Tk sources. More product specific ~~packages would probably have the words ''All rights reserved included''~~ instead. If more than one entity contributed to the page they should each have a distinct copyright line. Revision string: The contents of this string are managed automatically by the ~~source code control system for the file, such as RCS or SCCS (both are shown in the example). It identifies the file's current revision, date of~~ last modification, and so on. ~~Package definition: Also any ~~'''~~require~~'''~~ statements for other packages that~~ this package depends on should be the first code in the file. Any global variables that are managed by this file should be declared at the top of the page. The name space definition should be next and the export list should be the first item in the namespace definition. Please structure your header pages in exactly the order given above and follow ~~the syntax of the example as closely as possible. The file ~~'''~~fileHead.tcl~~'''~~ [~~[''~~not available~~'']~~] provides a template for a header page.~~ ~~~~ Multi-file packages~~ Some packages may be too large to fit into one file. You may want to consider breaking the package into multiple independent packages. However, when that is ~~not an option you need to make one of the files the ''primary'' file. The~~ primary file will include the complete export list and the definitions of all exported variables and procedures. The secondary files should only contain supporting routines to the primary file. It is important to construct your ~~package in this manner or utilities like ~~'''~~pkg_mkIndex~~'''~~ will not work~~ correctly. Finally, the header to the various files should make it clear which file is the primary file and which are supporting files. ~~~~ Procedure headers~~ After the header you will have one or more procedures. Each procedure will ~~begin with a ''procedure header'' that gives overall documentation for the~~ procedure, followed by the declaration and body for the procedure. See below for an example. ~~\|# tcl::HistRedo -- \|#~~ ~~\|# Fetch the previous or specified event, execute it, and then \|# replace the current history item with that event. \|#~~ \|# Arguments: \|# event (optional) index of history item to redo. Defaults \|# to -1, which means the previous event. \|# Results: \|# The result is that of the command being redone. Also replaces \|# the current history list item with the one being redone. \|proc tcl::HistRedo {{event -1}} { \| ... \|} The header should contain everything that a caller of the procedure needs to know in order to use the procedure, and nothing else. It consists of three parts: Abstract: The first lines in the header give the procedure's name, followed by a brief description of what the procedure does. This should not be a	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| < > \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| < > \| \| < > \| \| \| \| \| \| \| \| < >	63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325	and how to write procedure headers. Section 5 desribes the Tcl naming conventions. Section 6 presents low-level coding conventions, such as how to indent and where to put curly braces. Section 7 contains a collection of rules and suggestions for writing comments. Section 8 describes how to write and maintain test suites. Section 9 contains a few miscellaneous topics, such as keeping a change log. # Executable files An executable is a file, collection of files, or some other collection of Tcl code and necessary runtime environment. Often referred to as applications, an executable is simply what you run to start your program. The format and exact make up of an executable is platform-specific. At some point, however, a Tcl _start-up script_ will be evaluated. It is the start-up script that will bootstrap any Tcl based application. The role of the start-up script is to load any needed _packages_, set up any non-package specific state, and finally start the Tcl application by calling routines inside a Tcl package. If the start-up script is more than a few lines it should probably be a package itself. There are several ways to create executable scripts. Each major platform usually has a unique way of creating an executable application. Here is a brief description of how these applications should be created on each platform: 1. The most common method for creating executable applications on UNIX platforms is the infamous \#! mechanism built into most shells. Unfortunately, the most common approach of just giving a path to wish is not recommended. Don't do: #! /usr/local/tclsh8.0 -f "$0" "$@" > This method will not work if the file tclsh is another script that, for example, locates and starts the most recent version of Tcl. It also requires tclsh to be in a particular place, which makes the script less portable. Instead, the following method should be used which calls /bin/sh which will in turn exec the wish application. #!/bin/sh # the next line restarts using wish \ exec wish8.0 "$0" "$@" > This example will actually locate the wish application in the user's path which can be very useful for developers. The backslash is recognized as part of a comment to _sh_, but in Tcl the backslash continues the comment into the next line which keeps the _exec_ command from executing again. However, more stable sites would probably want to include the full path instead of just wish. Note that the version number of the tclsh or wish interpreter is usually added to the end of the program name. This allows you use a specific version of Tcl. In addition, many sites include a link of wish to the latest version currently installed. This is useful if you know that your code will work on any version of Tcl. 2. On the Windows platform you only need to end a file with the .tcl extension and the file will be run when the user double clicks on the file. This is, of course, assuming you have installed Tcl/Tk. > Alternatively, you may create a .bat file which explicitly executes tclsh or wish with an absolute path to your start-up script. Please check the Windows documentation for more details about .bat files. 3. The Macintosh platform doesn't really have a notion of an executable Tcl file. One of the reasons for this is that, unlike UNIX or Windows, you can only run one instance of an application at a time. So instead of callingwish with a specific script to load, we must create a copy of the wish application that is tied to our script. > The easiest way to do this is to use the application _Drag&Drop Tclets_ or the _SpecTcl_ GUI builder which can do this work for you. You can also do this by hand by putting the start-up script into a TEXT resource and name it _tclshrc_ - which ensures it gets sourced on start-up. This can be done with _ResEdit_ \(a tool provided by Apple\) or other tools that manipulate resources. Additional scripts can also be placed in TEXT resource to make the application completely contained. # Packages and namespaces Tcl applications consist of collections of _packages_. Each package provides code to implement a related set of features. For example, Tcl itself is a package, as is Tk; these packages happen to be implemented in both C and Tcl. Other packages are implemented completely in Tcl such as the http package included in the Tcl distribution. Packages are the units in which code is developed and distributed: a single package is typically developed by a single person or group and distributed as a unit. It is possible to combine many independently-developed packages into a single application; packages should be designed with this in mind. The notion of _namespaces_ were created to help make this easier. Namespaces help to hide private aspects of packages and avoid name collisions. A package will generally export one public namespace which will include all state and routines that are associated with the package. A package should not contain any global variables or global procedures. Side effects when loading a package should be avoided. This document will focus on packages written entirely in Tcl. For a discussion of packages built in C or C and Tcl see the _Tcl/Tk Engineering Manual_. ## Package names Each package should have a unique _name_. The name of the package is used to identify the package. It is also used as the name of the namespace that the package exports. It is best to have a simple one word name in all lower-case like http. Multi-word names are ok as well. Additional words should just be concatenated with the first word but start with a capital letter like specMenu. Coming up with a unique name for your package requires a collaborative component. For internal projects this is an easy task and can usually be decided among the management or principal engineers in your organization. For packages you wish to publish, however, you should make an effort to make sure that an existing package isn't already using the same name you are. This can often be done by checking the comp.lang.tcl newsgroup or the standard Tcl ftp sites. It is also suggested \(but not required\) that you register your name on the NIST Identifier Collaboration Service \(NICS\). It is located at: <http://pitch.nist.gov/nics> ## Version numbers Each package has a two-part version number such as 7.4. The first number \(7\) is called the major version number and the second \(4\) is called the minor version number. The version number changes with each public release of the package. If a new release contains only bug fixes, new features, and other upwardly compatible changes, so that code and scripts that worked with the old version will also work with the new version, then the minor version number increments and the major version number stays the same \(e.g., from 7.4 to 7.5\). If the new release contains substantial incompatibilities, so that existing code and scripts will have to be modified to run with the new version, then the major version number increments and the minor version number resets to zero \(e.g., from 7.4 to 8.0\). ## Package namespaces As of version 8.0, Tcl supports namespaces to hide the internal structure of a package. This helps avoid name collisions and provides a simpler way to manage packages. All packages written for Tcl 8.0 or newer should use namespaces. The name of the name space should be the same as the package name. ## Structure There are a couple of ways to deploy a package of Tcl commands. * A pkgIndex.tcl file is used to create _packages_ that can be loaded on demand by any Tcl script. Like a tclIndex file, a package specifies a set of Tcl and/or shared libraries that can be loaded when needed. A package, however, must be explicitly requested by using the package require command. You can use the pkg\_mkIndex command to create a package index file for your use. In most cases, particularly in code you distribute to others, it is better to use a package instead of the tclIndex auto-loading mechanism. * On the Macintosh platform, shared libraries can be made into self contained packages. You simply need to add a TEXT resource with the name of pkgIndex. It will be treated in the exact same fashion as a pkgIndex.tcl file. The pkgIndex resource should have the same format as the pkgIndex.tcl file. # How to organize a code file Each source code file should either contain an entire application or a set of related procedures that make up a package or a another type of identifiable module, such as the implementation of the menus for your application, or a set of procedures to implement HTTP access. Before writing any code you should think carefully about what functions are to be provided and divide them into files in a logical way. The most manageable size for files is usually in the range of 500-2000 lines. If a file gets much larger than this, it will be hard to remember everything that the file does. If a file is much shorter than this, then you may end up with too many files in a directory, which is also hard to manage. ## The file header The first part of a code file is referred to as the _header_. It contains overall information that is relevant throughout the file. It consists of everything but the definitions of the file's procedures. The header typically has four parts, as shown below: / # specMenu.tcl -- \| # Abstract \| # This file implements the Tcl code for creating and \| # managing the menus in the SpecTcl application. \ # / # Copyright (c) 1994-1997 Sun Microsystems, Inc. \| # Copyright \| # See the file "license.terms" for information on usage and \| # redistribution of this file, and for a DISCLAIMER OF ALL \ # WARRANTIES. # Revision # SCCS: %Z% %M% %I% %E% %U% String # RCS: Id / package require specTable Package \| package provide specMenu 1.0 Definition \| namespace eval specMenu { \| namespace export addMenu \| array set menuData {one two three} \| ... \ } Abstract: The first few lines give the name of the file and a brief description of the overall functions provided by the file, just as in header files. Copyright notice: The notice protects ownership of the file. The copyright shown above is included in the Tcl and Tk sources. More product specific packages would probably have the words _All rights reserved included_ instead. If more than one entity contributed to the page they should each have a distinct copyright line. Revision string: The contents of this string are managed automatically by the source code control system for the file, such as RCS or SCCS \(both are shown in the example\). It identifies the file's current revision, date of last modification, and so on. Package definition: Also any require statements for other packages that this package depends on should be the first code in the file. Any global variables that are managed by this file should be declared at the top of the page. The name space definition should be next and the export list should be the first item in the namespace definition. Please structure your header pages in exactly the order given above and follow the syntax of the example as closely as possible. The file fileHead.tcl [_not available_] provides a template for a header page. ## Multi-file packages Some packages may be too large to fit into one file. You may want to consider breaking the package into multiple independent packages. However, when that is not an option you need to make one of the files the _primary_ file. The primary file will include the complete export list and the definitions of all exported variables and procedures. The secondary files should only contain supporting routines to the primary file. It is important to construct your package in this manner or utilities like pkg\_mkIndex will not work correctly. Finally, the header to the various files should make it clear which file is the primary file and which are supporting files. ## Procedure headers After the header you will have one or more procedures. Each procedure will begin with a _procedure header_ that gives overall documentation for the procedure, followed by the declaration and body for the procedure. See below for an example. # tcl::HistRedo -- # # Fetch the previous or specified event, execute it, and then # replace the current history item with that event. # # Arguments: # event (optional) index of history item to redo. Defaults # to -1, which means the previous event. # Results: # The result is that of the command being redone. Also replaces # the current history list item with the one being redone. proc tcl::HistRedo {{event -1}} { ... } The header should contain everything that a caller of the procedure needs to know in order to use the procedure, and nothing else. It consists of three parts: Abstract: The first lines in the header give the procedure's name, followed by a brief description of what the procedure does. This should not be a
︙			︙
334 335 336 337 338 339 340 ~~341~~ 342 343 ~~344~~ 345 ~~346 347~~ 348 ~~349~~ 350 351 352 353 ~~354 355 356~~ 357 ~~358~~ 359 ~~360 361 362~~ 363 ~~364~~ 365 366 367 368 369 370 371 372 ~~373 374 375 376 377~~ ~~378~~ 379 380 381 ~~382 383~~ 384 ~~385~~ 386 387 388 389 390 391 ~~392~~ 393 394 395 396 397 398 399 ~~400~~ 401 402 403 404 405 406 407 408 ~~409~~ 410 411 412 413 ~~414 415~~ 416 417 418 ~~419~~ 420 421 422 ~~423~~ 424 425 ~~426 427~~ 428 429 ~~430~~ 431 432 433 434 435 436 ~~437~~ 438 ~~439~~ 440 441 442 ~~443 444 445 446 447 448~~ 449 450 451 452 453 ~~454~~ 455 456 ~~457~~ 458 459 460 ~~461 462 463 464 465~~ ~~466 467~~ 468 ~~469 470 471 472~~ 473 474 475 ~~476~~ 477 ~~478 479 480 481~~ ~~482~~ 483 484 485 486 487 ~~488~~ 489 490 491 492 493 494 495 ~~496~~ 497 498 499 500 501 502 503 504 505 506 507 508 509 ~~510 511 512 513 514 515 516 517 518~~ ~~519 520 521 522 523 524 525 526 527 528 529~~ ~~530~~ 531 532 533 534 535 536 537 ~~538~~ 539 540 ~~541~~ 542 543 ~~544~~ 545 546 547 548 549 ~~550~~ 551 552 553 554 555 556 557 558 559 560 561 562 ~~563~~ 564 565 566 ~~567 568 569~~ 570 571 572 573 574 575 ~~576~~ 577 578 ~~579 580~~ 581 582 ~~583~~ 584 585 586 ~~587~~ 588 ~~589~~ 590 ~~591~~ 592 593 594 ~~595~~ 596 597 598 599 ~~600 601 602 603 604 605 606 607~~ 608 609 ~~610 611 612 613 614 615 616 617~~ ~~618~~ 619 620 ~~621~~ 622 ~~623 624~~ 625 626 627 628 629 ~~630 631 632 633 634~~ ~~635 636 637~~ ~~638 639 640 641 642~~ ~~643~~ 644 ~~645 646~~ 647 648 649 ~~650 651 652 653 654 655 656 657~~ ~~658~~ 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 ~~674~~ 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 ~~695~~ 696 ~~697 698 699~~ 700 ~~701 702 703~~ 704 705 706 707 708 ~~709~~ 710 711 712 ~~713 714 715~~ 716 717 718 719 720 721 722 723 ~~724 725~~ ~~726 727~~ 728 729 730 731 ~~732 733~~ ~~734 735 736 737~~ 738 ~~739 740~~ 741 ~~742~~ 743 744 745 746 747 748 749 ~~750 751~~ 752 753 754 755 ~~756~~ 757 758 759 760 761 762 ~~763~~ 764 765 766 767 768 769 770 771 772 ~~773~~ 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 ~~794~~ 795 796 797 798 799 800 801 802 803 ~~804~~ 805 806 807 808 809 810 811 812 813 814 815 ~~816~~ 817 818 819 ~~820~~ 821 ~~822 823 824 825 826 827~~ 828 ~~829 830~~ 831 832 ~~833~~ 834 835 836 ~~837 838 839 840 841 842 843 844~~ 845 ~~846~~ 847 848 849 ~~850~~ 851 ~~852 853 854 855 856~~ 857 858 859 ~~860~~ 861 862 863 864 865 ~~866~~ 867 868 869 ~~870~~ 871 872 873 ~~874 875 876 877 878~~ 879 880 881 882 883 884 885 886 887 888 889 ~~890~~ 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 ~~906~~ 907 908 909 910 911 912 913 ~~914~~ 915 916 917 918 ~~919~~ 920 921 922 923 924 ~~925~~ 926 927 928 929 930 931 932 ~~933~~ 934 935 936 937 ~~938~~ 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 ~~957~~ 958 ~~959~~ 960 961 962 963 ~~964~~ 965 ~~966~~ 967 968 969 970 ~~971~~ 972 973 ~~974~~ 975 976 ~~977~~ 978 979 ~~980~~ 981 982 ~~983~~ 984 ~~985~~ 986 ~~987 988 989 990 991 992 993 994 995 996 997~~ 998 ~~999~~ 1000 1001 ~~1002~~ 1003 1004 1005 ~~1006~~ 1007 1008 ~~1009~~ 1010 ~~1011~~ 1012 ~~1013~~ 1014	comment should describe the expected type and describe it's function. Optional arguments should be pointed out and the default behavior of an unspecified argument should be mentioned. Comments for all of the arguments should line up on the same tab stop. Results: The last part of the header describes the value returned by the procedure. The type and the intended use of the result should be ~~described. This section should also mention any ''side effects'' that are~~ worth noting. ~~The file ~~'''~~tclProcHead~~''' [[''~~not available~~'']~~] contains a template for a~~ procedure header which should be used as a base for all new Tcl commands. ~~Follow the syntax of the above example exactly (same indentation, double-dash after the procedure name, etc.).~~ ~~~~ Procedure declarations~~ The procedure declaration should also follow exactly the syntax in the example above. Note that the procedure is defined outside the namespace command that defines the export list and namespace globals. The first line gives the ~~~~'''~~proc~~'''~~ keyword, the procedure name, and an argument list. If there are many arguments, they may spill onto additional lines (see Sections 6.1 and 6.3 for information about indentation).~~ ~~~~ Parameter order~~ Procedure parameters may be divided into three categories. ~~''In''~~ parameters only pass information into the procedure (either directly or by pointing to information that the procedure reads). ~~''Out''~~ parameters point to things in the caller's memory that the procedure modifies such as the name of a variable ~~the procedure will modify. ''In-out'' parameters do both. Below is a set of~~ rules for deciding on the order of parameters to a procedure: 1. Parameters should normally appear in the order in, in/out, out, except where overridden by the rules below. 2. If an argument is actually a sub-command for the command than it should be the first argument of the command. For example: ~~\| proc graph::tree {subCmd args} { \| switch $subCmd { \| add { \| eval add_node $args ~~\| }~~~~ ~~\| draw {...~~ 3. If there is a group of procedures, all of which operate on an argument of a particular type, such as a file path or widget path, the argument should ~~be the first argument to each of the procedures (or after the sub-command argument).~~ ~~~~ Procedure bodies~~ The body of a procedure follows the declaration. See Section 6 for the coding conventions that govern procedure bodies. The curly braces enclosing the body should be on different lines, as shown in the examples above, even if the body of the procedure is empty. ~~~ Naming conventions~~ Choosing names is one of the most important aspects of programming. Good names clarify the function of a program and reduce the need for other documentation. Poor names result in ambiguity, confusion, and error. This section gives some general principles to follow when choosing names and lists specific rules for name syntax, such as capitalization. ~~~~ General considerations~~ The ideal variable name is one that instantly conveys as much information as possible about the purpose of the variable it refers to. When choosing names, play devil's advocate with yourself to see if there are ways that a name might be misinterpreted or confused. Here are some things to consider: 1. Are you consistent? Use the same name to refer to the same thing everywhere. For example, within the code for handling standard bindings in ~~Tk widgets, a standard name ~~'''w'''~~ is always used to refer to the window~~ associated with the current event. 2. If someone sees the name out of context, will they realize what it stands for, or could they confuse it with something else? For example, the ~~procedure name ~~'''~~buildStructure~~'''~~ could get confused with some other part of the system. A name like ~~'''~~buildGraphNode~~'''~~ both describes what~~ part of the system it belongs to and what it is probably used for. 3. Could this name be confused with some other name? For example, it's ~~probably a mistake to have two variables ~~'''~~str~~'''~~ and ~~'''~~string~~'''~~ in the~~ same procedure: it will be hard for anyone to remember which is which. Instead, change the names to reflect their functions. For example, if the strings are used as source and destination for a copy operation, name ~~them ~~'''~~src~~'''~~ and ~~'''~~dst~~'''~~.~~ 4. Is the name so generic that it doesn't convey any information? The ~~variable ~~'''~~str~~'''~~ from the previous paragraph is an example of this; changing its name to ~~'''~~src~~'''~~ makes the name less generic and hence~~ conveys more information. ~~~~ Basic syntax rules~~ Below are some specific rules governing the syntax of names. Please follow the rules exactly, since they make it possible to determine certain properties of a variable just from its name. 1. Exported names for both procedures and variables always start with a ~~''lower''-case letter. Procedures and variables that are meant only for~~ use with in the current package or namespace should start with an ~~''upper''-case letter. We chose lower-case for the exported symbols~~ because it is possible they may be commonly used from the command line and they should be easy to write. For example: \| # CountNum is a private variable \| set CountNum 0 \| # The function addWindow is public \| proc addWindow {} {... \| # newWindow is a public interface in the spectcl namespace \| proc spectcl::newWindow {} {... 2. In multi-word names, the first letter of each trailing word is capitalized. Do not use underscores or dashes as separators between the words of a name. ~~\| set numWindows 0~~ 3. Any variable whose value refers to another variable has a name that ends ~~in ~~'''~~Name~~'''~~. Furthermore, the name should also indicate what type of~~ variable the name is referring to. These names are often used in arguments to procedures that are taking a name of a variable. ~~\| proc foo::Bar {arrayName} { \| upvar 1 $arrayName array \| ... ~~\| }~~~~ ~~4. Variables that hold Tcl code that will be ~~'''~~eval~~'''~~ed should have names ending in ~~'''~~Script~~'''~~.~~ ~~\| proc log::eval {logScript} { \| if {$Log::logOn} { \| set result [catch {eval $logScript} msg] \| ...~~ 5. Variables that hold a partial Tcl command that must have additional arguments appended before being a valid script should have names ending in ~~~~'''~~Cmd~~'''~~.~~ ~~\| foreach scrollCmd $listScrollCmds { \| eval $scrollCmd $args ~~\| }~~~~ ~~~ Low-level coding conventions~~ This section describes several low-level syntactic rules for writing Tcl code. These rules help to ensure that all of the Tcl code looks the same, and they prohibit a few confusing coding constructs. ~~~~ Indents are 4 spaces~~ Each level of indentation should be four spaces. There are ways to set 4-space indents in all of the most common editors. Be sure that your editor really uses four spaces for the indent, rather than just displaying tabs as four spaces wide; if you use the latter approach then the indents will appear eight spaces wide in other editors. ~~~~ Code comments occupy full lines~~ Comments that document code should occupy full lines, rather than being tacked onto the ends of lines containing code. The reason for this is that side-by-side comments are hard to see, particularly if neighboring statements are long enough to overlap the side-by-side comments. Also it is easy to place comments in a place that could cause errors. Comments must have exactly the structure shown in the example below, with a blank line above and below the comment. The leading blank line can be omitted if the comment is at the beginning of a block, as is the case in the second comment in the example below. Each comment should be indented to the same level as the surrounding code. Use proper English in comments: write complete sentences, capitalize the first word of each sentence, and so on. \|# If we are running on the Macintosh platform then we can \|# assume that the sources are located in the resource fork \|# of our application, and we do not need to search for them. \|# Note that there is a blank line below it to separate it \|# more strongly from the code. \| \|if {$tcl_platform(platform) == "macintosh"} { \| return \|} \| \|foreach dir $dirList { \| # If the source succeds then we are done. \| # Note there is no blank line above the comment; \| # the indentation change is visible enough. \| \| if {![catch {source [file join $dir file.tcl]}]} { \| break ~~\| }~~ \|} ~~~~ Continuation lines are indented 8 spaces~~ You should use continuation lines to make sure that no single line exceeds 80 characters in length. Continuation lines should be indented 8 spaces so that they won't be confused with an immediately-following nested block. Pick clean places to break your lines for continuation, so that the continuation doesn't obscure the structure of the statement. For example, if a procedure call requires continuation lines, try to avoid situations where a single argument ~~spans multiple lines. If the test for an ~~'''~~if~~'''~~ or ~~'''~~while~~'''~~ command spans~~ lines, try to make each line have the same nesting level of parentheses and/or brackets if possible. I try to start each continuation line with an operator ~~such as ~~'''~~~~'''~~, ~~'''~~&&~~'''~~, or ~~'''~~\|~~\|'''~~; this makes it clear that the line is a~~ continuation, since a new statement would never start with such an operator. ~~~~ Only one command per line~~ You should only have one Tcl command per line on the page. Do not use the semi-colon character to place multiple commands on the same line. This makes the code easier to read and helps with debugging. ~~~~ Curly braces: { goes at the end of a line~~ Open curly braces can not appear on lines by themselves in Tcl. Instead, they should be placed at the end of the preceding line. Close curly braces are indented to the same level as the outer code, i.e., four spaces less than the statements they enclose. However, you shouldalways use curly braces rather than some other list generating mechanism that will work in the Tcl language. This will help make code more readable, will avoid unwanted side effects, and in many cases will generate faster code with the Tcl compiler. Control structures should always use curly braces, even if there is only one statement in the block. Thus you shouldn't write code like ~~\| if {$tcl_platform(platform) == "unix"} return~~ but rather ~~\| if {$tcl_platform(platform) == "unix"} { \| return ~~\| }~~~~ This approach makes code less dense, but it avoids potential mistakes like unwanted Tcl substitutions. It also makes it easier to set breakpoints in a debugger, since it guarantees that each statement is on a separate line and can be named individually. ~~~~ Parenthesize expressions~~ Use parentheses around each subexpression in an expression to make it ~~absolutely clear what is the evaluation order of the expression (a reader of your code should not need to remember Tcl's precedence rules). For example,~~ don't type ~~\| if {$x > 22 && $y <= 47} ...~~ Instead, type this: ~~\| if {($x > 22) && ($y <= 47)} ...~~ ~~~~ Always use the return statement~~ ~~You should always explicitly use the ~~'''~~return~~'''~~ statement to return values~~ from a Tcl procedure. By default Tcl will return the value of the last Tcl statement executed in a Tcl procedure as the return value of the procedure which often leads to confusion as to where the result is coming from. In ~~addition, you should use a ~~'''~~return~~'''~~ statement with no argument for~~ procedures whose results are ignored. Supplying this return will actually speed up your application with the new Tcl compiler. For example, don't write code like this: ~~\| proc foo {x y} { \| if {$x < 0} { \| incr x \| } else { \| expr $x + $y ~~\| }~~ ~~\| }~~~~ But rather, type this: ~~\| proc foo {x y} { \| if {$x < 0} { \| return [incr x] \| } else { \| return [expr $x + $y] ~~\| }~~ ~~\| }~~~~ ~~For Tcl procedures that have no return value a single ~~'''~~return~~'''~~ statement~~ with no arguments is placed at the end of the procedure. ~~~~ Switch statements~~ ~~The ~~'''~~switch~~'''~~ statement should be formatted as below. Always use the ~~'''~~--~~'''~~ option to avoid having the string be confused with an option. This~~ can happen when the string is user generated. Comments can be added on the same line as the pattern to comment the pattern case. The comments for each case should line up on the same tab stop and must be within the braces. Note that this is an exception to the standard commenting conventions. ~~\| switch -regexp -- $string { \| plus - \| add { # Do add task \| ... ~~\| }~~~~ ~~\| subtract { # Do subtract case \| ... ~~\| }~~~~ ~~\| default { \| ... ~~\| }~~ ~~\| }~~~~ ~~~~ If statements~~ ~~Never use the ~~'''~~then~~'''~~ word of an ~~'''~~if~~'''~~ statement. It is syntactic sugar that really isn't that useful. However, the ~~'''~~else~~'''~~ word should always be~~ used as it does impart some semantic information and it is more like the C language. Here is an example: ~~\| if {$x < 0} { \| ... \| } elseif {$x == 0} { \| ... \| } else { \| ... ~~\| }~~~~ ~~~ Documenting code~~ The purpose of documentation is to save time and reduce errors. Documentation is typically used for two purposes. First, people will read the documentation to find out how to use your code. For example, they will read procedure headers to learn how to call the procedures. Ideally, people should have to learn as little as possible about your code in order to use it correctly. Second, people will read the documentation to find out how your code works internally, so they can fix bugs or add new features; again, good documentation will allow them to make their fixes or enhancements while learning the minimum possible about your code. More documentation isn't necessarily better: wading through pages of documentation may not be any easier than deciphering the code. Try to pick out the most important things that will help people to understand your code and focus on these in your documentation. ~~~~ Document things with wide impact~~ The most important things to document are those that affect many different pieces of a program. Thus it is essential that every procedure interface, every structure declaration, and every global variable be documented clearly. If you haven't documented one of these things it will be necessary to look at all the uses of the thing to figure out how it's supposed to work; this will be time-consuming and error-prone. On the other hand, things with only local impact may not need much documentation. For example, in short procedures I don't usually have comments explaining the local variables. If the overall function of the procedure has been explained, and if there isn't much code in the procedure, and if the variables have meaningful names, then it will be easy to figure out how they are used. On the other hand, for long procedures with many variables I usually document the key variables. Similarly, when I write short procedures I don't usually have any comments in the procedure's code: the procedure header provides enough information to figure out what is going on. For long procedures I place a comment block before each major piece of the procedure to clarify the overall flow through the procedure. ~~~~ Don't just repeat what's in the code~~ ~~The most common mistake I see in documentation (besides it not being there at all) is that it repeats what is already obvious from the code, such as this trivial (but exasperatingly common) example:~~ ~~\| # Increment i. \| \| incr i~~ Documentation should provide higher-level information about the overall function of the code, helping readers to understand what a complex collection of statements really means. For example, the comment ~~\| # Probe into the array to see if the symbol exists.~~ is likely to be much more helpful than ~~\| # Loop through every array index, get the third value of the \| # list in the content to determine if it has the symbol we are \| # looking for. Set the result to the symbol if we find it.~~ Everything in this second comment is probably obvious from the code that follows it. Another thing to consider in your comments is word choice. Use different words in the comments than the words that appear in variable or procedure names. For example, the comment ~~\| # SwapPanels -- ~~\| #~~~~ ~~\| # Swap the panels. \| # ...~~ is not a very useful comment. Everything in the comment is already obvious from the procedure's name. Here is a much more useful comment: ~~\| # SwapPanels -- ~~\| #~~~~ ~~\| # Unmap the current UI panel from the parent frame and replace \| # it with the newly specified frame. Make sure that the new \| # panel fits into the old frame and resize if needed. \| # ...~~ ~~This comment tells ~~''why''~~ you might want to use the procedure, in addition to ''what'' it does, which makes the comment much more useful.~~ ~~~~ Document each thing in exactly one place~~ Systems evolve over time. If something is documented in several places, it will be hard to keep the documentation up to date as the system changes. Instead, try to document each major design decision in exactly one place, as near as possible to the code that implements the design decision. The principal documentation for each procedure goes in the procedure header. There's no need to repeat this information again in the body of the procedure ~~(but you might have additional comments in the procedure body to fill in details not described in the procedure header). If a library procedure is~~ documented thoroughly in a manual entry, then I may make the header for the procedure very terse, simply referring to the manual entry. The other side of this coin is that every major design decision needs to be ~~documented ''at least'' once. If a design decision is used in many places, it~~ may be hard to pick a central place to document it. Try to find a data structure or key procedure where you can place the main body of comments; then reference this body in the other places where the decision is used. If all else fails, add a block of comments to the header page of one of the files implementing the decision. ~~~~ Write clean code~~ The best way to produce a well-documented system is to write clean and simple code. This way there won't be much to document. If code is clean, it means that there are a few simple ideas that explain its operation; all you have to do is to document those key ideas. When writing code, ask yourself if there is a simple concept behind the code. If not, perhaps you should rethink the code. If it takes a lot of documentation to explain a piece of code, it is a sign that you haven't found a clean solution to the problem. ~~~~ Document as you go~~ It is extremely important to write the documentation as you write the code. It's very tempting to put off the documentation until the end; after all, the code will change, so why waste time writing documentation now when you'll have to change it later? The problem is that the end never comes - there is always more code to write. Also, the more undocumented code that you accumulate, the harder it is to work up the energy to document it. So, you just write more undocumented code. I've seen many people start a project fully intending to go back at the end and write all the documentation, but I've never seen anyone actually do it. If you do the documentation as you go, it won't add much to your coding time and you won't have to worry about doing it later. Also, the best time to document code is when the key ideas are fresh in your mind, which is when you're first writing the code. When I write new code, I write all of the header comments for a group of procedures before I fill in any of the bodies of the procedures. This way I can think about the overall structure and how the pieces fit together before getting bogged down in the details of individual procedures. ~~~~ Document tricky situations~~ If code is non-obvious, meaning that its structure and correctness depend on information that won't be obvious to someone reading it for the first time, be sure to document the non-obvious information. One good indicator of a tricky situation is a bug. If you discover a subtle property of your program while fixing a bug, be sure to add a comment explaining the problem and its solution. Of course, it's even better if you can fix the bug in a way that eliminates the subtle behavior, but this isn't always possible. ~~~ Testing~~ One of the environments where Tcl works best is for testing. While Tcl has traditionally been used for testing C code it is equally as good at testing other Tcl code. Whenever you write new code you should write Tcl test scripts to go with that code and save the tests in files so that they can be re-run later. Writing test scripts isn't as tedious as it may sound. If you're developing your code carefully you're already doing a lot of testing; all you need to do is type your test cases into a script file where they can be reused, rather than typing them interactively where they vanish after they're run. ~~~~ Basics~~ Tests should be organized into script files, where each file contains a collection of related tests. Individual tests should be based on the procedure ~~~~'''~~test~~'''~~, just like in the Tcl and Tk test suites. Here are two examples:~~ \| test expr-3.1 {floating-point operators} { \| expr 2.3.6 \| } 1.38 \| test expr-3.2 {floating-point operators} {unixOnly} { \| list [catch {expr 2.3/0} msg] $msg \| } {1 {divide by zero}} ~~~~'''~~test~~'''~~ is a procedure defined in a script file named ~~'''~~defs~~'''~~, which is ~~'''~~source~~'''~~d by each test file. ~~'''~~test~~'''~~ takes four or five arguments: a~~ test identifier, a string describing the test, an optional argument describing the conditions under which this test should run, a test script, and the ~~expected result of the script. ~~'''~~test~~'''~~ evaluates the script and checks to~~ be sure that it produces the expected result. If not, it prints a message like the following: ~~\| ==== expr-3.1 floating-point operators \| ==== Contents of test case: \| expr 2.3.6 \| ==== Result was: \| 1.39 \| ---- Result should have been: \| 1.38 \| ---- expr-3.1 FAILED~~ ~~To run a set of tests, you start up the application and ~~'''~~source~~'''~~ a test~~ file. If all goes well no messages appear; if errors are detected, a message is printed for each error. ~~The test identifier, such as ~~'''~~expr-3.1~~'''~~, is printed when errors occur. It~~ can be used to search a test script to locate the source for a failed test. The first part of the identifier, such as ~~'''~~expr~~'''~~, should be the same as the name of the test file, except that the test file should have a ~~'''~~.test~~'''~~ extension, such as ~~'''~~expr.test~~'''~~. The two numbers allow you to divide your tests into groups. The tests in a particular group (e.g., all the ~~'''~~expr-3.~~'''''n''~~ tests) relate to a single sub-feature, such as a single procedure. The tests should appear in the test file in the same order as their numbers. ~~The test name, such as ~~'''~~floating-point operators~~'''~~, is printed when errors~~ occur. It provides human-readable information about the general nature of the test. Before writing tests I suggest that you look over some of the test files for Tcl and Tk to see how they are structured. You may also want to look at the ~~~~'''~~README~~'''~~ files in the Tcl and Tk test directories to learn about~~ additional features that provide more verbose output or restrict the set of tests that are run. ~~~~ Organizing tests~~ Organize your tests to match the code being tested. The best way to do this is to have one test file for each source code file, with the name of the test file derived from the name of the source file in an obvious way (e.g. ~~'''~~http.test~~'''~~ contains tests for the code in ~~'''~~http.tcl~~'''~~). Within the test file, have one group of tests for each procedure (for example, all the ~~'''~~http-3.~~'''''n''~~ tests in ~~'''~~http.test~~'''~~ are for the procedure ~~'''~~http::geturl~~'''~~). The order of the tests within a group should be the same as the order of the code within the procedure. This approach makes it easy to find the tests for a particular piece of code and add new tests as the code changes. The Tcl test suite was written a long time ago and uses a different style where there is one file for each Tcl command or group of related commands, and the tests are grouped within the file by sub-command or features. In this approach the relationship between tests and particular pieces of code is much less obvious, so it is harder to maintain the tests as the code evolves. I don't recommend using this approach for new tests. ~~~~ Coverage~~ When writing tests, you should attempt to exercise every line of source code at least once. There will be occasionally be code that you can't exercise, such as code that exits the application, but situations like this are rare. You may find it hard to exercise some pieces of code because existing Tcl commands don't provide fine enough control to generate all the possible execution paths. In situations like this, write one or more new Tcl commands just for testing purposes. It's much better to test a facility directly then to rely on some side effect for testing that may change over time. Use a similar approach in your own code, where you have an extra file with additional commands for testing. It's not sufficient just to make sure each line of code is executed by your tests. In addition, your tests must discriminate between code that executes correctly and code that isn't correct. For example, write tests to make sure ~~that the ~~'''~~then~~'''~~ and ~~'''~~else~~'''~~ branches of each ~~'''~~if~~'''~~ statement are~~ taken under the correct conditions. For a loop, run different tests to make the loop execute zero times, one time, and two or more times. If a piece of code removes an element from a list, try cases where the element to be removed is the first element, last element, only element, and neither first element nor last. Try to find all the places where different pieces of code interact in unusual ways, and exercise the different possible interactions. ~~~~ Fixing bugs~~ Whenever you find a bug in your code it means that the test suite wasn't complete. As part of fixing the bug, you should add new tests that detect the presence of the bug. I recommend writing the tests after you've located the ~~bug but ''before'' you fix it. That way you can verify that the bug happens~~ before you implement the fix and the bug doesn't happen afterwards, so you'll know you've really fixed something. Use bugs to refine your testing approach: think about what you might be able to do differently when you write tests in the future to keep bugs like this one from going undetected. ~~~~ Tricky features~~ I also use tests as a way of illustrating the need for tricky code. If a piece of code has an unusual structure, and particularly if the code is hard to explain, I try to write additional tests that will fail if the code is implemented in the obvious manner instead of using the tricky approach. This way, if someone comes along later, doesn't understand the documentation for the code, decides the complex structure is unnecessary, and changes the code ~~back to the simple (but incorrect) form, the test will fail and the person~~ will be able to use the test to understand why the code needs to be the way it is. Illustrative tests are not a substitute for good documentation, but they provide a useful addition. ~~~~ Test independence~~ Try to make tests independent of each other, so that each test can be understood in isolation. For example, one test shouldn't depend on commands executed in a previous test. This is important because the test suite allows tests to be run selectively: if the tests depend on each other, then false errors will be reported when someone runs a few of the tests without the others. For convenience, you may execute a few statements in the test file to set up a test configuration and then run several tests based on that configuration. If you do this, put the setup code outside the calls to thetest procedure so it will always run even if the individual tests aren't run. I suggest keeping a very simple structure consisting of setup followed by a group of tests. Don't perform some setup, run a few tests, modify the setup slightly, run a few more tests, modify the setup again, and so on. If you do this, it will be hard for people to figure out what the setup is at any given point and when they add tests later they are likely to break the setup. ~~~ Miscellaneous~~ ~~~~ Porting issues~~ Writing portable scripts in Tcl is actually quite easy as Tcl itself is quite portable. However, issues do arise that may require writing platform specific code. To conditionalize your code in this manner you should use the ~~~~'''~~tcl_platform~~'''~~ array to determine platform specific differences. You~~ should avoid the use of theenv variable unless you have already determined the ~~platform you are running on via the ~~'''~~tcl_platform~~'''~~ array.~~ As Tcl/Tk has become more cross platform we have added commands that aid in making your code more portable. The most common porting mistakes result from assumptions about file names and locations. To avoid such mistakes always use ~~the ~~'''~~file join~~'''~~ command and list commands so that you will handle~~ different file separation characters or spaces in file names. In Tk, you should always use provided high level dialog boxes instead or creating your ~~own. The ~~'''~~font~~'''~~ and ~~'''~~menu~~'''~~ commands has also be revamped to make~~ writing cross-platform code easier. ~~~~ Changes files~~ Each package should contain a file namedchanges that keeps a log of all ~~significant changes made to the package. The ~~'''~~changes~~'''~~ file provides a way~~ for users to find out what's new in each new release, what bugs have been fixed, and what compatibility problems might be intro- duced by the new ~~release. The ~~'''~~changes~~'''~~ file should be in chronological order. Just add~~ short blurbs to it each time you make a change. Here is a sample from the Tk ~~~~'''~~changes~~'''~~ file:~~ \| 5/19/94 (bug fix) Canvases didn't generate proper Postscript for \| stippled text. (RJ) \| \| 5/20/94 (new feature) Added "bell" command to ring the display's \| bell. (JO) \| \| 5/26/94 (feature removed) Removed support for "fill" justify mode \| from Tk_GetJustify and from the TK_CONFIG_JUSTIFY configuration \| option. None of the built-in widgets ever supported this mode \| anyway. (SS) \| POTENTIAL INCOMPATIBILITY * ~~The entries in the ~~'''~~changes~~'''~~ file can be relatively terse; once someone~~ finds a change that is relevant, they can always go to the manual entries or code to find out more about it. Be sure to highlight changes that cause ~~compatibility problems, so people can scan the ~~'''~~changes~~'''~~ file quickly to~~ locate the incompatibilities. Also be sure to add your initials to the entry so that people scanning the log will know who made a particular change. ~~''(The Tcl and Tk core additionally uses a ChangeLog file that has a much~~ higher detail within it. This has the advantage of having more tooling support, but tends to be so verbose that the shorter summaries in the changes ~~file are still written up by the core maintainers before each release.~~)''~~~~ ~~~ Copyright~~ ~~The original version of this document is copyright (C) 1997 Sun Microsystems,~~ Inc. Revisions to reflect current community best-practice are public domain.	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| < > \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| < \| > \| \| \| \| \| \| \| \| \| < \| > \| \| \| \| \| \| \| \| \| \| \| < > \| \| \| \| \| \| \| \| < < \| > > \| \| \| \| \| \| \| \| < > \| \| \| \| \| \| \| \| \| \| \| \| \| < < \| > > \| \| \| \| \| < < \| > > \| \| \| \| \| \| \| \| < > \| \| < > \| \| < < \| > > \| \| \| \| \| \| \| \| \| < \| > \| \| \| \| \| \| \| \| \| \| \| \| \| \| < > \| \| \| < > \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| >	333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014	comment should describe the expected type and describe it's function. Optional arguments should be pointed out and the default behavior of an unspecified argument should be mentioned. Comments for all of the arguments should line up on the same tab stop. Results: The last part of the header describes the value returned by the procedure. The type and the intended use of the result should be described. This section should also mention any _side effects_ that are worth noting. The file tclProcHead [_not available_] contains a template for a procedure header which should be used as a base for all new Tcl commands. Follow the syntax of the above example exactly \(same indentation, double-dash after the procedure name, etc.\). ## Procedure declarations The procedure declaration should also follow exactly the syntax in the example above. Note that the procedure is defined outside the namespace command that defines the export list and namespace globals. The first line gives the proc keyword, the procedure name, and an argument list. If there are many arguments, they may spill onto additional lines \(see Sections 6.1 and 6.3 for information about indentation\). ## Parameter order Procedure parameters may be divided into three categories. _In_ parameters only pass information into the procedure \(either directly or by pointing to information that the procedure reads\). _Out_ parameters point to things in the caller's memory that the procedure modifies such as the name of a variable the procedure will modify. _In-out_ parameters do both. Below is a set of rules for deciding on the order of parameters to a procedure: 1. Parameters should normally appear in the order in, in/out, out, except where overridden by the rules below. 2. If an argument is actually a sub-command for the command than it should be the first argument of the command. For example: proc graph::tree {subCmd args} { switch $subCmd { add { eval add_node $args } draw {... 3. If there is a group of procedures, all of which operate on an argument of a particular type, such as a file path or widget path, the argument should be the first argument to each of the procedures \(or after the sub-command argument\). ## Procedure bodies The body of a procedure follows the declaration. See Section 6 for the coding conventions that govern procedure bodies. The curly braces enclosing the body should be on different lines, as shown in the examples above, even if the body of the procedure is empty. # Naming conventions Choosing names is one of the most important aspects of programming. Good names clarify the function of a program and reduce the need for other documentation. Poor names result in ambiguity, confusion, and error. This section gives some general principles to follow when choosing names and lists specific rules for name syntax, such as capitalization. ## General considerations The ideal variable name is one that instantly conveys as much information as possible about the purpose of the variable it refers to. When choosing names, play devil's advocate with yourself to see if there are ways that a name might be misinterpreted or confused. Here are some things to consider: 1. Are you consistent? Use the same name to refer to the same thing everywhere. For example, within the code for handling standard bindings in Tk widgets, a standard name w is always used to refer to the window associated with the current event. 2. If someone sees the name out of context, will they realize what it stands for, or could they confuse it with something else? For example, the procedure name buildStructure could get confused with some other part of the system. A name like buildGraphNode both describes what part of the system it belongs to and what it is probably used for. 3. Could this name be confused with some other name? For example, it's probably a mistake to have two variables str and string in the same procedure: it will be hard for anyone to remember which is which. Instead, change the names to reflect their functions. For example, if the strings are used as source and destination for a copy operation, name them src and dst. 4. Is the name so generic that it doesn't convey any information? The variable str from the previous paragraph is an example of this; changing its name to src makes the name less generic and hence conveys more information. ## Basic syntax rules Below are some specific rules governing the syntax of names. Please follow the rules exactly, since they make it possible to determine certain properties of a variable just from its name. 1. Exported names for both procedures and variables always start with a _lower_-case letter. Procedures and variables that are meant only for use with in the current package or namespace should start with an _upper_-case letter. We chose lower-case for the exported symbols because it is possible they may be commonly used from the command line and they should be easy to write. For example: # CountNum is a private variable set CountNum 0 # The function addWindow is public proc addWindow {} {... # newWindow is a public interface in the spectcl namespace proc spectcl::newWindow {} {... 2. In multi-word names, the first letter of each trailing word is capitalized. Do not use underscores or dashes as separators between the words of a name. set numWindows 0 3. Any variable whose value refers to another variable has a name that ends in Name. Furthermore, the name should also indicate what type of variable the name is referring to. These names are often used in arguments to procedures that are taking a name of a variable. proc foo::Bar {arrayName} { upvar 1 $arrayName array ... } 4. Variables that hold Tcl code that will be evaled should have names ending in Script. proc log::eval {logScript} { if {$Log::logOn} { set result [catch {eval $logScript} msg] ... 5. Variables that hold a partial Tcl command that must have additional arguments appended before being a valid script should have names ending in Cmd. foreach scrollCmd $listScrollCmds { eval $scrollCmd $args } # Low-level coding conventions This section describes several low-level syntactic rules for writing Tcl code. These rules help to ensure that all of the Tcl code looks the same, and they prohibit a few confusing coding constructs. ## Indents are 4 spaces Each level of indentation should be four spaces. There are ways to set 4-space indents in all of the most common editors. Be sure that your editor really uses four spaces for the indent, rather than just displaying tabs as four spaces wide; if you use the latter approach then the indents will appear eight spaces wide in other editors. ## Code comments occupy full lines Comments that document code should occupy full lines, rather than being tacked onto the ends of lines containing code. The reason for this is that side-by-side comments are hard to see, particularly if neighboring statements are long enough to overlap the side-by-side comments. Also it is easy to place comments in a place that could cause errors. Comments must have exactly the structure shown in the example below, with a blank line above and below the comment. The leading blank line can be omitted if the comment is at the beginning of a block, as is the case in the second comment in the example below. Each comment should be indented to the same level as the surrounding code. Use proper English in comments: write complete sentences, capitalize the first word of each sentence, and so on. # If we are running on the Macintosh platform then we can # assume that the sources are located in the resource fork # of our application, and we do not need to search for them. # Note that there is a blank line below it to separate it # more strongly from the code. if {$tcl_platform(platform) == "macintosh"} { return } foreach dir $dirList { # If the source succeds then we are done. # Note there is no blank line above the comment; # the indentation change is visible enough. if {![catch {source [file join $dir file.tcl]}]} { break } } ## Continuation lines are indented 8 spaces You should use continuation lines to make sure that no single line exceeds 80 characters in length. Continuation lines should be indented 8 spaces so that they won't be confused with an immediately-following nested block. Pick clean places to break your lines for continuation, so that the continuation doesn't obscure the structure of the statement. For example, if a procedure call requires continuation lines, try to avoid situations where a single argument spans multiple lines. If the test for an if or while command spans lines, try to make each line have the same nesting level of parentheses and/or brackets if possible. I try to start each continuation line with an operator such as \, &&, or \\|\\|; this makes it clear that the line is a continuation, since a new statement would never start with such an operator. ## Only one command per line You should only have one Tcl command per line on the page. Do not use the semi-colon character to place multiple commands on the same line. This makes the code easier to read and helps with debugging. ## Curly braces: \{ goes at the end of a line Open curly braces can not appear on lines by themselves in Tcl. Instead, they should be placed at the end of the preceding line. Close curly braces are indented to the same level as the outer code, i.e., four spaces less than the statements they enclose. However, you shouldalways use curly braces rather than some other list generating mechanism that will work in the Tcl language. This will help make code more readable, will avoid unwanted side effects, and in many cases will generate faster code with the Tcl compiler. Control structures should always use curly braces, even if there is only one statement in the block. Thus you shouldn't write code like if {$tcl_platform(platform) == "unix"} return but rather if {$tcl_platform(platform) == "unix"} { return } This approach makes code less dense, but it avoids potential mistakes like unwanted Tcl substitutions. It also makes it easier to set breakpoints in a debugger, since it guarantees that each statement is on a separate line and can be named individually. ## Parenthesize expressions Use parentheses around each subexpression in an expression to make it absolutely clear what is the evaluation order of the expression \(a reader of your code should not need to remember Tcl's precedence rules\). For example, don't type if {$x > 22 && $y <= 47} ... Instead, type this: if {($x > 22) && ($y <= 47)} ... ## Always use the return statement You should always explicitly use the return* statement to return values from a Tcl procedure. By default Tcl will return the value of the last Tcl statement executed in a Tcl procedure as the return value of the procedure which often leads to confusion as to where the result is coming from. In addition, you should use a return statement with no argument for procedures whose results are ignored. Supplying this return will actually speed up your application with the new Tcl compiler. For example, don't write code like this: proc foo {x y} { if {$x < 0} { incr x } else { expr $x + $y } } But rather, type this: proc foo {x y} { if {$x < 0} { return [incr x] } else { return [expr $x + $y] } } For Tcl procedures that have no return value a single return statement with no arguments is placed at the end of the procedure. ## Switch statements The switch statement should be formatted as below. Always use the -- option to avoid having the string be confused with an option. This can happen when the string is user generated. Comments can be added on the same line as the pattern to comment the pattern case. The comments for each case should line up on the same tab stop and must be within the braces. Note that this is an exception to the standard commenting conventions. switch -regexp -- $string { plus - add { # Do add task ... } subtract { # Do subtract case ... } default { ... } } ## If statements Never use the then word of an if statement. It is syntactic sugar that really isn't that useful. However, the else word should always be used as it does impart some semantic information and it is more like the C language. Here is an example: if {$x < 0} { ... } elseif {$x == 0} { ... } else { ... } # Documenting code The purpose of documentation is to save time and reduce errors. Documentation is typically used for two purposes. First, people will read the documentation to find out how to use your code. For example, they will read procedure headers to learn how to call the procedures. Ideally, people should have to learn as little as possible about your code in order to use it correctly. Second, people will read the documentation to find out how your code works internally, so they can fix bugs or add new features; again, good documentation will allow them to make their fixes or enhancements while learning the minimum possible about your code. More documentation isn't necessarily better: wading through pages of documentation may not be any easier than deciphering the code. Try to pick out the most important things that will help people to understand your code and focus on these in your documentation. ## Document things with wide impact The most important things to document are those that affect many different pieces of a program. Thus it is essential that every procedure interface, every structure declaration, and every global variable be documented clearly. If you haven't documented one of these things it will be necessary to look at all the uses of the thing to figure out how it's supposed to work; this will be time-consuming and error-prone. On the other hand, things with only local impact may not need much documentation. For example, in short procedures I don't usually have comments explaining the local variables. If the overall function of the procedure has been explained, and if there isn't much code in the procedure, and if the variables have meaningful names, then it will be easy to figure out how they are used. On the other hand, for long procedures with many variables I usually document the key variables. Similarly, when I write short procedures I don't usually have any comments in the procedure's code: the procedure header provides enough information to figure out what is going on. For long procedures I place a comment block before each major piece of the procedure to clarify the overall flow through the procedure. ## Don't just repeat what's in the code The most common mistake I see in documentation \(besides it not being there at all\) is that it repeats what is already obvious from the code, such as this trivial \(but exasperatingly common\) example: # Increment i. incr i Documentation should provide higher-level information about the overall function of the code, helping readers to understand what a complex collection of statements really means. For example, the comment # Probe into the array to see if the symbol exists. is likely to be much more helpful than # Loop through every array index, get the third value of the # list in the content to determine if it has the symbol we are # looking for. Set the result to the symbol if we find it. Everything in this second comment is probably obvious from the code that follows it. Another thing to consider in your comments is word choice. Use different words in the comments than the words that appear in variable or procedure names. For example, the comment # SwapPanels -- # # Swap the panels. # ... is not a very useful comment. Everything in the comment is already obvious from the procedure's name. Here is a much more useful comment: # SwapPanels -- # # Unmap the current UI panel from the parent frame and replace # it with the newly specified frame. Make sure that the new # panel fits into the old frame and resize if needed. # ... This comment tells _why_ you might want to use the procedure, in addition to _what_ it does, which makes the comment much more useful. ## Document each thing in exactly one place Systems evolve over time. If something is documented in several places, it will be hard to keep the documentation up to date as the system changes. Instead, try to document each major design decision in exactly one place, as near as possible to the code that implements the design decision. The principal documentation for each procedure goes in the procedure header. There's no need to repeat this information again in the body of the procedure \(but you might have additional comments in the procedure body to fill in details not described in the procedure header\). If a library procedure is documented thoroughly in a manual entry, then I may make the header for the procedure very terse, simply referring to the manual entry. The other side of this coin is that every major design decision needs to be documented _at least_ once. If a design decision is used in many places, it may be hard to pick a central place to document it. Try to find a data structure or key procedure where you can place the main body of comments; then reference this body in the other places where the decision is used. If all else fails, add a block of comments to the header page of one of the files implementing the decision. ## Write clean code The best way to produce a well-documented system is to write clean and simple code. This way there won't be much to document. If code is clean, it means that there are a few simple ideas that explain its operation; all you have to do is to document those key ideas. When writing code, ask yourself if there is a simple concept behind the code. If not, perhaps you should rethink the code. If it takes a lot of documentation to explain a piece of code, it is a sign that you haven't found a clean solution to the problem. ## Document as you go It is extremely important to write the documentation as you write the code. It's very tempting to put off the documentation until the end; after all, the code will change, so why waste time writing documentation now when you'll have to change it later? The problem is that the end never comes - there is always more code to write. Also, the more undocumented code that you accumulate, the harder it is to work up the energy to document it. So, you just write more undocumented code. I've seen many people start a project fully intending to go back at the end and write all the documentation, but I've never seen anyone actually do it. If you do the documentation as you go, it won't add much to your coding time and you won't have to worry about doing it later. Also, the best time to document code is when the key ideas are fresh in your mind, which is when you're first writing the code. When I write new code, I write all of the header comments for a group of procedures before I fill in any of the bodies of the procedures. This way I can think about the overall structure and how the pieces fit together before getting bogged down in the details of individual procedures. ## Document tricky situations If code is non-obvious, meaning that its structure and correctness depend on information that won't be obvious to someone reading it for the first time, be sure to document the non-obvious information. One good indicator of a tricky situation is a bug. If you discover a subtle property of your program while fixing a bug, be sure to add a comment explaining the problem and its solution. Of course, it's even better if you can fix the bug in a way that eliminates the subtle behavior, but this isn't always possible. # Testing One of the environments where Tcl works best is for testing. While Tcl has traditionally been used for testing C code it is equally as good at testing other Tcl code. Whenever you write new code you should write Tcl test scripts to go with that code and save the tests in files so that they can be re-run later. Writing test scripts isn't as tedious as it may sound. If you're developing your code carefully you're already doing a lot of testing; all you need to do is type your test cases into a script file where they can be reused, rather than typing them interactively where they vanish after they're run. ## Basics Tests should be organized into script files, where each file contains a collection of related tests. Individual tests should be based on the procedure test, just like in the Tcl and Tk test suites. Here are two examples: test expr-3.1 {floating-point operators} { expr 2.3.6 } 1.38 test expr-3.2 {floating-point operators} {unixOnly} { list [catch {expr 2.3/0} msg] $msg } {1 {divide by zero}} test* is a procedure defined in a script file named defs, which is sourced by each test file. test takes four or five arguments: a test identifier, a string describing the test, an optional argument describing the conditions under which this test should run, a test script, and the expected result of the script. test evaluates the script and checks to be sure that it produces the expected result. If not, it prints a message like the following: ==== expr-3.1 floating-point operators ==== Contents of test case: expr 2.3.6 ==== Result was: 1.39 ---- Result should have been: 1.38 ---- expr-3.1 FAILED To run a set of tests, you start up the application and source* a test file. If all goes well no messages appear; if errors are detected, a message is printed for each error. The test identifier, such as expr-3.1, is printed when errors occur. It can be used to search a test script to locate the source for a failed test. The first part of the identifier, such as expr, should be the same as the name of the test file, except that the test file should have a .test extension, such as expr.test. The two numbers allow you to divide your tests into groups. The tests in a particular group \(e.g., all the expr-3._n_ tests\) relate to a single sub-feature, such as a single procedure. The tests should appear in the test file in the same order as their numbers. The test name, such as floating-point operators, is printed when errors occur. It provides human-readable information about the general nature of the test. Before writing tests I suggest that you look over some of the test files for Tcl and Tk to see how they are structured. You may also want to look at the README files in the Tcl and Tk test directories to learn about additional features that provide more verbose output or restrict the set of tests that are run. ## Organizing tests Organize your tests to match the code being tested. The best way to do this is to have one test file for each source code file, with the name of the test file derived from the name of the source file in an obvious way \(e.g. http.test contains tests for the code in http.tcl\). Within the test file, have one group of tests for each procedure \(for example, all the http-3._n_ tests in http.test are for the procedure http::geturl\). The order of the tests within a group should be the same as the order of the code within the procedure. This approach makes it easy to find the tests for a particular piece of code and add new tests as the code changes. The Tcl test suite was written a long time ago and uses a different style where there is one file for each Tcl command or group of related commands, and the tests are grouped within the file by sub-command or features. In this approach the relationship between tests and particular pieces of code is much less obvious, so it is harder to maintain the tests as the code evolves. I don't recommend using this approach for new tests. ## Coverage When writing tests, you should attempt to exercise every line of source code at least once. There will be occasionally be code that you can't exercise, such as code that exits the application, but situations like this are rare. You may find it hard to exercise some pieces of code because existing Tcl commands don't provide fine enough control to generate all the possible execution paths. In situations like this, write one or more new Tcl commands just for testing purposes. It's much better to test a facility directly then to rely on some side effect for testing that may change over time. Use a similar approach in your own code, where you have an extra file with additional commands for testing. It's not sufficient just to make sure each line of code is executed by your tests. In addition, your tests must discriminate between code that executes correctly and code that isn't correct. For example, write tests to make sure that the then and else branches of each if statement are taken under the correct conditions. For a loop, run different tests to make the loop execute zero times, one time, and two or more times. If a piece of code removes an element from a list, try cases where the element to be removed is the first element, last element, only element, and neither first element nor last. Try to find all the places where different pieces of code interact in unusual ways, and exercise the different possible interactions. ## Fixing bugs Whenever you find a bug in your code it means that the test suite wasn't complete. As part of fixing the bug, you should add new tests that detect the presence of the bug. I recommend writing the tests after you've located the bug but _before_ you fix it. That way you can verify that the bug happens before you implement the fix and the bug doesn't happen afterwards, so you'll know you've really fixed something. Use bugs to refine your testing approach: think about what you might be able to do differently when you write tests in the future to keep bugs like this one from going undetected. ## Tricky features I also use tests as a way of illustrating the need for tricky code. If a piece of code has an unusual structure, and particularly if the code is hard to explain, I try to write additional tests that will fail if the code is implemented in the obvious manner instead of using the tricky approach. This way, if someone comes along later, doesn't understand the documentation for the code, decides the complex structure is unnecessary, and changes the code back to the simple \(but incorrect\) form, the test will fail and the person will be able to use the test to understand why the code needs to be the way it is. Illustrative tests are not a substitute for good documentation, but they provide a useful addition. ## Test independence Try to make tests independent of each other, so that each test can be understood in isolation. For example, one test shouldn't depend on commands executed in a previous test. This is important because the test suite allows tests to be run selectively: if the tests depend on each other, then false errors will be reported when someone runs a few of the tests without the others. For convenience, you may execute a few statements in the test file to set up a test configuration and then run several tests based on that configuration. If you do this, put the setup code outside the calls to thetest procedure so it will always run even if the individual tests aren't run. I suggest keeping a very simple structure consisting of setup followed by a group of tests. Don't perform some setup, run a few tests, modify the setup slightly, run a few more tests, modify the setup again, and so on. If you do this, it will be hard for people to figure out what the setup is at any given point and when they add tests later they are likely to break the setup. # Miscellaneous ## Porting issues Writing portable scripts in Tcl is actually quite easy as Tcl itself is quite portable. However, issues do arise that may require writing platform specific code. To conditionalize your code in this manner you should use the tcl\_platform array to determine platform specific differences. You should avoid the use of theenv variable unless you have already determined the platform you are running on via the tcl\_platform array. As Tcl/Tk has become more cross platform we have added commands that aid in making your code more portable. The most common porting mistakes result from assumptions about file names and locations. To avoid such mistakes always use the file join command and list commands so that you will handle different file separation characters or spaces in file names. In Tk, you should always use provided high level dialog boxes instead or creating your own. The font and menu commands has also be revamped to make writing cross-platform code easier. ## Changes files Each package should contain a file namedchanges that keeps a log of all significant changes made to the package. The changes file provides a way for users to find out what's new in each new release, what bugs have been fixed, and what compatibility problems might be intro- duced by the new release. The changes file should be in chronological order. Just add short blurbs to it each time you make a change. Here is a sample from the Tk changes file: 5/19/94 (bug fix) Canvases didn't generate proper Postscript for stippled text. (RJ) 5/20/94 (new feature) Added "bell" command to ring the display's bell. (JO) 5/26/94 (feature removed) Removed support for "fill" justify mode from Tk_GetJustify and from the TK_CONFIG_JUSTIFY configuration option. None of the built-in widgets ever supported this mode anyway. (SS) * POTENTIAL INCOMPATIBILITY * The entries in the changes file can be relatively terse; once someone finds a change that is relevant, they can always go to the manual entries or code to find out more about it. Be sure to highlight changes that cause compatibility problems, so people can scan the changes file quickly to locate the incompatibilities. Also be sure to add your initials to the entry so that people scanning the log will know who made a particular change. _\(The Tcl and Tk core additionally uses a ChangeLog file that has a much higher detail within it. This has the advantage of having more tooling support, but tends to be so verbose that the shorter summaries in the changes file are still written up by the core maintainers before each release.\)_ # Copyright The original version of this document is copyright \(C\) 1997 Sun Microsystems, Inc. Revisions to reflect current community best-practice are public domain.

~~1 2 3 4 5 6 7 8 9 10~~ 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39	~~TIP: 18~~ T~~itle~~: Add Labels to Frames ~~Version: $Revision: 2.3 $~~ Author: Peter Spjuth <[email protected]> State: Final Type: Project Vote: Done Created: 12-Dec-2000 Post-History: Tcl-Version: 8.4 ~~~ Abstract~~ This TIP proposes to add a labelled frame widget to Tk. ~~~ Introduction~~ Labelled frames are a common thing in a GUI and the need for them are rather clear by the fact that practically every widget package implements some version of it. This proposal wants to add simple labelled frames to Tk. Even though a labelled frame can be built by three frames and label, this requires some skill and a bit work. I believe such a basic thing should be easier and this change would make creating a labelled frame as simple as it deserves to be. Below is an example of what I mean with a labelled frame. ~~~~#image:18labframe~~ Example of labelled frame~~ ~~~ Specification~~ A new widget class, labelframe, is added. It works like a frame, with the following changes. These options are added: -text: Standard option. Default value "".	< \| < \| \| \| \| \| \| \| > \| \| \| \|	1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38	# TIP 18: Add Labels to Frames Author: Peter Spjuth <[email protected]> State: Final Type: Project Vote: Done Created: 12-Dec-2000 Post-History: Tcl-Version: 8.4 ----- # Abstract This TIP proposes to add a labelled frame widget to Tk. # Introduction Labelled frames are a common thing in a GUI and the need for them are rather clear by the fact that practically every widget package implements some version of it. This proposal wants to add simple labelled frames to Tk. Even though a labelled frame can be built by three frames and label, this requires some skill and a bit work. I believe such a basic thing should be easier and this change would make creating a labelled frame as simple as it deserves to be. Below is an example of what I mean with a labelled frame. ![Example of labelled frame](../assets/18labframe.png) # Specification A new widget class, labelframe, is added. It works like a frame, with the following changes. These options are added: -text: Standard option. Default value "".
︙			︙
60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 ~~97 98 99 100~~ 101 ~~102 103 104 105~~ 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 ~~122~~ 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 ~~140~~ 141 142 143 144 145 146 147 148 149 150 151 152 153 ~~154~~ 155 156 157 158 159 160 161 162 163 164 165 166 167 168 ~~169 170~~ 171 172 173 174 175 176 177 178 179 180 181 182 183 184 ~~185~~ 186 187 188 189 190 191 192	-borderwidth, new default value 2. -relief, new default value groove. -padx and -pady are useful in frames and toplevels too, and since it is easy and cheap to add them at the same time, this TIP proposes to add them there too. ~~~ Rationale~~ My main approach has been to make a simple but still general solution. The most typical usage should be really easy, more advanced usage possible, and more features should be possible to add later if needed. Trying to mimic all the abilities of a label widget is rather futile. It leads to code duplication and future updates to the label widget would need to be copied too to keep up. Since the most common label is a simple text, the labelframe only mimics options -text, -font and -fg to be able to handle that case in a simple manner. If you want a more advanced label, e.g. with an image or with a checkbutton, you can get it with -labelwidget. For placement of the label I chose a style I found in IWidget's "labeledframe" widget. It's the most general solution I can see since it allows access to all twelve obvious positions in an easy way. Options -padx and -pady does not have anything directly to do with labels, but are a generally nice addition to frames that I have missed a lot in the past. Such padding is not possible without ~~part of the changes to geometry management (see Implementing section)~~ that are required for displaying the label. The thing about raising the -labelwidget in the stacking order comes from this: With the most simple implementation, using -labelwidget could be done in two ways: ~~\|# Way #1 \|labelframe .f \|label .f.l -text Mupp \|.f configure -labelwidget .f.l~~ ~~\|# Way #2 \|label .l -text Mupp \|labelframe .f -labelwidget .l \|raise .l .f~~ In the first you want the label to be a child but since it has to exist, the -labelwidget can't be used on the labelframe creation line. In the second you try to circumvent it by creating the label first, but then you have to raise it above the labelframe to be visible. Even though it's just one extra line of code I find it a bit awkward when it's so easy to do something about. The first can be fixed by not trying to do anything with the label widget until idle time when it has had a chance to be created. This is not a good solution though since it leads to some rather awkward things in implementation. The second can be fixed by automatically raising the label in the stacking order when used as -labelwidget. If this is documented clearly I don't have a problem with it, and that is why I chose it. ~~~ Alternatives to this TIP~~ An alternative way to implement a labelled frame is using mega widget style with a subframe where children are placed. This is how current widget packages do it. I think that is an awkward and unnatural way to handle such a simple thing as a labelled frame. The only reason to do so is that current limitations in geometry management prevents a simpler solution. I believe that a labelled frame should work like a normal frame. That it displays a label should not matter more than displaying a border or a blue background. A labelled frame megawidget would be different from a frame, the most noticeable difference being that you can't pack/grid things directly into the labelled frame, instead you have to go via a subframe. Having the labelled frame work like a normal frame is more consistent and easier for the programmer at Tcl level. ~~~ Implementing~~ Implementing this is mostly rather straightforward. The labelframe will share most code with the frame, just like toplevel and frame share code today, and like the spinbox was built on the entry. The tricky part is that limitations in geometry management does not leave room for displaying a label. The changes needed in geometry management are simple but introduces a slight backward incompatibility. The problem is this. Today a widget can set an internal border width. This defines a uniform width area around the edge of the widget that geometry managers should stay away from. This is not enough though, since to display a label the frame needs to get more space on one side where it will put the label. Also, there is no way for a widget ~~to affect its own size (anything it says is overridden by pack/grid), so~~ the labelframe cannot make sure that enough size is requested to make room for the label. By adding some more fields to the TkWindow structure, the information needed can be transferred to the geometry manager. First, the present internalBorderWidth field is split into four fields, one for each side. Second, minimum requested width/height fields are added. This requires one macro per field for reading them and two new APIs to set the fields: ~~\|void Tk_SetInternalBorderWidthEx(tkwin, left, right, top, bottom) \|void Tk_SetMinimumRequestedSize(tkwin, minWidth, minHeight)~~ Geometry managers would need to be updated to take the new fields into consideration, and here is where backwards compatibility comes in. Any extension implementing a geometry manager would need to be updated in the same way as grid/pack/place will be. The change is trivial, and even if not done most things will work anyway. An updated Tk plus an old extension plus an old script will still work and thus no one needs to worry about upgrading. I consider this a minor thing since it wont break any existing applications. The only thing that will break is if someone would try to use a geometry manager that is not updated within a labelframe. And even in that case you can work around it with an extra frame. ~~~ Rejected alternatives~~ The ability to display a label could have been given to the normal frame by adding the options above to it. Having a new widget class has the following advantages: The separate widget class can have its own default values, and the user can control it separately from the frame in the option database.	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191	-borderwidth, new default value 2. -relief, new default value groove. -padx and -pady are useful in frames and toplevels too, and since it is easy and cheap to add them at the same time, this TIP proposes to add them there too. # Rationale My main approach has been to make a simple but still general solution. The most typical usage should be really easy, more advanced usage possible, and more features should be possible to add later if needed. Trying to mimic all the abilities of a label widget is rather futile. It leads to code duplication and future updates to the label widget would need to be copied too to keep up. Since the most common label is a simple text, the labelframe only mimics options -text, -font and -fg to be able to handle that case in a simple manner. If you want a more advanced label, e.g. with an image or with a checkbutton, you can get it with -labelwidget. For placement of the label I chose a style I found in IWidget's "labeledframe" widget. It's the most general solution I can see since it allows access to all twelve obvious positions in an easy way. Options -padx and -pady does not have anything directly to do with labels, but are a generally nice addition to frames that I have missed a lot in the past. Such padding is not possible without part of the changes to geometry management \(see Implementing section\) that are required for displaying the label. The thing about raising the -labelwidget in the stacking order comes from this: With the most simple implementation, using -labelwidget could be done in two ways: # Way #1 labelframe .f label .f.l -text Mupp .f configure -labelwidget .f.l # Way #2 label .l -text Mupp labelframe .f -labelwidget .l raise .l .f In the first you want the label to be a child but since it has to exist, the -labelwidget can't be used on the labelframe creation line. In the second you try to circumvent it by creating the label first, but then you have to raise it above the labelframe to be visible. Even though it's just one extra line of code I find it a bit awkward when it's so easy to do something about. The first can be fixed by not trying to do anything with the label widget until idle time when it has had a chance to be created. This is not a good solution though since it leads to some rather awkward things in implementation. The second can be fixed by automatically raising the label in the stacking order when used as -labelwidget. If this is documented clearly I don't have a problem with it, and that is why I chose it. # Alternatives to this TIP An alternative way to implement a labelled frame is using mega widget style with a subframe where children are placed. This is how current widget packages do it. I think that is an awkward and unnatural way to handle such a simple thing as a labelled frame. The only reason to do so is that current limitations in geometry management prevents a simpler solution. I believe that a labelled frame should work like a normal frame. That it displays a label should not matter more than displaying a border or a blue background. A labelled frame megawidget would be different from a frame, the most noticeable difference being that you can't pack/grid things directly into the labelled frame, instead you have to go via a subframe. Having the labelled frame work like a normal frame is more consistent and easier for the programmer at Tcl level. # Implementing Implementing this is mostly rather straightforward. The labelframe will share most code with the frame, just like toplevel and frame share code today, and like the spinbox was built on the entry. The tricky part is that limitations in geometry management does not leave room for displaying a label. The changes needed in geometry management are simple but introduces a slight backward incompatibility. The problem is this. Today a widget can set an internal border width. This defines a uniform width area around the edge of the widget that geometry managers should stay away from. This is not enough though, since to display a label the frame needs to get more space on one side where it will put the label. Also, there is no way for a widget to affect its own size \(anything it says is overridden by pack/grid\), so the labelframe cannot make sure that enough size is requested to make room for the label. By adding some more fields to the TkWindow structure, the information needed can be transferred to the geometry manager. First, the present internalBorderWidth field is split into four fields, one for each side. Second, minimum requested width/height fields are added. This requires one macro per field for reading them and two new APIs to set the fields: void Tk_SetInternalBorderWidthEx(tkwin, left, right, top, bottom) void Tk_SetMinimumRequestedSize(tkwin, minWidth, minHeight) Geometry managers would need to be updated to take the new fields into consideration, and here is where backwards compatibility comes in. Any extension implementing a geometry manager would need to be updated in the same way as grid/pack/place will be. The change is trivial, and even if not done most things will work anyway. An updated Tk plus an old extension plus an old script will still work and thus no one needs to worry about upgrading. I consider this a minor thing since it wont break any existing applications. The only thing that will break is if someone would try to use a geometry manager that is not updated within a labelframe. And even in that case you can work around it with an extra frame. # Rejected alternatives The ability to display a label could have been given to the normal frame by adding the options above to it. Having a new widget class has the following advantages: The separate widget class can have its own default values, and the user can control it separately from the frame in the option database.
︙			︙
206 207 208 209 210 211 212 ~~213~~ 214 215 216 217 218 ~~219~~ 220 221 222 223 224 ~~225~~ 226 227	1.1 of this TIP has also been discarded because it was too complex. It would be possible to do without the minimum requested size fields if you give the responsibility to make sure the label has room to the GUI programmer. This could be rather awkward though, e.g. when making an internationalized application where labels can vary a lot. ~~~ Reference Implementation~~ An almost finished implementation exists, and it's just a matter of polishing the last bits to create a patch for this proposal if it is accepted. ~~At http://www.dtek.chalmers.se/~d1peter/labframe.tcl you can find a~~ pure Tcl demo of labelled frames. Even though it uses sub-frames and thus do not live up to what I want to accomplish here it implements all new options as specified here and can be played with if you want to know more. ~~~ Copyright~~ This document has been placed in the public domain.	\| \| \| >	205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227	1.1 of this TIP has also been discarded because it was too complex. It would be possible to do without the minimum requested size fields if you give the responsibility to make sure the label has room to the GUI programmer. This could be rather awkward though, e.g. when making an internationalized application where labels can vary a lot. # Reference Implementation An almost finished implementation exists, and it's just a matter of polishing the last bits to create a patch for this proposal if it is accepted. At <http://www.dtek.chalmers.se/~d1peter/labframe.tcl> you can find a pure Tcl demo of labelled frames. Even though it uses sub-frames and thus do not live up to what I want to accomplish here it implements all new options as specified here and can be played with if you want to know more. # Copyright This document has been placed in the public domain.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 ~~37 38 39~~ 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55	itle: ~~Version: Author: State: Final Type: Project Vote: Done Created: 04-Jul-2001 Post-History: Keywords: widget,tk,panedwindow Tcl-Version: 8.4a2~~ ~~~ Abstract~~ This TIP proposes Tk core. A paned horizontal "panes", containing one widget, modern graphical directly by the Tk core. Windows Explorer; virtually every ~ Tk has long lagged of widgets provided by the toolkit. useful, and relevant, with widgets which interfaces. One that makes it easy with Tk. This paned window several Tcl-based each have quirks, the geometry calls to things ~~'&#~~ creation of very far from reality right now. creating new widgets, implementations corresponds to, each pane after the first. ~~panes, this~~ window container ~~sash; one frame~~ a proper megawidget bloat issue with a megawidget. A C-based paned these issues, and C implementation functions. Also,	< \| < \| \| \| \| \| \| \| \| > \| \| \| \| \| \| \|	# T Author: State: Final Type: Project Vote: Done Created: 04-Jul-2001 Post-History: Keywords: Tcl-Version: 8.4a2 ----- # Abstract This TIP proposes Tk core. A paned horizontal "panes", containing one widget, modern graphical directly by the Tk core. Windows Explorer; virtually every # Tk has long lagged of widgets provided by the toolkit. useful, and relevant, with widgets which interfaces. One that makes it easy with Tk. This paned window several Tcl-based each have quirks, the geometry calls to things _Tk\creation of very far from reality right now. creating new widgets, implementations corresponds to, each pane after the first. panes, this window container sash; one frame a proper megawidget bloat issue with a megawidget. A C-based paned these issues, and C implementation functions. Also,
83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 ~~391~~ 392 393 394 ~~395 396~~ 397 ~~398~~ 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 ~~416~~ 417 418	#i ~ The manual entry \|NAME \| \| \|SYNOPSIS \| \| \|STANDARD \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|DESCRIPTION \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|WIDGET COMMAND \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| ...? \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|RESIZING PANES \| \| \| \| \| \| \| \| \| \| ~ The widget described documentation and a full test suite. ~~Vu widget extension, http://tktable.sourceforge.net~~ ~~~ Notes~~ Suggestions for * Allow specification grid, to be used widget. * Allow a bindable Netscape's Messenger, collapse of the pane. * Integrate with a -setgrided widget, None of these are implemented at a ~ This document has	\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| >	![Example # The manual entry NAME SYNOPSIS STANDARD DESCRIPTION WIDGET COMMAND ...? RESIZING PANES # The widget described documentation and a full test suite. Vu widget extension, <http://tkt # Notes Suggestions for * Allow specification grid, to be used widget. * Allow a bindable Netscape's Messenger, collapse of the pane. * Integrate with a -setgrided widget, None of these are implemented at a # This document has