bzr branch
http://gegoxaren.bato24.eu/bzr/brz/remove-bazaar
| 
0.1.1
by Martin Pool
 Check in old existing knit code.  | 
1  | 
#! /usr/bin/python
 | 
2  | 
||
3  | 
# Copyright (C) 2005 Canonical Ltd
 | 
|
4  | 
||
| 
0.1.33
by Martin Pool
 add gpl text  | 
5  | 
# This program is free software; you can redistribute it and/or modify
 | 
6  | 
# it under the terms of the GNU General Public License as published by
 | 
|
7  | 
# the Free Software Foundation; either version 2 of the License, or
 | 
|
8  | 
# (at your option) any later version.
 | 
|
9  | 
||
10  | 
# This program is distributed in the hope that it will be useful,
 | 
|
11  | 
# but WITHOUT ANY WARRANTY; without even the implied warranty of
 | 
|
12  | 
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
 | 
|
13  | 
# GNU General Public License for more details.
 | 
|
14  | 
||
15  | 
# You should have received a copy of the GNU General Public License
 | 
|
16  | 
# along with this program; if not, write to the Free Software
 | 
|
17  | 
# Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA  02111-1307  USA
 | 
|
| 
0.1.1
by Martin Pool
 Check in old existing knit code.  | 
18  | 
|
19  | 
# Author: Martin Pool <mbp@canonical.com>
 | 
|
20  | 
||
21  | 
||
| 
0.1.38
by Martin Pool
 Rename knit to weave. (I don't think there's an existing module called weave.)  | 
22  | 
"""Weave - storage of related text file versions"""
 | 
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
23  | 
|
| 
928
by Martin Pool
 - go back to using plain builtin set()  | 
24  | 
# before intset (r923) 2000 versions in 41.5s
 | 
25  | 
# with intset (r926) 2000 versions in 93s !!!
 | 
|
26  | 
# better to just use plain sets.
 | 
|
27  | 
||
| 
931
by Martin Pool
 - experiment with making Weave._extract() return a list, not a generator - slightly faster  | 
28  | 
# making _extract build and return a list, rather than being a generator
 | 
29  | 
# takes 37.94s
 | 
|
30  | 
||
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
31  | 
# with python -O, r923 does 2000 versions in 36.87s
 | 
32  | 
||
33  | 
# with optimizations to avoid mutating lists - 35.75!  I guess copying
 | 
|
34  | 
# all the elements every time costs more than the small manipulations.
 | 
|
35  | 
# a surprisingly small change.
 | 
|
36  | 
||
37  | 
# r931, which avoids using a generator for extract, does 36.98s
 | 
|
38  | 
||
39  | 
# with memoized inclusions, takes 41.49s; not very good
 | 
|
40  | 
||
41  | 
# with slots, takes 37.35s; without takes 39.16, a bit surprising
 | 
|
42  | 
||
43  | 
# with the delta calculation mixed in with the add method, rather than
 | 
|
44  | 
# separated, takes 36.78s
 | 
|
45  | 
||
46  | 
# with delta folded in and mutation of the list, 36.13s
 | 
|
47  | 
||
| 
1079
by Martin Pool
 - weavefile can just use lists for read-in ancestry, not frozensets  | 
48  | 
# with all this and simplification of add code, 33s
 | 
49  | 
||
50  | 
||
51  | 
||
52  | 
||
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
53  | 
|
| 
0.1.61
by Martin Pool
 doc  | 
54  | 
# TODO: Perhaps have copy method for Weave instances?
 | 
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
55  | 
|
| 
0.1.58
by Martin Pool
 doc  | 
56  | 
# XXX: If we do weaves this way, will a merge still behave the same
 | 
57  | 
# way if it's done in a different order?  That's a pretty desirable
 | 
|
58  | 
# property.
 | 
|
59  | 
||
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
60  | 
# TODO: Nothing here so far assumes the lines are really \n newlines,
 | 
61  | 
# rather than being split up in some other way.  We could accomodate
 | 
|
62  | 
# binaries, perhaps by naively splitting on \n or perhaps using
 | 
|
63  | 
# something like a rolling checksum.
 | 
|
64  | 
||
| 
0.1.85
by Martin Pool
 doc  | 
65  | 
# TODO: End marker for each version so we can stop reading?
 | 
| 
0.1.69
by Martin Pool
 Simple text-based format for storing weaves, cleaner than  | 
66  | 
|
67  | 
# TODO: Check that no insertion occurs inside a deletion that was
 | 
|
68  | 
# active in the version of the insertion.
 | 
|
69  | 
||
| 
912
by Martin Pool
 - update todos for weave  | 
70  | 
# TODO: In addition to the SHA-1 check, perhaps have some code that
 | 
71  | 
# checks structural constraints of the weave: ie that insertions are
 | 
|
72  | 
# properly nested, that there is no text outside of an insertion, that
 | 
|
73  | 
# insertions or deletions are not repeated, etc.
 | 
|
| 
0.1.85
by Martin Pool
 doc  | 
74  | 
|
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
75  | 
# TODO: Parallel-extract that passes back each line along with a
 | 
76  | 
# description of which revisions include it.  Nice for checking all
 | 
|
77  | 
# shas in parallel.
 | 
|
78  | 
||
| 
1082
by Martin Pool
 - lift imports  | 
79  | 
# TODO: Using a single _extract routine and then processing the output
 | 
80  | 
# is probably inefficient.  It's simple enough that we can afford to
 | 
|
81  | 
# have slight specializations for different ways its used: annotate,
 | 
|
82  | 
# basis for add, get, etc.
 | 
|
83  | 
||
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
84  | 
# TODO: Perhaps the API should work only in names to hide the integer
 | 
85  | 
# indexes from the user?
 | 
|
86  | 
||
87  | 
||
| 
1082
by Martin Pool
 - lift imports  | 
88  | 
|
89  | 
import sha  | 
|
| 
1237
by Martin Pool
 - allow the same version to be repeatedly added to a weave  | 
90  | 
|
| 
1196
by Martin Pool
 - [WIP] retrieve historical texts from weaves  | 
91  | 
from cStringIO import StringIO  | 
| 
0.1.85
by Martin Pool
 doc  | 
92  | 
|
| 
1237
by Martin Pool
 - allow the same version to be repeatedly added to a weave  | 
93  | 
from bzrlib.osutils import sha_strings  | 
94  | 
||
| 
924
by Martin Pool
 - Add IntSet class  | 
95  | 
|
| 
0.1.47
by Martin Pool
 New WeaveError and WeaveFormatError rather than assertions.  | 
96  | 
class WeaveError(Exception):  | 
97  | 
"""Exception in processing weave"""  | 
|
98  | 
||
99  | 
||
100  | 
class WeaveFormatError(WeaveError):  | 
|
101  | 
"""Weave invariant violated"""  | 
|
102  | 
||
103  | 
||
| 
0.1.38
by Martin Pool
 Rename knit to weave. (I don't think there's an existing module called weave.)  | 
104  | 
class Weave(object):  | 
105  | 
"""weave - versioned text file storage.  | 
|
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
106  | 
    
 | 
| 
0.1.72
by Martin Pool
 Go back to weave lines normally having newlines at the end.  | 
107  | 
    A Weave manages versions of line-based text files, keeping track
 | 
108  | 
    of the originating version for each line.
 | 
|
109  | 
||
110  | 
    To clients the "lines" of the file are represented as a list of strings.
 | 
|
111  | 
    These strings  will typically have terminal newline characters, but
 | 
|
112  | 
    this is not required.  In particular files commonly do not have a newline
 | 
|
113  | 
    at the end of the file.
 | 
|
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
114  | 
|
| 
0.1.4
by Martin Pool
 Start indexing knits by both integer and version string.  | 
115  | 
    Texts can be identified in either of two ways:
 | 
116  | 
||
117  | 
    * a nonnegative index number.
 | 
|
118  | 
||
| 
1075
by Martin Pool
 - don't store redundant version number at end of insert blocks  | 
119  | 
    * a version-id string. (not implemented yet)
 | 
| 
0.1.4
by Martin Pool
 Start indexing knits by both integer and version string.  | 
120  | 
|
| 
0.1.38
by Martin Pool
 Rename knit to weave. (I don't think there's an existing module called weave.)  | 
121  | 
    Typically the index number will be valid only inside this weave and
 | 
| 
0.1.4
by Martin Pool
 Start indexing knits by both integer and version string.  | 
122  | 
    the version-id is used to reference it in the larger world.
 | 
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
123  | 
|
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
124  | 
    The weave is represented as a list mixing edit instructions and
 | 
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
125  | 
    literal text.  Each entry in _weave can be either a string (or
 | 
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
126  | 
    unicode), or a tuple.  If a string, it means that the given line
 | 
127  | 
    should be output in the currently active revisions.
 | 
|
128  | 
||
129  | 
    If a tuple, it gives a processing instruction saying in which
 | 
|
130  | 
    revisions the enclosed lines are active.  The tuple has the form
 | 
|
131  | 
    (instruction, version).
 | 
|
132  | 
||
133  | 
    The instruction can be '{' or '}' for an insertion block, and '['
 | 
|
134  | 
    and ']' for a deletion block respectively.  The version is the
 | 
|
| 
0.1.45
by Martin Pool
 doc  | 
135  | 
    integer version index.  There is no replace operator, only deletes
 | 
| 
1075
by Martin Pool
 - don't store redundant version number at end of insert blocks  | 
136  | 
    and inserts.  For '}', the end of an insertion, there is no
 | 
137  | 
    version parameter because it always closes the most recently
 | 
|
138  | 
    opened insertion.
 | 
|
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
139  | 
|
| 
0.1.41
by Martin Pool
 Doc  | 
140  | 
    Constraints/notes:
 | 
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
141  | 
|
142  | 
    * A later version can delete lines that were introduced by any
 | 
|
143  | 
      number of ancestor versions; this implies that deletion
 | 
|
144  | 
      instructions can span insertion blocks without regard to the
 | 
|
145  | 
      insertion block's nesting.
 | 
|
146  | 
||
| 
0.1.41
by Martin Pool
 Doc  | 
147  | 
    * Similarly, deletions need not be properly nested with regard to
 | 
148  | 
      each other, because they might have been generated by
 | 
|
149  | 
      independent revisions.
 | 
|
150  | 
||
| 
0.1.45
by Martin Pool
 doc  | 
151  | 
    * Insertions are always made by inserting a new bracketed block
 | 
152  | 
      into a single point in the previous weave.  This implies they
 | 
|
153  | 
      can nest but not overlap, and the nesting must always have later
 | 
|
154  | 
      insertions on the inside.
 | 
|
155  | 
||
| 
0.1.41
by Martin Pool
 Doc  | 
156  | 
    * It doesn't seem very useful to have an active insertion
 | 
157  | 
      inside an inactive insertion, but it might happen.
 | 
|
| 
0.1.45
by Martin Pool
 doc  | 
158  | 
      
 | 
| 
0.1.41
by Martin Pool
 Doc  | 
159  | 
    * Therefore, all instructions are always"considered"; that
 | 
160  | 
      is passed onto and off the stack.  An outer inactive block
 | 
|
161  | 
      doesn't disable an inner block.
 | 
|
162  | 
||
163  | 
    * Lines are enabled if the most recent enclosing insertion is
 | 
|
164  | 
      active and none of the enclosing deletions are active.
 | 
|
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
165  | 
|
| 
0.1.49
by Martin Pool
 Add another constraint: revisions should not delete text that they  | 
166  | 
    * There is no point having a deletion directly inside its own
 | 
167  | 
      insertion; you might as well just not write it.  And there
 | 
|
168  | 
      should be no way to get an earlier version deleting a later
 | 
|
169  | 
      version.
 | 
|
170  | 
||
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
171  | 
    _weave
 | 
172  | 
        Text of the weave; list of control instruction tuples and strings.
 | 
|
| 
0.1.4
by Martin Pool
 Start indexing knits by both integer and version string.  | 
173  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
174  | 
    _parents
 | 
| 
892
by Martin Pool
 - weave stores only direct parents, and calculates and memoizes expansion as needed  | 
175  | 
        List of parents, indexed by version number.
 | 
176  | 
        It is only necessary to store the minimal set of parents for
 | 
|
177  | 
        each version; the parent's parents are implied.
 | 
|
| 
0.1.13
by Martin Pool
 Knit structure now allows for versions to include the lines present in other  | 
178  | 
|
| 
0.1.89
by Martin Pool
 Store SHA1 in weave file for later verification  | 
179  | 
    _sha1s
 | 
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
180  | 
        List of hex SHA-1 of each version.
 | 
181  | 
||
182  | 
    _names
 | 
|
183  | 
        List of symbolic names for each version.  Each should be unique.
 | 
|
184  | 
||
185  | 
    _name_map
 | 
|
186  | 
        For each name, the version number.
 | 
|
| 
1209
by Martin Pool
 - Add Weave._weave_name for debugging purposes  | 
187  | 
|
188  | 
    _weave_name
 | 
|
189  | 
        Descriptive name of this weave; typically the filename if known.
 | 
|
190  | 
        Set by read_weave.
 | 
|
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
191  | 
    """
 | 
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
192  | 
|
| 
1209
by Martin Pool
 - Add Weave._weave_name for debugging purposes  | 
193  | 
__slots__ = ['_weave', '_parents', '_sha1s', '_names', '_name_map',  | 
194  | 
'_weave_name']  | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
195  | 
|
| 
1209
by Martin Pool
 - Add Weave._weave_name for debugging purposes  | 
196  | 
def __init__(self, weave_name=None):  | 
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
197  | 
self._weave = []  | 
198  | 
self._parents = []  | 
|
| 
0.1.89
by Martin Pool
 Store SHA1 in weave file for later verification  | 
199  | 
self._sha1s = []  | 
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
200  | 
self._names = []  | 
201  | 
self._name_map = {}  | 
|
| 
1209
by Martin Pool
 - Add Weave._weave_name for debugging purposes  | 
202  | 
self._weave_name = weave_name  | 
| 
0.1.60
by Martin Pool
 Weave eq and ne methods  | 
203  | 
|
204  | 
||
205  | 
def __eq__(self, other):  | 
|
206  | 
if not isinstance(other, Weave):  | 
|
207  | 
return False  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
208  | 
return self._parents == other._parents \  | 
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
209  | 
and self._weave == other._weave \  | 
210  | 
and self._sha1s == other._sha1s  | 
|
211  | 
||
| 
0.1.60
by Martin Pool
 Weave eq and ne methods  | 
212  | 
|
213  | 
def __ne__(self, other):  | 
|
214  | 
return not self.__eq__(other)  | 
|
215  | 
||
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
216  | 
|
| 
1269
by Martin Pool
 - some weave operations automatically look up symbolic names if supplied  | 
217  | 
def maybe_lookup(self, name_or_index):  | 
218  | 
"""Convert possible symbolic name to index, or pass through indexes."""  | 
|
219  | 
if isinstance(name_or_index, (int, long)):  | 
|
220  | 
return name_or_index  | 
|
221  | 
else:  | 
|
222  | 
return self.lookup(name_or_index)  | 
|
223  | 
||
224  | 
||
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
225  | 
def lookup(self, name):  | 
| 
1269
by Martin Pool
 - some weave operations automatically look up symbolic names if supplied  | 
226  | 
"""Convert symbolic version name to index."""  | 
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
227  | 
try:  | 
228  | 
return self._name_map[name]  | 
|
229  | 
except KeyError:  | 
|
| 
1260
by Martin Pool
 - some updates for fetch/update function  | 
230  | 
raise WeaveError("name %r not present in weave %r" %  | 
| 
1209
by Martin Pool
 - Add Weave._weave_name for debugging purposes  | 
231  | 
(name, self._weave_name))  | 
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
232  | 
|
| 
1229
by Martin Pool
 - new Weave.idx_to_name and .parents methods  | 
233  | 
|
234  | 
def idx_to_name(self, version):  | 
|
235  | 
return self._names[version]  | 
|
236  | 
||
| 
1237
by Martin Pool
 - allow the same version to be repeatedly added to a weave  | 
237  | 
|
238  | 
def _check_repeated_add(self, name, parents, text):  | 
|
239  | 
"""Check that a duplicated add is OK.  | 
|
240  | 
||
241  | 
        If it is, return the (old) index; otherwise raise an exception.
 | 
|
242  | 
        """
 | 
|
243  | 
idx = self.lookup(name)  | 
|
244  | 
if sorted(self._parents[idx]) != sorted(parents):  | 
|
245  | 
raise WeaveError("name \"%s\" already present in weave "  | 
|
246  | 
"with different parents" % name)  | 
|
247  | 
new_sha1 = sha_strings(text)  | 
|
248  | 
if new_sha1 != self._sha1s[idx]:  | 
|
249  | 
raise WeaveError("name \"%s\" already present in weave "  | 
|
250  | 
"with different text" % name)  | 
|
251  | 
return idx  | 
|
252  | 
||
253  | 
||
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
254  | 
|
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
255  | 
def add(self, name, parents, text):  | 
| 
0.1.4
by Martin Pool
 Start indexing knits by both integer and version string.  | 
256  | 
"""Add a single text on top of the weave.  | 
| 
0.1.36
by Martin Pool
 doc  | 
257  | 
  
 | 
| 
0.1.26
by Martin Pool
 Refactor parameters to add command  | 
258  | 
        Returns the index number of the newly added version.
 | 
259  | 
||
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
260  | 
        name
 | 
261  | 
            Symbolic name for this version.
 | 
|
262  | 
            (Typically the revision-id of the revision that added it.)
 | 
|
263  | 
||
| 
0.1.26
by Martin Pool
 Refactor parameters to add command  | 
264  | 
        parents
 | 
| 
892
by Martin Pool
 - weave stores only direct parents, and calculates and memoizes expansion as needed  | 
265  | 
            List or set of direct parent version numbers.
 | 
266  | 
            
 | 
|
| 
0.1.26
by Martin Pool
 Refactor parameters to add command  | 
267  | 
        text
 | 
268  | 
            Sequence of lines to be added in the new version."""
 | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
269  | 
|
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
270  | 
assert isinstance(name, basestring)  | 
271  | 
if name in self._name_map:  | 
|
| 
1237
by Martin Pool
 - allow the same version to be repeatedly added to a weave  | 
272  | 
return self._check_repeated_add(name, parents, text)  | 
| 
1269
by Martin Pool
 - some weave operations automatically look up symbolic names if supplied  | 
273  | 
|
274  | 
parents = map(self.maybe_lookup, parents)  | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
275  | 
self._check_versions(parents)  | 
| 
0.1.82
by Martin Pool
 Small weave optimizations  | 
276  | 
        ## self._check_lines(text)
 | 
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
277  | 
new_version = len(self._parents)  | 
| 
0.1.5
by Martin Pool
 Add test for storing two text versions.  | 
278  | 
|
| 
1237
by Martin Pool
 - allow the same version to be repeatedly added to a weave  | 
279  | 
sha1 = sha_strings(text)  | 
| 
0.1.89
by Martin Pool
 Store SHA1 in weave file for later verification  | 
280  | 
|
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
281  | 
        # if we abort after here the (in-memory) weave will be corrupt because only
 | 
282  | 
        # some fields are updated
 | 
|
283  | 
self._parents.append(parents[:])  | 
|
| 
0.1.89
by Martin Pool
 Store SHA1 in weave file for later verification  | 
284  | 
self._sha1s.append(sha1)  | 
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
285  | 
self._names.append(name)  | 
286  | 
self._name_map[name] = new_version  | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
287  | 
|
288  | 
||
289  | 
if not parents:  | 
|
290  | 
            # special case; adding with no parents revision; can do
 | 
|
291  | 
            # this more quickly by just appending unconditionally.
 | 
|
292  | 
            # even more specially, if we're adding an empty text we
 | 
|
293  | 
            # need do nothing at all.
 | 
|
294  | 
if text:  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
295  | 
self._weave.append(('{', new_version))  | 
296  | 
self._weave.extend(text)  | 
|
| 
1075
by Martin Pool
 - don't store redundant version number at end of insert blocks  | 
297  | 
self._weave.append(('}', None))  | 
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
298  | 
|
299  | 
return new_version  | 
|
300  | 
||
| 
941
by Martin Pool
 - allow for parents specified to Weave.add to be a set  | 
301  | 
if len(parents) == 1:  | 
302  | 
pv = list(parents)[0]  | 
|
303  | 
if sha1 == self._sha1s[pv]:  | 
|
304  | 
                # special case: same as the single parent
 | 
|
305  | 
return new_version  | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
306  | 
|
307  | 
||
308  | 
ancestors = self.inclusions(parents)  | 
|
309  | 
||
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
310  | 
l = self._weave  | 
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
311  | 
|
312  | 
        # basis a list of (origin, lineno, line)
 | 
|
313  | 
basis_lineno = []  | 
|
314  | 
basis_lines = []  | 
|
315  | 
for origin, lineno, line in self._extract(ancestors):  | 
|
316  | 
basis_lineno.append(lineno)  | 
|
317  | 
basis_lines.append(line)  | 
|
318  | 
||
| 
974.1.26
by aaron.bentley at utoronto
 merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472  | 
319  | 
        # another small special case: a merge, producing the same text
 | 
320  | 
        # as auto-merge
 | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
321  | 
if text == basis_lines:  | 
322  | 
return new_version  | 
|
323  | 
||
324  | 
        # add a sentinal, because we can also match against the final line
 | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
325  | 
basis_lineno.append(len(self._weave))  | 
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
326  | 
|
327  | 
        # XXX: which line of the weave should we really consider
 | 
|
328  | 
        # matches the end of the file?  the current code says it's the
 | 
|
329  | 
        # last line of the weave?
 | 
|
330  | 
||
331  | 
        #print 'basis_lines:', basis_lines
 | 
|
332  | 
        #print 'new_lines:  ', lines
 | 
|
333  | 
||
334  | 
from difflib import SequenceMatcher  | 
|
335  | 
s = SequenceMatcher(None, basis_lines, text)  | 
|
336  | 
||
337  | 
        # offset gives the number of lines that have been inserted
 | 
|
338  | 
        # into the weave up to the current point; if the original edit instruction
 | 
|
339  | 
        # says to change line A then we actually change (A+offset)
 | 
|
340  | 
offset = 0  | 
|
341  | 
||
342  | 
for tag, i1, i2, j1, j2 in s.get_opcodes():  | 
|
343  | 
            # i1,i2 are given in offsets within basis_lines; we need to map them
 | 
|
344  | 
            # back to offsets within the entire weave
 | 
|
345  | 
            #print 'raw match', tag, i1, i2, j1, j2
 | 
|
346  | 
if tag == 'equal':  | 
|
347  | 
                continue
 | 
|
348  | 
||
349  | 
i1 = basis_lineno[i1]  | 
|
350  | 
i2 = basis_lineno[i2]  | 
|
351  | 
||
352  | 
assert 0 <= j1 <= j2 <= len(text)  | 
|
353  | 
||
354  | 
            #print tag, i1, i2, j1, j2
 | 
|
355  | 
||
356  | 
            # the deletion and insertion are handled separately.
 | 
|
357  | 
            # first delete the region.
 | 
|
358  | 
if i1 != i2:  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
359  | 
self._weave.insert(i1+offset, ('[', new_version))  | 
360  | 
self._weave.insert(i2+offset+1, (']', new_version))  | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
361  | 
offset += 2  | 
362  | 
||
363  | 
if j1 != j2:  | 
|
364  | 
                # there may have been a deletion spanning up to
 | 
|
365  | 
                # i2; we want to insert after this region to make sure
 | 
|
366  | 
                # we don't destroy ourselves
 | 
|
367  | 
i = i2 + offset  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
368  | 
self._weave[i:i] = ([('{', new_version)]  | 
| 
1075
by Martin Pool
 - don't store redundant version number at end of insert blocks  | 
369  | 
+ text[j1:j2]  | 
370  | 
+ [('}', None)])  | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
371  | 
offset += 2 + (j2 - j1)  | 
372  | 
||
373  | 
return new_version  | 
|
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
374  | 
|
| 
0.1.27
by Martin Pool
 Check that version numbers passed in are reasonable  | 
375  | 
|
| 
0.1.78
by Martin Pool
 Rename Weave.get_included to inclusions and getiter to get_iter  | 
376  | 
def inclusions(self, versions):  | 
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
377  | 
"""Return set of all ancestors of given version(s)."""  | 
| 
928
by Martin Pool
 - go back to using plain builtin set()  | 
378  | 
i = set(versions)  | 
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
379  | 
v = max(versions)  | 
| 
892
by Martin Pool
 - weave stores only direct parents, and calculates and memoizes expansion as needed  | 
380  | 
try:  | 
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
381  | 
while v >= 0:  | 
382  | 
if v in i:  | 
|
383  | 
                    # include all its parents
 | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
384  | 
i.update(self._parents[v])  | 
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
385  | 
v -= 1  | 
386  | 
return i  | 
|
| 
892
by Martin Pool
 - weave stores only direct parents, and calculates and memoizes expansion as needed  | 
387  | 
except IndexError:  | 
388  | 
raise ValueError("version %d not present in weave" % v)  | 
|
| 
0.1.77
by Martin Pool
 New Weave.get_included() does transitive expansion  | 
389  | 
|
390  | 
||
| 
1229
by Martin Pool
 - new Weave.idx_to_name and .parents methods  | 
391  | 
def parents(self, version):  | 
392  | 
return self._parents[version]  | 
|
393  | 
||
394  | 
||
| 
890
by Martin Pool
 - weave info should show minimal expression of parents  | 
395  | 
def minimal_parents(self, version):  | 
396  | 
"""Find the minimal set of parents for the version."""  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
397  | 
included = self._parents[version]  | 
| 
890
by Martin Pool
 - weave info should show minimal expression of parents  | 
398  | 
if not included:  | 
399  | 
return []  | 
|
400  | 
||
401  | 
li = list(included)  | 
|
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
402  | 
li.sort(reverse=True)  | 
| 
890
by Martin Pool
 - weave info should show minimal expression of parents  | 
403  | 
|
404  | 
mininc = []  | 
|
| 
928
by Martin Pool
 - go back to using plain builtin set()  | 
405  | 
gotit = set()  | 
| 
890
by Martin Pool
 - weave info should show minimal expression of parents  | 
406  | 
|
407  | 
for pv in li:  | 
|
408  | 
if pv not in gotit:  | 
|
409  | 
mininc.append(pv)  | 
|
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
410  | 
gotit.update(self.inclusions(pv))  | 
| 
890
by Martin Pool
 - weave info should show minimal expression of parents  | 
411  | 
|
412  | 
assert mininc[0] >= 0  | 
|
413  | 
assert mininc[-1] < version  | 
|
414  | 
return mininc  | 
|
415  | 
||
416  | 
||
| 
0.1.75
by Martin Pool
 Remove VerInfo class; just store sets directly in the list of  | 
417  | 
|
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
418  | 
def _check_lines(self, text):  | 
419  | 
if not isinstance(text, list):  | 
|
420  | 
raise ValueError("text should be a list, not %s" % type(text))  | 
|
421  | 
||
422  | 
for l in text:  | 
|
423  | 
if not isinstance(l, basestring):  | 
|
| 
869
by Martin Pool
 - more weave.py command line options  | 
424  | 
raise ValueError("text line should be a string or unicode, not %s"  | 
425  | 
% type(l))  | 
|
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
426  | 
|
427  | 
||
428  | 
||
| 
0.1.27
by Martin Pool
 Check that version numbers passed in are reasonable  | 
429  | 
def _check_versions(self, indexes):  | 
430  | 
"""Check everything in the sequence of indexes is valid"""  | 
|
431  | 
for i in indexes:  | 
|
432  | 
try:  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
433  | 
self._parents[i]  | 
| 
0.1.27
by Martin Pool
 Check that version numbers passed in are reasonable  | 
434  | 
except IndexError:  | 
435  | 
raise IndexError("invalid version number %r" % i)  | 
|
436  | 
||
| 
0.1.2
by Martin Pool
 Import testsweet module adapted from bzr.  | 
437  | 
|
| 
1269
by Martin Pool
 - some weave operations automatically look up symbolic names if supplied  | 
438  | 
def annotate(self, name_or_index):  | 
439  | 
return list(self.annotate_iter(name_or_index))  | 
|
440  | 
||
441  | 
||
442  | 
def annotate_iter(self, name_or_index):  | 
|
| 
0.1.7
by Martin Pool
 Add trivial annotate text  | 
443  | 
"""Yield list of (index-id, line) pairs for the specified version.  | 
444  | 
||
445  | 
        The index indicates when the line originated in the weave."""
 | 
|
| 
1269
by Martin Pool
 - some weave operations automatically look up symbolic names if supplied  | 
446  | 
incls = [self.maybe_lookup(name_or_index)]  | 
447  | 
for origin, lineno, text in self._extract(incls):  | 
|
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
448  | 
yield origin, text  | 
| 
0.1.22
by Martin Pool
 Calculate delta for new versions relative to a set of parent versions.  | 
449  | 
|
450  | 
||
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
451  | 
def _walk(self):  | 
452  | 
"""Walk the weave.  | 
|
453  | 
||
454  | 
        Yields sequence of
 | 
|
455  | 
        (lineno, insert, deletes, text)
 | 
|
456  | 
        for each literal line.
 | 
|
457  | 
        """
 | 
|
458  | 
||
459  | 
istack = []  | 
|
| 
928
by Martin Pool
 - go back to using plain builtin set()  | 
460  | 
dset = set()  | 
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
461  | 
|
462  | 
lineno = 0 # line of weave, 0-based  | 
|
463  | 
||
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
464  | 
for l in self._weave:  | 
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
465  | 
if isinstance(l, tuple):  | 
466  | 
c, v = l  | 
|
467  | 
isactive = None  | 
|
468  | 
if c == '{':  | 
|
469  | 
istack.append(v)  | 
|
470  | 
elif c == '}':  | 
|
| 
1075
by Martin Pool
 - don't store redundant version number at end of insert blocks  | 
471  | 
istack.pop()  | 
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
472  | 
elif c == '[':  | 
| 
926
by Martin Pool
 - update more weave code to use intsets  | 
473  | 
assert v not in dset  | 
474  | 
dset.add(v)  | 
|
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
475  | 
elif c == ']':  | 
| 
926
by Martin Pool
 - update more weave code to use intsets  | 
476  | 
dset.remove(v)  | 
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
477  | 
else:  | 
478  | 
raise WeaveFormatError('unexpected instruction %r'  | 
|
479  | 
% v)  | 
|
480  | 
else:  | 
|
481  | 
assert isinstance(l, basestring)  | 
|
482  | 
assert istack  | 
|
483  | 
yield lineno, istack[-1], dset, l  | 
|
484  | 
lineno += 1  | 
|
485  | 
||
486  | 
||
487  | 
||
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
488  | 
def _extract(self, versions):  | 
| 
0.1.20
by Martin Pool
 Factor out Knit.extract() method  | 
489  | 
"""Yield annotation of lines in included set.  | 
490  | 
||
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
491  | 
        Yields a sequence of tuples (origin, lineno, text), where
 | 
492  | 
        origin is the origin version, lineno the index in the weave,
 | 
|
493  | 
        and text the text of the line.
 | 
|
494  | 
||
| 
0.1.20
by Martin Pool
 Factor out Knit.extract() method  | 
495  | 
        The set typically but not necessarily corresponds to a version.
 | 
496  | 
        """
 | 
|
| 
1196
by Martin Pool
 - [WIP] retrieve historical texts from weaves  | 
497  | 
for i in versions:  | 
498  | 
if not isinstance(i, int):  | 
|
499  | 
raise ValueError(i)  | 
|
500  | 
||
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
501  | 
included = self.inclusions(versions)  | 
| 
881
by Martin Pool
 - faster weave extraction  | 
502  | 
|
503  | 
istack = []  | 
|
| 
928
by Martin Pool
 - go back to using plain builtin set()  | 
504  | 
dset = set()  | 
| 
0.1.48
by Martin Pool
 Basic parsing of delete instructions.  | 
505  | 
|
506  | 
lineno = 0 # line of weave, 0-based  | 
|
| 
891
by Martin Pool
 - fix up refactoring of weave  | 
507  | 
|
| 
894
by Martin Pool
 - small optimization for weave extract  | 
508  | 
isactive = None  | 
| 
0.1.85
by Martin Pool
 doc  | 
509  | 
|
| 
931
by Martin Pool
 - experiment with making Weave._extract() return a list, not a generator - slightly faster  | 
510  | 
result = []  | 
511  | 
||
| 
0.1.63
by Martin Pool
 Abbreviate WeaveFormatError in some code  | 
512  | 
WFE = WeaveFormatError  | 
| 
0.1.95
by Martin Pool
 - preliminary merge conflict detection  | 
513  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
514  | 
for l in self._weave:  | 
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
515  | 
if isinstance(l, tuple):  | 
516  | 
c, v = l  | 
|
| 
894
by Martin Pool
 - small optimization for weave extract  | 
517  | 
isactive = None  | 
| 
891
by Martin Pool
 - fix up refactoring of weave  | 
518  | 
if c == '{':  | 
519  | 
assert v not in istack  | 
|
520  | 
istack.append(v)  | 
|
521  | 
elif c == '}':  | 
|
| 
1075
by Martin Pool
 - don't store redundant version number at end of insert blocks  | 
522  | 
istack.pop()  | 
| 
891
by Martin Pool
 - fix up refactoring of weave  | 
523  | 
elif c == '[':  | 
524  | 
if v in included:  | 
|
| 
881
by Martin Pool
 - faster weave extraction  | 
525  | 
assert v not in dset  | 
| 
0.1.48
by Martin Pool
 Basic parsing of delete instructions.  | 
526  | 
dset.add(v)  | 
| 
891
by Martin Pool
 - fix up refactoring of weave  | 
527  | 
else:  | 
528  | 
assert c == ']'  | 
|
529  | 
if v in included:  | 
|
| 
881
by Martin Pool
 - faster weave extraction  | 
530  | 
assert v in dset  | 
| 
0.1.48
by Martin Pool
 Basic parsing of delete instructions.  | 
531  | 
dset.remove(v)  | 
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
532  | 
else:  | 
533  | 
assert isinstance(l, basestring)  | 
|
| 
894
by Martin Pool
 - small optimization for weave extract  | 
534  | 
if isactive is None:  | 
535  | 
isactive = (not dset) and istack and (istack[-1] in included)  | 
|
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
536  | 
if isactive:  | 
| 
931
by Martin Pool
 - experiment with making Weave._extract() return a list, not a generator - slightly faster  | 
537  | 
result.append((istack[-1], lineno, l))  | 
| 
0.1.39
by Martin Pool
 Change to a more realistic weave structure which can represent insertions and  | 
538  | 
lineno += 1  | 
| 
0.1.7
by Martin Pool
 Add trivial annotate text  | 
539  | 
|
| 
0.1.46
by Martin Pool
 More constraints on structure of weave, and checks that they work  | 
540  | 
if istack:  | 
| 
0.1.63
by Martin Pool
 Abbreviate WeaveFormatError in some code  | 
541  | 
raise WFE("unclosed insertion blocks at end of weave",  | 
| 
0.1.47
by Martin Pool
 New WeaveError and WeaveFormatError rather than assertions.  | 
542  | 
istack)  | 
| 
0.1.48
by Martin Pool
 Basic parsing of delete instructions.  | 
543  | 
if dset:  | 
| 
0.1.63
by Martin Pool
 Abbreviate WeaveFormatError in some code  | 
544  | 
raise WFE("unclosed deletion blocks at end of weave",  | 
| 
0.1.48
by Martin Pool
 Basic parsing of delete instructions.  | 
545  | 
dset)  | 
| 
0.1.40
by Martin Pool
 Add test for extracting from weave with nested insertions  | 
546  | 
|
| 
931
by Martin Pool
 - experiment with making Weave._extract() return a list, not a generator - slightly faster  | 
547  | 
return result  | 
548  | 
||
549  | 
||
| 
0.1.7
by Martin Pool
 Add trivial annotate text  | 
550  | 
|
| 
1269
by Martin Pool
 - some weave operations automatically look up symbolic names if supplied  | 
551  | 
def get_iter(self, name_or_index):  | 
| 
0.1.5
by Martin Pool
 Add test for storing two text versions.  | 
552  | 
"""Yield lines for the specified version."""  | 
| 
1269
by Martin Pool
 - some weave operations automatically look up symbolic names if supplied  | 
553  | 
incls = [self.maybe_lookup(name_or_index)]  | 
554  | 
for origin, lineno, line in self._extract(incls):  | 
|
| 
0.1.8
by Martin Pool
 Unify get/annotate code  | 
555  | 
yield line  | 
| 
0.1.5
by Martin Pool
 Add test for storing two text versions.  | 
556  | 
|
557  | 
||
| 
1196
by Martin Pool
 - [WIP] retrieve historical texts from weaves  | 
558  | 
def get_text(self, version):  | 
559  | 
assert isinstance(version, int)  | 
|
560  | 
s = StringIO()  | 
|
561  | 
s.writelines(self.get_iter(version))  | 
|
562  | 
return s.getvalue()  | 
|
563  | 
||
564  | 
||
| 
1269
by Martin Pool
 - some weave operations automatically look up symbolic names if supplied  | 
565  | 
def get(self, name_or_index):  | 
566  | 
return list(self.get_iter(name_or_index))  | 
|
| 
0.1.1
by Martin Pool
 Check in old existing knit code.  | 
567  | 
|
568  | 
||
| 
0.1.95
by Martin Pool
 - preliminary merge conflict detection  | 
569  | 
def mash_iter(self, included):  | 
| 
0.1.65
by Martin Pool
 Add Weave.merge_iter to get automerged lines  | 
570  | 
"""Return composed version of multiple included versions."""  | 
| 
893
by Martin Pool
 - Refactor weave calculation of inclusions  | 
571  | 
for origin, lineno, text in self._extract(included):  | 
| 
0.1.65
by Martin Pool
 Add Weave.merge_iter to get automerged lines  | 
572  | 
yield text  | 
573  | 
||
574  | 
||
| 
0.1.11
by Martin Pool
 Add Knit.dump method  | 
575  | 
def dump(self, to_file):  | 
576  | 
from pprint import pprint  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
577  | 
print >>to_file, "Weave._weave = ",  | 
578  | 
pprint(self._weave, to_file)  | 
|
579  | 
print >>to_file, "Weave._parents = ",  | 
|
580  | 
pprint(self._parents, to_file)  | 
|
| 
0.1.11
by Martin Pool
 Add Knit.dump method  | 
581  | 
|
582  | 
||
| 
0.1.91
by Martin Pool
 Update Weave.check  | 
583  | 
|
584  | 
def numversions(self):  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
585  | 
l = len(self._parents)  | 
| 
0.1.91
by Martin Pool
 Update Weave.check  | 
586  | 
assert l == len(self._sha1s)  | 
587  | 
return l  | 
|
588  | 
||
589  | 
||
| 
946
by Martin Pool
 - weave info only shows the weave headers, doesn't extract every version:  | 
590  | 
def __len__(self):  | 
591  | 
return self.numversions()  | 
|
592  | 
||
593  | 
||
| 
894
by Martin Pool
 - small optimization for weave extract  | 
594  | 
def check(self, progress_bar=None):  | 
| 
0.1.91
by Martin Pool
 Update Weave.check  | 
595  | 
        # check no circular inclusions
 | 
596  | 
for version in range(self.numversions()):  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
597  | 
inclusions = list(self._parents[version])  | 
| 
0.1.91
by Martin Pool
 Update Weave.check  | 
598  | 
if inclusions:  | 
599  | 
inclusions.sort()  | 
|
600  | 
if inclusions[-1] >= version:  | 
|
| 
0.1.47
by Martin Pool
 New WeaveError and WeaveFormatError rather than assertions.  | 
601  | 
raise WeaveFormatError("invalid included version %d for index %d"  | 
| 
0.1.91
by Martin Pool
 Update Weave.check  | 
602  | 
% (inclusions[-1], version))  | 
603  | 
||
604  | 
        # try extracting all versions; this is a bit slow and parallel
 | 
|
605  | 
        # extraction could be used
 | 
|
| 
894
by Martin Pool
 - small optimization for weave extract  | 
606  | 
nv = self.numversions()  | 
607  | 
for version in range(nv):  | 
|
608  | 
if progress_bar:  | 
|
609  | 
progress_bar.update('checking text', version, nv)  | 
|
| 
0.1.91
by Martin Pool
 Update Weave.check  | 
610  | 
s = sha.new()  | 
611  | 
for l in self.get_iter(version):  | 
|
612  | 
s.update(l)  | 
|
613  | 
hd = s.hexdigest()  | 
|
614  | 
expected = self._sha1s[version]  | 
|
615  | 
if hd != expected:  | 
|
616  | 
raise WeaveError("mismatched sha1 for version %d; "  | 
|
617  | 
"got %s, expected %s"  | 
|
618  | 
% (version, hd, expected))  | 
|
| 
0.1.18
by Martin Pool
 Better Knit.dump method  | 
619  | 
|
| 
881
by Martin Pool
 - faster weave extraction  | 
620  | 
        # TODO: check insertions are properly nested, that there are
 | 
621  | 
        # no lines outside of insertion blocks, that deletions are
 | 
|
622  | 
        # properly paired, etc.
 | 
|
623  | 
||
| 
0.1.13
by Martin Pool
 Knit structure now allows for versions to include the lines present in other  | 
624  | 
|
625  | 
||
| 
0.1.95
by Martin Pool
 - preliminary merge conflict detection  | 
626  | 
def merge(self, merge_versions):  | 
627  | 
"""Automerge and mark conflicts between versions.  | 
|
628  | 
||
629  | 
        This returns a sequence, each entry describing alternatives
 | 
|
630  | 
        for a chunk of the file.  Each of the alternatives is given as
 | 
|
631  | 
        a list of lines.
 | 
|
632  | 
||
633  | 
        If there is a chunk of the file where there's no diagreement,
 | 
|
634  | 
        only one alternative is given.
 | 
|
635  | 
        """
 | 
|
636  | 
||
637  | 
        # approach: find the included versions common to all the
 | 
|
638  | 
        # merged versions
 | 
|
639  | 
raise NotImplementedError()  | 
|
640  | 
||
641  | 
||
642  | 
||
| 
0.1.21
by Martin Pool
 Start computing a delta to insert a new revision  | 
643  | 
def _delta(self, included, lines):  | 
644  | 
"""Return changes from basis to new revision.  | 
|
645  | 
||
646  | 
        The old text for comparison is the union of included revisions.
 | 
|
647  | 
||
648  | 
        This is used in inserting a new text.
 | 
|
| 
0.1.22
by Martin Pool
 Calculate delta for new versions relative to a set of parent versions.  | 
649  | 
|
| 
0.1.55
by Martin Pool
 doc  | 
650  | 
        Delta is returned as a sequence of
 | 
651  | 
        (weave1, weave2, newlines).
 | 
|
652  | 
||
653  | 
        This indicates that weave1:weave2 of the old weave should be
 | 
|
| 
0.1.22
by Martin Pool
 Calculate delta for new versions relative to a set of parent versions.  | 
654  | 
        replaced by the sequence of lines in newlines.  Note that
 | 
655  | 
        these line numbers are positions in the total weave and don't
 | 
|
656  | 
        correspond to the lines in any extracted version, or even the
 | 
|
657  | 
        extracted union of included versions.
 | 
|
658  | 
||
659  | 
        If line1=line2, this is a pure insert; if newlines=[] this is a
 | 
|
660  | 
        pure delete.  (Similar to difflib.)
 | 
|
| 
0.1.21
by Martin Pool
 Start computing a delta to insert a new revision  | 
661  | 
        """
 | 
662  | 
||
| 
0.1.1
by Martin Pool
 Check in old existing knit code.  | 
663  | 
|
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
664  | 
|
665  | 
def plan_merge(self, ver_a, ver_b):  | 
|
666  | 
"""Return pseudo-annotation indicating how the two versions merge.  | 
|
667  | 
||
668  | 
        This is computed between versions a and b and their common
 | 
|
669  | 
        base.
 | 
|
670  | 
||
671  | 
        Weave lines present in none of them are skipped entirely.
 | 
|
672  | 
        """
 | 
|
| 
926
by Martin Pool
 - update more weave code to use intsets  | 
673  | 
inc_a = self.inclusions([ver_a])  | 
674  | 
inc_b = self.inclusions([ver_b])  | 
|
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
675  | 
inc_c = inc_a & inc_b  | 
676  | 
||
677  | 
for lineno, insert, deleteset, line in self._walk():  | 
|
678  | 
if deleteset & inc_c:  | 
|
679  | 
                # killed in parent; can't be in either a or b
 | 
|
680  | 
                # not relevant to our work
 | 
|
681  | 
yield 'killed-base', line  | 
|
| 
926
by Martin Pool
 - update more weave code to use intsets  | 
682  | 
elif insert in inc_c:  | 
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
683  | 
                # was inserted in base
 | 
684  | 
killed_a = bool(deleteset & inc_a)  | 
|
685  | 
killed_b = bool(deleteset & inc_b)  | 
|
686  | 
if killed_a and killed_b:  | 
|
687  | 
yield 'killed-both', line  | 
|
688  | 
elif killed_a:  | 
|
689  | 
yield 'killed-a', line  | 
|
690  | 
elif killed_b:  | 
|
691  | 
yield 'killed-b', line  | 
|
692  | 
else:  | 
|
693  | 
yield 'unchanged', line  | 
|
| 
926
by Martin Pool
 - update more weave code to use intsets  | 
694  | 
elif insert in inc_a:  | 
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
695  | 
if deleteset & inc_a:  | 
696  | 
yield 'ghost-a', line  | 
|
697  | 
else:  | 
|
698  | 
                    # new in A; not in B
 | 
|
699  | 
yield 'new-a', line  | 
|
| 
926
by Martin Pool
 - update more weave code to use intsets  | 
700  | 
elif insert in inc_b:  | 
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
701  | 
if deleteset & inc_b:  | 
702  | 
yield 'ghost-b', line  | 
|
703  | 
else:  | 
|
704  | 
yield 'new-b', line  | 
|
705  | 
else:  | 
|
706  | 
                # not in either revision
 | 
|
707  | 
yield 'irrelevant', line  | 
|
708  | 
||
| 
919
by Martin Pool
 - more development of weave-merge  | 
709  | 
yield 'unchanged', '' # terminator  | 
710  | 
||
711  | 
||
712  | 
||
713  | 
def weave_merge(self, plan):  | 
|
714  | 
lines_a = []  | 
|
715  | 
lines_b = []  | 
|
716  | 
ch_a = ch_b = False  | 
|
717  | 
||
718  | 
for state, line in plan:  | 
|
719  | 
if state == 'unchanged' or state == 'killed-both':  | 
|
720  | 
                # resync and flush queued conflicts changes if any
 | 
|
721  | 
if not lines_a and not lines_b:  | 
|
722  | 
                    pass
 | 
|
723  | 
elif ch_a and not ch_b:  | 
|
724  | 
                    # one-sided change:                    
 | 
|
725  | 
for l in lines_a: yield l  | 
|
726  | 
elif ch_b and not ch_a:  | 
|
727  | 
for l in lines_b: yield l  | 
|
728  | 
elif lines_a == lines_b:  | 
|
729  | 
for l in lines_a: yield l  | 
|
730  | 
else:  | 
|
731  | 
yield '<<<<\n'  | 
|
732  | 
for l in lines_a: yield l  | 
|
733  | 
yield '====\n'  | 
|
734  | 
for l in lines_b: yield l  | 
|
735  | 
yield '>>>>\n'  | 
|
736  | 
||
737  | 
del lines_a[:]  | 
|
738  | 
del lines_b[:]  | 
|
739  | 
ch_a = ch_b = False  | 
|
740  | 
||
741  | 
if state == 'unchanged':  | 
|
742  | 
if line:  | 
|
743  | 
yield line  | 
|
744  | 
elif state == 'killed-a':  | 
|
745  | 
ch_a = True  | 
|
746  | 
lines_b.append(line)  | 
|
747  | 
elif state == 'killed-b':  | 
|
748  | 
ch_b = True  | 
|
749  | 
lines_a.append(line)  | 
|
750  | 
elif state == 'new-a':  | 
|
751  | 
ch_a = True  | 
|
752  | 
lines_a.append(line)  | 
|
753  | 
elif state == 'new-b':  | 
|
754  | 
ch_b = True  | 
|
755  | 
lines_b.append(line)  | 
|
756  | 
else:  | 
|
| 
920
by Martin Pool
 - add more test cases for weave_merge  | 
757  | 
assert state in ('irrelevant', 'ghost-a', 'ghost-b', 'killed-base',  | 
758  | 
'killed-both'), \  | 
|
| 
919
by Martin Pool
 - more development of weave-merge  | 
759  | 
                       state
 | 
760  | 
||
761  | 
||
762  | 
||
763  | 
||
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
764  | 
|
765  | 
||
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
766  | 
|
| 
1081
by Martin Pool
 - if weave tool is invoked with no arguments, show help  | 
767  | 
def weave_toc(w):  | 
768  | 
"""Show the weave's table-of-contents"""  | 
|
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
769  | 
print '%6s %50s %10s %10s' % ('ver', 'name', 'sha1', 'parents')  | 
770  | 
for i in (6, 50, 10, 10):  | 
|
| 
870
by Martin Pool
 - better weave info display  | 
771  | 
print '-' * i,  | 
772  | 
    print
 | 
|
| 
946
by Martin Pool
 - weave info only shows the weave headers, doesn't extract every version:  | 
773  | 
for i in range(w.numversions()):  | 
| 
0.1.91
by Martin Pool
 Update Weave.check  | 
774  | 
sha1 = w._sha1s[i]  | 
| 
1083
by Martin Pool
 - add space to store revision-id in weave files  | 
775  | 
name = w._names[i]  | 
776  | 
parent_str = ' '.join(map(str, w._parents[i]))  | 
|
777  | 
print '%6d %-50.50s %10.10s %s' % (i, name, sha1, parent_str)  | 
|
| 
0.1.88
by Martin Pool
 Add weave info command.  | 
778  | 
|
| 
869
by Martin Pool
 - more weave.py command line options  | 
779  | 
|
780  | 
||
| 
947
by Martin Pool
 - new 'weave stats' command  | 
781  | 
def weave_stats(weave_file):  | 
782  | 
from bzrlib.progress import ProgressBar  | 
|
783  | 
from bzrlib.weavefile import read_weave  | 
|
784  | 
||
785  | 
pb = ProgressBar()  | 
|
786  | 
||
787  | 
wf = file(weave_file, 'rb')  | 
|
788  | 
w = read_weave(wf)  | 
|
789  | 
    # FIXME: doesn't work on pipes
 | 
|
790  | 
weave_size = wf.tell()  | 
|
791  | 
||
792  | 
total = 0  | 
|
793  | 
vers = len(w)  | 
|
794  | 
for i in range(vers):  | 
|
795  | 
pb.update('checking sizes', i, vers)  | 
|
796  | 
for line in w.get_iter(i):  | 
|
797  | 
total += len(line)  | 
|
798  | 
||
799  | 
pb.clear()  | 
|
800  | 
||
801  | 
print 'versions %9d' % vers  | 
|
802  | 
print 'weave file %9d bytes' % weave_size  | 
|
803  | 
print 'total contents %9d bytes' % total  | 
|
804  | 
print 'compression ratio %9.2fx' % (float(total) / float(weave_size))  | 
|
| 
974.1.26
by aaron.bentley at utoronto
 merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472  | 
805  | 
if vers:  | 
806  | 
avg = total/vers  | 
|
807  | 
print 'average size %9d bytes' % avg  | 
|
808  | 
print 'relative size %9.2fx' % (float(weave_size) / float(avg))  | 
|
| 
947
by Martin Pool
 - new 'weave stats' command  | 
809  | 
|
810  | 
||
| 
869
by Martin Pool
 - more weave.py command line options  | 
811  | 
def usage():  | 
| 
871
by Martin Pool
 - add command for merge-based weave  | 
812  | 
print """bzr weave tool  | 
813  | 
||
814  | 
Experimental tool for weave algorithm.
 | 
|
815  | 
||
| 
869
by Martin Pool
 - more weave.py command line options  | 
816  | 
usage:
 | 
817  | 
    weave init WEAVEFILE
 | 
|
818  | 
        Create an empty weave file
 | 
|
819  | 
    weave get WEAVEFILE VERSION
 | 
|
820  | 
        Write out specified version.
 | 
|
821  | 
    weave check WEAVEFILE
 | 
|
822  | 
        Check consistency of all versions.
 | 
|
| 
1081
by Martin Pool
 - if weave tool is invoked with no arguments, show help  | 
823  | 
    weave toc WEAVEFILE
 | 
| 
869
by Martin Pool
 - more weave.py command line options  | 
824  | 
        Display table of contents.
 | 
| 
1084
by Martin Pool
 - weave add command needs to take a symbolic name too  | 
825  | 
    weave add WEAVEFILE NAME [BASE...] < NEWTEXT
 | 
| 
869
by Martin Pool
 - more weave.py command line options  | 
826  | 
        Add NEWTEXT, with specified parent versions.
 | 
827  | 
    weave annotate WEAVEFILE VERSION
 | 
|
828  | 
        Display origin of each line.
 | 
|
829  | 
    weave mash WEAVEFILE VERSION...
 | 
|
830  | 
        Display composite of all selected versions.
 | 
|
831  | 
    weave merge WEAVEFILE VERSION1 VERSION2 > OUT
 | 
|
832  | 
        Auto-merge two versions and display conflicts.
 | 
|
| 
871
by Martin Pool
 - add command for merge-based weave  | 
833  | 
|
834  | 
example:
 | 
|
835  | 
||
836  | 
    % weave init foo.weave
 | 
|
837  | 
    % vi foo.txt
 | 
|
| 
1084
by Martin Pool
 - weave add command needs to take a symbolic name too  | 
838  | 
    % weave add foo.weave ver0 < foo.txt
 | 
| 
871
by Martin Pool
 - add command for merge-based weave  | 
839  | 
    added version 0
 | 
840  | 
||
841  | 
    (create updated version)
 | 
|
842  | 
    % vi foo.txt
 | 
|
843  | 
    % weave get foo.weave 0 | diff -u - foo.txt
 | 
|
| 
1084
by Martin Pool
 - weave add command needs to take a symbolic name too  | 
844  | 
    % weave add foo.weave ver1 0 < foo.txt
 | 
| 
871
by Martin Pool
 - add command for merge-based weave  | 
845  | 
    added version 1
 | 
846  | 
||
847  | 
    % weave get foo.weave 0 > foo.txt       (create forked version)
 | 
|
848  | 
    % vi foo.txt
 | 
|
| 
1084
by Martin Pool
 - weave add command needs to take a symbolic name too  | 
849  | 
    % weave add foo.weave ver2 0 < foo.txt
 | 
| 
871
by Martin Pool
 - add command for merge-based weave  | 
850  | 
    added version 2
 | 
851  | 
||
852  | 
    % weave merge foo.weave 1 2 > foo.txt   (merge them)
 | 
|
853  | 
    % vi foo.txt                            (resolve conflicts)
 | 
|
| 
1084
by Martin Pool
 - weave add command needs to take a symbolic name too  | 
854  | 
    % weave add foo.weave merged 1 2 < foo.txt     (commit merged version)     
 | 
| 
871
by Martin Pool
 - add command for merge-based weave  | 
855  | 
    
 | 
| 
869
by Martin Pool
 - more weave.py command line options  | 
856  | 
"""
 | 
| 
0.1.88
by Martin Pool
 Add weave info command.  | 
857  | 
|
858  | 
||
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
859  | 
|
860  | 
def main(argv):  | 
|
861  | 
import sys  | 
|
862  | 
import os  | 
|
| 
869
by Martin Pool
 - more weave.py command line options  | 
863  | 
from weavefile import write_weave, read_weave  | 
| 
894
by Martin Pool
 - small optimization for weave extract  | 
864  | 
from bzrlib.progress import ProgressBar  | 
865  | 
||
| 
1078
by Martin Pool
 - use psyco for weave if possible  | 
866  | 
try:  | 
867  | 
import psyco  | 
|
868  | 
psyco.full()  | 
|
869  | 
except ImportError:  | 
|
870  | 
        pass
 | 
|
| 
894
by Martin Pool
 - small optimization for weave extract  | 
871  | 
|
| 
1081
by Martin Pool
 - if weave tool is invoked with no arguments, show help  | 
872  | 
if len(argv) < 2:  | 
873  | 
usage()  | 
|
874  | 
return 0  | 
|
875  | 
||
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
876  | 
cmd = argv[1]  | 
| 
869
by Martin Pool
 - more weave.py command line options  | 
877  | 
|
878  | 
def readit():  | 
|
879  | 
return read_weave(file(argv[2], 'rb'))  | 
|
880  | 
||
881  | 
if cmd == 'help':  | 
|
882  | 
usage()  | 
|
883  | 
elif cmd == 'add':  | 
|
884  | 
w = readit()  | 
|
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
885  | 
        # at the moment, based on everything in the file
 | 
| 
1084
by Martin Pool
 - weave add command needs to take a symbolic name too  | 
886  | 
name = argv[3]  | 
887  | 
parents = map(int, argv[4:])  | 
|
| 
0.1.72
by Martin Pool
 Go back to weave lines normally having newlines at the end.  | 
888  | 
lines = sys.stdin.readlines()  | 
| 
1084
by Martin Pool
 - weave add command needs to take a symbolic name too  | 
889  | 
ver = w.add(name, parents, lines)  | 
| 
869
by Martin Pool
 - more weave.py command line options  | 
890  | 
write_weave(w, file(argv[2], 'wb'))  | 
| 
1084
by Martin Pool
 - weave add command needs to take a symbolic name too  | 
891  | 
print 'added version %r %d' % (name, ver)  | 
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
892  | 
elif cmd == 'init':  | 
893  | 
fn = argv[2]  | 
|
894  | 
if os.path.exists(fn):  | 
|
895  | 
raise IOError("file exists")  | 
|
896  | 
w = Weave()  | 
|
| 
869
by Martin Pool
 - more weave.py command line options  | 
897  | 
write_weave(w, file(fn, 'wb'))  | 
898  | 
elif cmd == 'get': # get one version  | 
|
899  | 
w = readit()  | 
|
| 
0.1.94
by Martin Pool
 Fix get_iter call  | 
900  | 
sys.stdout.writelines(w.get_iter(int(argv[3])))  | 
| 
869
by Martin Pool
 - more weave.py command line options  | 
901  | 
|
902  | 
elif cmd == 'mash': # get composite  | 
|
903  | 
w = readit()  | 
|
904  | 
sys.stdout.writelines(w.mash_iter(map(int, argv[3:])))  | 
|
905  | 
||
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
906  | 
elif cmd == 'annotate':  | 
| 
869
by Martin Pool
 - more weave.py command line options  | 
907  | 
w = readit()  | 
| 
0.1.72
by Martin Pool
 Go back to weave lines normally having newlines at the end.  | 
908  | 
        # newline is added to all lines regardless; too hard to get
 | 
909  | 
        # reasonable formatting otherwise
 | 
|
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
910  | 
lasto = None  | 
911  | 
for origin, text in w.annotate(int(argv[3])):  | 
|
| 
0.1.72
by Martin Pool
 Go back to weave lines normally having newlines at the end.  | 
912  | 
text = text.rstrip('\r\n')  | 
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
913  | 
if origin == lasto:  | 
914  | 
print ' | %s' % (text)  | 
|
915  | 
else:  | 
|
916  | 
print '%5d | %s' % (origin, text)  | 
|
917  | 
lasto = origin  | 
|
| 
871
by Martin Pool
 - add command for merge-based weave  | 
918  | 
|
| 
1081
by Martin Pool
 - if weave tool is invoked with no arguments, show help  | 
919  | 
elif cmd == 'toc':  | 
920  | 
weave_toc(readit())  | 
|
| 
947
by Martin Pool
 - new 'weave stats' command  | 
921  | 
|
922  | 
elif cmd == 'stats':  | 
|
923  | 
weave_stats(argv[2])  | 
|
| 
871
by Martin Pool
 - add command for merge-based weave  | 
924  | 
|
| 
0.1.91
by Martin Pool
 Update Weave.check  | 
925  | 
elif cmd == 'check':  | 
| 
869
by Martin Pool
 - more weave.py command line options  | 
926  | 
w = readit()  | 
| 
894
by Martin Pool
 - small optimization for weave extract  | 
927  | 
pb = ProgressBar()  | 
928  | 
w.check(pb)  | 
|
929  | 
pb.clear()  | 
|
| 
938
by Martin Pool
 - various optimizations to weave add code  | 
930  | 
print '%d versions ok' % w.numversions()  | 
| 
871
by Martin Pool
 - add command for merge-based weave  | 
931  | 
|
| 
892
by Martin Pool
 - weave stores only direct parents, and calculates and memoizes expansion as needed  | 
932  | 
elif cmd == 'inclusions':  | 
933  | 
w = readit()  | 
|
934  | 
print ' '.join(map(str, w.inclusions([int(argv[3])])))  | 
|
935  | 
||
936  | 
elif cmd == 'parents':  | 
|
937  | 
w = readit()  | 
|
| 
944
by Martin Pool
 - refactor member names in Weave code  | 
938  | 
print ' '.join(map(str, w._parents[int(argv[3])]))  | 
| 
892
by Martin Pool
 - weave stores only direct parents, and calculates and memoizes expansion as needed  | 
939  | 
|
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
940  | 
elif cmd == 'plan-merge':  | 
941  | 
w = readit()  | 
|
942  | 
for state, line in w.plan_merge(int(argv[3]), int(argv[4])):  | 
|
| 
919
by Martin Pool
 - more development of weave-merge  | 
943  | 
if line:  | 
944  | 
print '%14s | %s' % (state, line),  | 
|
| 
918
by Martin Pool
 - start doing new weave-merge algorithm  | 
945  | 
|
| 
871
by Martin Pool
 - add command for merge-based weave  | 
946  | 
elif cmd == 'merge':  | 
| 
919
by Martin Pool
 - more development of weave-merge  | 
947  | 
w = readit()  | 
948  | 
p = w.plan_merge(int(argv[3]), int(argv[4]))  | 
|
949  | 
sys.stdout.writelines(w.weave_merge(p))  | 
|
950  | 
||
951  | 
elif cmd == 'mash-merge':  | 
|
| 
871
by Martin Pool
 - add command for merge-based weave  | 
952  | 
if len(argv) != 5:  | 
953  | 
usage()  | 
|
954  | 
return 1  | 
|
955  | 
||
956  | 
w = readit()  | 
|
957  | 
v1, v2 = map(int, argv[3:5])  | 
|
958  | 
||
959  | 
basis = w.inclusions([v1]).intersection(w.inclusions([v2]))  | 
|
960  | 
||
961  | 
base_lines = list(w.mash_iter(basis))  | 
|
962  | 
a_lines = list(w.get(v1))  | 
|
963  | 
b_lines = list(w.get(v2))  | 
|
964  | 
||
965  | 
from bzrlib.merge3 import Merge3  | 
|
966  | 
m3 = Merge3(base_lines, a_lines, b_lines)  | 
|
967  | 
||
968  | 
name_a = 'version %d' % v1  | 
|
969  | 
name_b = 'version %d' % v2  | 
|
970  | 
sys.stdout.writelines(m3.merge_lines(name_a=name_a, name_b=name_b))  | 
|
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
971  | 
else:  | 
972  | 
raise ValueError('unknown command %r' % cmd)  | 
|
973  | 
||
974  | 
||
| 
1076
by Martin Pool
 - add code to run weave utility under profiler  | 
975  | 
|
976  | 
def profile_main(argv):  | 
|
977  | 
import tempfile, hotshot, hotshot.stats  | 
|
978  | 
||
979  | 
prof_f = tempfile.NamedTemporaryFile()  | 
|
980  | 
||
981  | 
prof = hotshot.Profile(prof_f.name)  | 
|
982  | 
||
983  | 
ret = prof.runcall(main, argv)  | 
|
984  | 
prof.close()  | 
|
985  | 
||
986  | 
stats = hotshot.stats.load(prof_f.name)  | 
|
987  | 
    #stats.strip_dirs()
 | 
|
| 
1079
by Martin Pool
 - weavefile can just use lists for read-in ancestry, not frozensets  | 
988  | 
stats.sort_stats('cumulative')  | 
| 
1076
by Martin Pool
 - add code to run weave utility under profiler  | 
989  | 
    ## XXX: Might like to write to stderr or the trace file instead but
 | 
990  | 
    ## print_stats seems hardcoded to stdout
 | 
|
991  | 
stats.print_stats(20)  | 
|
992  | 
||
993  | 
return ret  | 
|
994  | 
||
995  | 
||
| 
0.1.62
by Martin Pool
 Lame command-line client for reading and writing weaves.  | 
996  | 
if __name__ == '__main__':  | 
997  | 
import sys  | 
|
| 
1081
by Martin Pool
 - if weave tool is invoked with no arguments, show help  | 
998  | 
if '--profile' in sys.argv:  | 
| 
1078
by Martin Pool
 - use psyco for weave if possible  | 
999  | 
args = sys.argv[:]  | 
| 
1081
by Martin Pool
 - if weave tool is invoked with no arguments, show help  | 
1000  | 
args.remove('--profile')  | 
| 
1078
by Martin Pool
 - use psyco for weave if possible  | 
1001  | 
sys.exit(profile_main(args))  | 
1002  | 
else:  | 
|
1003  | 
sys.exit(main(sys.argv))  | 
|
| 
1076
by Martin Pool
 - add code to run weave utility under profiler  | 
1004  |