Context Navigation

← Previous Revision
Latest Revision
Next Revision →
Blame
Revision Log

perltie.pod@ 14489

Last change on this file since 14489 was 14489, checked in by oranfry, 17 years ago
upgrading to perl 5.8
File size: 35.7 KB

Line
1	=head1 NAME
2	X<tie>
3
4	perltie - how to hide an object class in a simple variable
5
6	=head1 SYNOPSIS
7
8	tie VARIABLE, CLASSNAME, LIST
9
10	$object = tied VARIABLE
11
12	untie VARIABLE
13
14	=head1 DESCRIPTION
15
16	Prior to release 5.0 of Perl, a programmer could use dbmopen()
17	to connect an on-disk database in the standard Unix dbm(3x)
18	format magically to a %HASH in their program. However, their Perl was either
19	built with one particular dbm library or another, but not both, and
20	you couldn't extend this mechanism to other packages or types of variables.
21
22	Now you can.
23
24	The tie() function binds a variable to a class (package) that will provide
25	the implementation for access methods for that variable. Once this magic
26	has been performed, accessing a tied variable automatically triggers
27	method calls in the proper class. The complexity of the class is
28	hidden behind magic methods calls. The method names are in ALL CAPS,
29	which is a convention that Perl uses to indicate that they're called
30	implicitly rather than explicitly--just like the BEGIN() and END()
31	functions.
32
33	In the tie() call, C<VARIABLE> is the name of the variable to be
34	enchanted. C<CLASSNAME> is the name of a class implementing objects of
35	the correct type. Any additional arguments in the C<LIST> are passed to
36	the appropriate constructor method for that class--meaning TIESCALAR(),
37	TIEARRAY(), TIEHASH(), or TIEHANDLE(). (Typically these are arguments
38	such as might be passed to the dbminit() function of C.) The object
39	returned by the "new" method is also returned by the tie() function,
40	which would be useful if you wanted to access other methods in
41	C<CLASSNAME>. (You don't actually have to return a reference to a right
42	"type" (e.g., HASH or C<CLASSNAME>) so long as it's a properly blessed
43	object.) You can also retrieve a reference to the underlying object
44	using the tied() function.
45
46	Unlike dbmopen(), the tie() function will not C<use> or C<require> a module
47	for you--you need to do that explicitly yourself.
48
49	=head2 Tying Scalars
50	X<scalar, tying>
51
52	A class implementing a tied scalar should define the following methods:
53	TIESCALAR, FETCH, STORE, and possibly UNTIE and/or DESTROY.
54
55	Let's look at each in turn, using as an example a tie class for
56	scalars that allows the user to do something like:
57
58	tie $his_speed, 'Nice', getppid();
59	tie $my_speed, 'Nice', $$;
60
61	And now whenever either of those variables is accessed, its current
62	system priority is retrieved and returned. If those variables are set,
63	then the process's priority is changed!
64
65	We'll use Jarkko Hietaniemi <F<[email protected]>>'s BSD::Resource class (not
66	included) to access the PRIO_PROCESS, PRIO_MIN, and PRIO_MAX constants
67	from your system, as well as the getpriority() and setpriority() system
68	calls. Here's the preamble of the class.
69
70	package Nice;
71	use Carp;
72	use BSD::Resource;
73	use strict;
74	$Nice::DEBUG = 0 unless defined $Nice::DEBUG;
75
76	=over 4
77
78	=item TIESCALAR classname, LIST
79	X<TIESCALAR>
80
81	This is the constructor for the class. That means it is
82	expected to return a blessed reference to a new scalar
83	(probably anonymous) that it's creating. For example:
84
85	sub TIESCALAR {
86	my $class = shift;
87	my $pid = shift \|\| $$; # 0 means me
88
89	if ($pid !~ /^\d+$/) {
90	carp "Nice::Tie::Scalar got non-numeric pid $pid" if $^W;
91	return undef;
92	}
93
94	unless (kill 0, $pid) { # EPERM or ERSCH, no doubt
95	carp "Nice::Tie::Scalar got bad pid $pid: $!" if $^W;
96	return undef;
97	}
98
99	return bless \$pid, $class;
100	}
101
102	This tie class has chosen to return an error rather than raising an
103	exception if its constructor should fail. While this is how dbmopen() works,
104	other classes may well not wish to be so forgiving. It checks the global
105	variable C<$^W> to see whether to emit a bit of noise anyway.
106
107	=item FETCH this
108	X<FETCH>
109
110	This method will be triggered every time the tied variable is accessed
111	(read). It takes no arguments beyond its self reference, which is the
112	object representing the scalar we're dealing with. Because in this case
113	we're using just a SCALAR ref for the tied scalar object, a simple $$self
114	allows the method to get at the real value stored there. In our example
115	below, that real value is the process ID to which we've tied our variable.
116
117	sub FETCH {
118	my $self = shift;
119	confess "wrong type" unless ref $self;
120	croak "usage error" if @_;
121	my $nicety;
122	local($!) = 0;
123	$nicety = getpriority(PRIO_PROCESS, $$self);
124	if ($!) { croak "getpriority failed: $!" }
125	return $nicety;
126	}
127
128	This time we've decided to blow up (raise an exception) if the renice
129	fails--there's no place for us to return an error otherwise, and it's
130	probably the right thing to do.
131
132	=item STORE this, value
133	X<STORE>
134
135	This method will be triggered every time the tied variable is set
136	(assigned). Beyond its self reference, it also expects one (and only one)
137	argument--the new value the user is trying to assign. Don't worry about
138	returning a value from STORE -- the semantic of assignment returning the
139	assigned value is implemented with FETCH.
140
141	sub STORE {
142	my $self = shift;
143	confess "wrong type" unless ref $self;
144	my $new_nicety = shift;
145	croak "usage error" if @_;
146
147	if ($new_nicety < PRIO_MIN) {
148	carp sprintf
149	"WARNING: priority %d less than minimum system priority %d",
150	$new_nicety, PRIO_MIN if $^W;
151	$new_nicety = PRIO_MIN;
152	}
153
154	if ($new_nicety > PRIO_MAX) {
155	carp sprintf
156	"WARNING: priority %d greater than maximum system priority %d",
157	$new_nicety, PRIO_MAX if $^W;
158	$new_nicety = PRIO_MAX;
159	}
160
161	unless (defined setpriority(PRIO_PROCESS, $$self, $new_nicety)) {
162	confess "setpriority failed: $!";
163	}
164	}
165
166	=item UNTIE this
167	X<UNTIE>
168
169	This method will be triggered when the C<untie> occurs. This can be useful
170	if the class needs to know when no further calls will be made. (Except DESTROY
171	of course.) See L<The C<untie> Gotcha> below for more details.
172
173	=item DESTROY this
174	X<DESTROY>
175
176	This method will be triggered when the tied variable needs to be destructed.
177	As with other object classes, such a method is seldom necessary, because Perl
178	deallocates its moribund object's memory for you automatically--this isn't
179	C++, you know. We'll use a DESTROY method here for debugging purposes only.
180
181	sub DESTROY {
182	my $self = shift;
183	confess "wrong type" unless ref $self;
184	carp "[ Nice::DESTROY pid $$self ]" if $Nice::DEBUG;
185	}
186
187	=back
188
189	That's about all there is to it. Actually, it's more than all there
190	is to it, because we've done a few nice things here for the sake
191	of completeness, robustness, and general aesthetics. Simpler
192	TIESCALAR classes are certainly possible.
193
194	=head2 Tying Arrays
195	X<array, tying>
196
197	A class implementing a tied ordinary array should define the following
198	methods: TIEARRAY, FETCH, STORE, FETCHSIZE, STORESIZE and perhaps UNTIE and/or DESTROY.
199
200	FETCHSIZE and STORESIZE are used to provide C<$#array> and
201	equivalent C<scalar(@array)> access.
202
203	The methods POP, PUSH, SHIFT, UNSHIFT, SPLICE, DELETE, and EXISTS are
204	required if the perl operator with the corresponding (but lowercase) name
205	is to operate on the tied array. The B<Tie::Array> class can be used as a
206	base class to implement the first five of these in terms of the basic
207	methods above. The default implementations of DELETE and EXISTS in
208	B<Tie::Array> simply C<croak>.
209
210	In addition EXTEND will be called when perl would have pre-extended
211	allocation in a real array.
212
213	For this discussion, we'll implement an array whose elements are a fixed
214	size at creation. If you try to create an element larger than the fixed
215	size, you'll take an exception. For example:
216
217	use FixedElem_Array;
218	tie @array, 'FixedElem_Array', 3;
219	$array[0] = 'cat'; # ok.
220	$array[1] = 'dogs'; # exception, length('dogs') > 3.
221
222	The preamble code for the class is as follows:
223
224	package FixedElem_Array;
225	use Carp;
226	use strict;
227
228	=over 4
229
230	=item TIEARRAY classname, LIST
231	X<TIEARRAY>
232
233	This is the constructor for the class. That means it is expected to
234	return a blessed reference through which the new array (probably an
235	anonymous ARRAY ref) will be accessed.
236
237	In our example, just to show you that you don't I<really> have to return an
238	ARRAY reference, we'll choose a HASH reference to represent our object.
239	A HASH works out well as a generic record type: the C<{ELEMSIZE}> field will
240	store the maximum element size allowed, and the C<{ARRAY}> field will hold the
241	true ARRAY ref. If someone outside the class tries to dereference the
242	object returned (doubtless thinking it an ARRAY ref), they'll blow up.
243	This just goes to show you that you should respect an object's privacy.
244
245	sub TIEARRAY {
246	my $class = shift;
247	my $elemsize = shift;
248	if ( @_ \|\| $elemsize =~ /\D/ ) {
249	croak "usage: tie ARRAY, '" . __PACKAGE__ . "', elem_size";
250	}
251	return bless {
252	ELEMSIZE => $elemsize,
253	ARRAY => [],
254	}, $class;
255	}
256
257	=item FETCH this, index
258	X<FETCH>
259
260	This method will be triggered every time an individual element the tied array
261	is accessed (read). It takes one argument beyond its self reference: the
262	index whose value we're trying to fetch.
263
264	sub FETCH {
265	my $self = shift;
266	my $index = shift;
267	return $self->{ARRAY}->[$index];
268	}
269
270	If a negative array index is used to read from an array, the index
271	will be translated to a positive one internally by calling FETCHSIZE
272	before being passed to FETCH. You may disable this feature by
273	assigning a true value to the variable C<$NEGATIVE_INDICES> in the
274	tied array class.
275
276	As you may have noticed, the name of the FETCH method (et al.) is the same
277	for all accesses, even though the constructors differ in names (TIESCALAR
278	vs TIEARRAY). While in theory you could have the same class servicing
279	several tied types, in practice this becomes cumbersome, and it's easiest
280	to keep them at simply one tie type per class.
281
282	=item STORE this, index, value
283	X<STORE>
284
285	This method will be triggered every time an element in the tied array is set
286	(written). It takes two arguments beyond its self reference: the index at
287	which we're trying to store something and the value we're trying to put
288	there.
289
290	In our example, C<undef> is really C<$self-E<gt>{ELEMSIZE}> number of
291	spaces so we have a little more work to do here:
292
293	sub STORE {
294	my $self = shift;
295	my( $index, $value ) = @_;
296	if ( length $value > $self->{ELEMSIZE} ) {
297	croak "length of $value is greater than $self->{ELEMSIZE}";
298	}
299	# fill in the blanks
300	$self->EXTEND( $index ) if $index > $self->FETCHSIZE();
301	# right justify to keep element size for smaller elements
302	$self->{ARRAY}->[$index] = sprintf "%$self->{ELEMSIZE}s", $value;
303	}
304
305	Negative indexes are treated the same as with FETCH.
306
307	=item FETCHSIZE this
308	X<FETCHSIZE>
309
310	Returns the total number of items in the tied array associated with
311	object I<this>. (Equivalent to C<scalar(@array)>). For example:
312
313	sub FETCHSIZE {
314	my $self = shift;
315	return scalar @{$self->{ARRAY}};
316	}
317
318	=item STORESIZE this, count
319	X<STORESIZE>
320
321	Sets the total number of items in the tied array associated with
322	object I<this> to be I<count>. If this makes the array larger then
323	class's mapping of C<undef> should be returned for new positions.
324	If the array becomes smaller then entries beyond count should be
325	deleted.
326
327	In our example, 'undef' is really an element containing
328	C<$self-E<gt>{ELEMSIZE}> number of spaces. Observe:
329
330	sub STORESIZE {
331	my $self = shift;
332	my $count = shift;
333	if ( $count > $self->FETCHSIZE() ) {
334	foreach ( $count - $self->FETCHSIZE() .. $count ) {
335	$self->STORE( $_, '' );
336	}
337	} elsif ( $count < $self->FETCHSIZE() ) {
338	foreach ( 0 .. $self->FETCHSIZE() - $count - 2 ) {
339	$self->POP();
340	}
341	}
342	}
343
344	=item EXTEND this, count
345	X<EXTEND>
346
347	Informative call that array is likely to grow to have I<count> entries.
348	Can be used to optimize allocation. This method need do nothing.
349
350	In our example, we want to make sure there are no blank (C<undef>)
351	entries, so C<EXTEND> will make use of C<STORESIZE> to fill elements
352	as needed:
353
354	sub EXTEND {
355	my $self = shift;
356	my $count = shift;
357	$self->STORESIZE( $count );
358	}
359
360	=item EXISTS this, key
361	X<EXISTS>
362
363	Verify that the element at index I<key> exists in the tied array I<this>.
364
365	In our example, we will determine that if an element consists of
366	C<$self-E<gt>{ELEMSIZE}> spaces only, it does not exist:
367
368	sub EXISTS {
369	my $self = shift;
370	my $index = shift;
371	return 0 if ! defined $self->{ARRAY}->[$index] \|\|
372	$self->{ARRAY}->[$index] eq ' ' x $self->{ELEMSIZE};
373	return 1;
374	}
375
376	=item DELETE this, key
377	X<DELETE>
378
379	Delete the element at index I<key> from the tied array I<this>.
380
381	In our example, a deleted item is C<$self-E<gt>{ELEMSIZE}> spaces:
382
383	sub DELETE {
384	my $self = shift;
385	my $index = shift;
386	return $self->STORE( $index, '' );
387	}
388
389	=item CLEAR this
390	X<CLEAR>
391
392	Clear (remove, delete, ...) all values from the tied array associated with
393	object I<this>. For example:
394
395	sub CLEAR {
396	my $self = shift;
397	return $self->{ARRAY} = [];
398	}
399
400	=item PUSH this, LIST
401	X<PUSH>
402
403	Append elements of I<LIST> to the array. For example:
404
405	sub PUSH {
406	my $self = shift;
407	my @list = @_;
408	my $last = $self->FETCHSIZE();
409	$self->STORE( $last + $_, $list[$_] ) foreach 0 .. $#list;
410	return $self->FETCHSIZE();
411	}
412
413	=item POP this
414	X<POP>
415
416	Remove last element of the array and return it. For example:
417
418	sub POP {
419	my $self = shift;
420	return pop @{$self->{ARRAY}};
421	}
422
423	=item SHIFT this
424	X<SHIFT>
425
426	Remove the first element of the array (shifting other elements down)
427	and return it. For example:
428
429	sub SHIFT {
430	my $self = shift;
431	return shift @{$self->{ARRAY}};
432	}
433
434	=item UNSHIFT this, LIST
435	X<UNSHIFT>
436
437	Insert LIST elements at the beginning of the array, moving existing elements
438	up to make room. For example:
439
440	sub UNSHIFT {
441	my $self = shift;
442	my @list = @_;
443	my $size = scalar( @list );
444	# make room for our list
445	@{$self->{ARRAY}}[ $size .. $#{$self->{ARRAY}} + $size ]
446	= @{$self->{ARRAY}};
447	$self->STORE( $_, $list[$_] ) foreach 0 .. $#list;
448	}
449
450	=item SPLICE this, offset, length, LIST
451	X<SPLICE>
452
453	Perform the equivalent of C<splice> on the array.
454
455	I<offset> is optional and defaults to zero, negative values count back
456	from the end of the array.
457
458	I<length> is optional and defaults to rest of the array.
459
460	I<LIST> may be empty.
461
462	Returns a list of the original I<length> elements at I<offset>.
463
464	In our example, we'll use a little shortcut if there is a I<LIST>:
465
466	sub SPLICE {
467	my $self = shift;
468	my $offset = shift \|\| 0;
469	my $length = shift \|\| $self->FETCHSIZE() - $offset;
470	my @list = ();
471	if ( @_ ) {
472	tie @list, __PACKAGE__, $self->{ELEMSIZE};
473	@list = @_;
474	}
475	return splice @{$self->{ARRAY}}, $offset, $length, @list;
476	}
477
478	=item UNTIE this
479	X<UNTIE>
480
481	Will be called when C<untie> happens. (See L<The C<untie> Gotcha> below.)
482
483	=item DESTROY this
484	X<DESTROY>
485
486	This method will be triggered when the tied variable needs to be destructed.
487	As with the scalar tie class, this is almost never needed in a
488	language that does its own garbage collection, so this time we'll
489	just leave it out.
490
491	=back
492
493	=head2 Tying Hashes
494	X<hash, tying>
495
496	Hashes were the first Perl data type to be tied (see dbmopen()). A class
497	implementing a tied hash should define the following methods: TIEHASH is
498	the constructor. FETCH and STORE access the key and value pairs. EXISTS
499	reports whether a key is present in the hash, and DELETE deletes one.
500	CLEAR empties the hash by deleting all the key and value pairs. FIRSTKEY
501	and NEXTKEY implement the keys() and each() functions to iterate over all
502	the keys. SCALAR is triggered when the tied hash is evaluated in scalar
503	context. UNTIE is called when C<untie> happens, and DESTROY is called when
504	the tied variable is garbage collected.
505
506	If this seems like a lot, then feel free to inherit from merely the
507	standard Tie::StdHash module for most of your methods, redefining only the
508	interesting ones. See L<Tie::Hash> for details.
509
510	Remember that Perl distinguishes between a key not existing in the hash,
511	and the key existing in the hash but having a corresponding value of
512	C<undef>. The two possibilities can be tested with the C<exists()> and
513	C<defined()> functions.
514
515	Here's an example of a somewhat interesting tied hash class: it gives you
516	a hash representing a particular user's dot files. You index into the hash
517	with the name of the file (minus the dot) and you get back that dot file's
518	contents. For example:
519
520	use DotFiles;
521	tie %dot, 'DotFiles';
522	if ( $dot{profile} =~ /MANPATH/ \|\|
523	$dot{login} =~ /MANPATH/ \|\|
524	$dot{cshrc} =~ /MANPATH/ )
525	{
526	print "you seem to set your MANPATH\n";
527	}
528
529	Or here's another sample of using our tied class:
530
531	tie %him, 'DotFiles', 'daemon';
532	foreach $f ( keys %him ) {
533	printf "daemon dot file %s is size %d\n",
534	$f, length $him{$f};
535	}
536
537	In our tied hash DotFiles example, we use a regular
538	hash for the object containing several important
539	fields, of which only the C<{LIST}> field will be what the
540	user thinks of as the real hash.
541
542	=over 5
543
544	=item USER
545
546	whose dot files this object represents
547
548	=item HOME
549
550	where those dot files live
551
552	=item CLOBBER
553
554	whether we should try to change or remove those dot files
555
556	=item LIST
557
558	the hash of dot file names and content mappings
559
560	=back
561
562	Here's the start of F<Dotfiles.pm>:
563
564	package DotFiles;
565	use Carp;
566	sub whowasi { (caller(1))[3] . '()' }
567	my $DEBUG = 0;
568	sub debug { $DEBUG = @_ ? shift : 1 }
569
570	For our example, we want to be able to emit debugging info to help in tracing
571	during development. We keep also one convenience function around
572	internally to help print out warnings; whowasi() returns the function name
573	that calls it.
574
575	Here are the methods for the DotFiles tied hash.
576
577	=over 4
578
579	=item TIEHASH classname, LIST
580	X<TIEHASH>
581
582	This is the constructor for the class. That means it is expected to
583	return a blessed reference through which the new object (probably but not
584	necessarily an anonymous hash) will be accessed.
585
586	Here's the constructor:
587
588	sub TIEHASH {
589	my $self = shift;
590	my $user = shift \|\| $>;
591	my $dotdir = shift \|\| '';
592	croak "usage: @{[&whowasi]} [USER [DOTDIR]]" if @_;
593	$user = getpwuid($user) if $user =~ /^\d+$/;
594	my $dir = (getpwnam($user))[7]
595	\|\| croak "@{[&whowasi]}: no user $user";
596	$dir .= "/$dotdir" if $dotdir;
597
598	my $node = {
599	USER => $user,
600	HOME => $dir,
601	LIST => {},
602	CLOBBER => 0,
603	};
604
605	opendir(DIR, $dir)
606	\|\| croak "@{[&whowasi]}: can't opendir $dir: $!";
607	foreach $dot ( grep /^\./ && -f "$dir/$_", readdir(DIR)) {
608	$dot =~ s/^\.//;
609	$node->{LIST}{$dot} = undef;
610	}
611	closedir DIR;
612	return bless $node, $self;
613	}
614
615	It's probably worth mentioning that if you're going to filetest the
616	return values out of a readdir, you'd better prepend the directory
617	in question. Otherwise, because we didn't chdir() there, it would
618	have been testing the wrong file.
619
620	=item FETCH this, key
621	X<FETCH>
622
623	This method will be triggered every time an element in the tied hash is
624	accessed (read). It takes one argument beyond its self reference: the key
625	whose value we're trying to fetch.
626
627	Here's the fetch for our DotFiles example.
628
629	sub FETCH {
630	carp &whowasi if $DEBUG;
631	my $self = shift;
632	my $dot = shift;
633	my $dir = $self->{HOME};
634	my $file = "$dir/.$dot";
635
636	unless (exists $self->{LIST}->{$dot} \|\| -f $file) {
637	carp "@{[&whowasi]}: no $dot file" if $DEBUG;
638	return undef;
639	}
640
641	if (defined $self->{LIST}->{$dot}) {
642	return $self->{LIST}->{$dot};
643	} else {
644	return $self->{LIST}->{$dot} = `cat $dir/.$dot`;
645	}
646	}
647
648	It was easy to write by having it call the Unix cat(1) command, but it
649	would probably be more portable to open the file manually (and somewhat
650	more efficient). Of course, because dot files are a Unixy concept, we're
651	not that concerned.
652
653	=item STORE this, key, value
654	X<STORE>
655
656	This method will be triggered every time an element in the tied hash is set
657	(written). It takes two arguments beyond its self reference: the index at
658	which we're trying to store something, and the value we're trying to put
659	there.
660
661	Here in our DotFiles example, we'll be careful not to let
662	them try to overwrite the file unless they've called the clobber()
663	method on the original object reference returned by tie().
664
665	sub STORE {
666	carp &whowasi if $DEBUG;
667	my $self = shift;
668	my $dot = shift;
669	my $value = shift;
670	my $file = $self->{HOME} . "/.$dot";
671	my $user = $self->{USER};
672
673	croak "@{[&whowasi]}: $file not clobberable"
674	unless $self->{CLOBBER};
675
676	open(F, "> $file") \|\| croak "can't open $file: $!";
677	print F $value;
678	close(F);
679	}
680
681	If they wanted to clobber something, they might say:
682
683	$ob = tie %daemon_dots, 'daemon';
684	$ob->clobber(1);
685	$daemon_dots{signature} = "A true daemon\n";
686
687	Another way to lay hands on a reference to the underlying object is to
688	use the tied() function, so they might alternately have set clobber
689	using:
690
691	tie %daemon_dots, 'daemon';
692	tied(%daemon_dots)->clobber(1);
693
694	The clobber method is simply:
695
696	sub clobber {
697	my $self = shift;
698	$self->{CLOBBER} = @_ ? shift : 1;
699	}
700
701	=item DELETE this, key
702	X<DELETE>
703
704	This method is triggered when we remove an element from the hash,
705	typically by using the delete() function. Again, we'll
706	be careful to check whether they really want to clobber files.
707
708	sub DELETE {
709	carp &whowasi if $DEBUG;
710
711	my $self = shift;
712	my $dot = shift;
713	my $file = $self->{HOME} . "/.$dot";
714	croak "@{[&whowasi]}: won't remove file $file"
715	unless $self->{CLOBBER};
716	delete $self->{LIST}->{$dot};
717	my $success = unlink($file);
718	carp "@{[&whowasi]}: can't unlink $file: $!" unless $success;
719	$success;
720	}
721
722	The value returned by DELETE becomes the return value of the call
723	to delete(). If you want to emulate the normal behavior of delete(),
724	you should return whatever FETCH would have returned for this key.
725	In this example, we have chosen instead to return a value which tells
726	the caller whether the file was successfully deleted.
727
728	=item CLEAR this
729	X<CLEAR>
730
731	This method is triggered when the whole hash is to be cleared, usually by
732	assigning the empty list to it.
733
734	In our example, that would remove all the user's dot files! It's such a
735	dangerous thing that they'll have to set CLOBBER to something higher than
736	1 to make it happen.
737
738	sub CLEAR {
739	carp &whowasi if $DEBUG;
740	my $self = shift;
741	croak "@{[&whowasi]}: won't remove all dot files for $self->{USER}"
742	unless $self->{CLOBBER} > 1;
743	my $dot;
744	foreach $dot ( keys %{$self->{LIST}}) {
745	$self->DELETE($dot);
746	}
747	}
748
749	=item EXISTS this, key
750	X<EXISTS>
751
752	This method is triggered when the user uses the exists() function
753	on a particular hash. In our example, we'll look at the C<{LIST}>
754	hash element for this:
755
756	sub EXISTS {
757	carp &whowasi if $DEBUG;
758	my $self = shift;
759	my $dot = shift;
760	return exists $self->{LIST}->{$dot};
761	}
762
763	=item FIRSTKEY this
764	X<FIRSTKEY>
765
766	This method will be triggered when the user is going
767	to iterate through the hash, such as via a keys() or each()
768	call.
769
770	sub FIRSTKEY {
771	carp &whowasi if $DEBUG;
772	my $self = shift;
773	my $a = keys %{$self->{LIST}}; # reset each() iterator
774	each %{$self->{LIST}}
775	}
776
777	=item NEXTKEY this, lastkey
778	X<NEXTKEY>
779
780	This method gets triggered during a keys() or each() iteration. It has a
781	second argument which is the last key that had been accessed. This is
782	useful if you're carrying about ordering or calling the iterator from more
783	than one sequence, or not really storing things in a hash anywhere.
784
785	For our example, we're using a real hash so we'll do just the simple
786	thing, but we'll have to go through the LIST field indirectly.
787
788	sub NEXTKEY {
789	carp &whowasi if $DEBUG;
790	my $self = shift;
791	return each %{ $self->{LIST} }
792	}
793
794	=item SCALAR this
795	X<SCALAR>
796
797	This is called when the hash is evaluated in scalar context. In order
798	to mimic the behaviour of untied hashes, this method should return a
799	false value when the tied hash is considered empty. If this method does
800	not exist, perl will make some educated guesses and return true when
801	the hash is inside an iteration. If this isn't the case, FIRSTKEY is
802	called, and the result will be a false value if FIRSTKEY returns the empty
803	list, true otherwise.
804
805	However, you should B<not> blindly rely on perl always doing the right
806	thing. Particularly, perl will mistakenly return true when you clear the
807	hash by repeatedly calling DELETE until it is empty. You are therefore
808	advised to supply your own SCALAR method when you want to be absolutely
809	sure that your hash behaves nicely in scalar context.
810
811	In our example we can just call C<scalar> on the underlying hash
812	referenced by C<$self-E<gt>{LIST}>:
813
814	sub SCALAR {
815	carp &whowasi if $DEBUG;
816	my $self = shift;
817	return scalar %{ $self->{LIST} }
818	}
819
820	=item UNTIE this
821	X<UNTIE>
822
823	This is called when C<untie> occurs. See L<The C<untie> Gotcha> below.
824
825	=item DESTROY this
826	X<DESTROY>
827
828	This method is triggered when a tied hash is about to go out of
829	scope. You don't really need it unless you're trying to add debugging
830	or have auxiliary state to clean up. Here's a very simple function:
831
832	sub DESTROY {
833	carp &whowasi if $DEBUG;
834	}
835
836	=back
837
838	Note that functions such as keys() and values() may return huge lists
839	when used on large objects, like DBM files. You may prefer to use the
840	each() function to iterate over such. Example:
841
842	# print out history file offsets
843	use NDBM_File;
844	tie(%HIST, 'NDBM_File', '/usr/lib/news/history', 1, 0);
845	while (($key,$val) = each %HIST) {
846	print $key, ' = ', unpack('L',$val), "\n";
847	}
848	untie(%HIST);
849
850	=head2 Tying FileHandles
851	X<filehandle, tying>
852
853	This is partially implemented now.
854
855	A class implementing a tied filehandle should define the following
856	methods: TIEHANDLE, at least one of PRINT, PRINTF, WRITE, READLINE, GETC,
857	READ, and possibly CLOSE, UNTIE and DESTROY. The class can also provide: BINMODE,
858	OPEN, EOF, FILENO, SEEK, TELL - if the corresponding perl operators are
859	used on the handle.
860
861	When STDERR is tied, its PRINT method will be called to issue warnings
862	and error messages. This feature is temporarily disabled during the call,
863	which means you can use C<warn()> inside PRINT without starting a recursive
864	loop. And just like C<__WARN__> and C<__DIE__> handlers, STDERR's PRINT
865	method may be called to report parser errors, so the caveats mentioned under
866	L<perlvar/%SIG> apply.
867
868	All of this is especially useful when perl is embedded in some other
869	program, where output to STDOUT and STDERR may have to be redirected
870	in some special way. See nvi and the Apache module for examples.
871
872	In our example we're going to create a shouting handle.
873
874	package Shout;
875
876	=over 4
877
878	=item TIEHANDLE classname, LIST
879	X<TIEHANDLE>
880
881	This is the constructor for the class. That means it is expected to
882	return a blessed reference of some sort. The reference can be used to
883	hold some internal information.
884
885	sub TIEHANDLE { print "<shout>\n"; my $i; bless \$i, shift }
886
887	=item WRITE this, LIST
888	X<WRITE>
889
890	This method will be called when the handle is written to via the
891	C<syswrite> function.
892
893	sub WRITE {
894	$r = shift;
895	my($buf,$len,$offset) = @_;
896	print "WRITE called, \$buf=$buf, \$len=$len, \$offset=$offset";
897	}
898
899	=item PRINT this, LIST
900	X<PRINT>
901
902	This method will be triggered every time the tied handle is printed to
903	with the C<print()> function.
904	Beyond its self reference it also expects the list that was passed to
905	the print function.
906
907	sub PRINT { $r = shift; $$r++; print join($,,map(uc($_),@_)),$\ }
908
909	=item PRINTF this, LIST
910	X<PRINTF>
911
912	This method will be triggered every time the tied handle is printed to
913	with the C<printf()> function.
914	Beyond its self reference it also expects the format and list that was
915	passed to the printf function.
916
917	sub PRINTF {
918	shift;
919	my $fmt = shift;
920	print sprintf($fmt, @_);
921	}
922
923	=item READ this, LIST
924	X<READ>
925
926	This method will be called when the handle is read from via the C<read>
927	or C<sysread> functions.
928
929	sub READ {
930	my $self = shift;
931	my $bufref = \$_[0];
932	my(undef,$len,$offset) = @_;
933	print "READ called, \$buf=$bufref, \$len=$len, \$offset=$offset";
934	# add to $$bufref, set $len to number of characters read
935	$len;
936	}
937
938	=item READLINE this
939	X<READLINE>
940
941	This method will be called when the handle is read from via <HANDLE>.
942	The method should return undef when there is no more data.
943
944	sub READLINE { $r = shift; "READLINE called $$r times\n"; }
945
946	=item GETC this
947	X<GETC>
948
949	This method will be called when the C<getc> function is called.
950
951	sub GETC { print "Don't GETC, Get Perl"; return "a"; }
952
953	=item CLOSE this
954	X<CLOSE>
955
956	This method will be called when the handle is closed via the C<close>
957	function.
958
959	sub CLOSE { print "CLOSE called.\n" }
960
961	=item UNTIE this
962	X<UNTIE>
963
964	As with the other types of ties, this method will be called when C<untie> happens.
965	It may be appropriate to "auto CLOSE" when this occurs. See
966	L<The C<untie> Gotcha> below.
967
968	=item DESTROY this
969	X<DESTROY>
970
971	As with the other types of ties, this method will be called when the
972	tied handle is about to be destroyed. This is useful for debugging and
973	possibly cleaning up.
974
975	sub DESTROY { print "</shout>\n" }
976
977	=back
978
979	Here's how to use our little example:
980
981	tie(*FOO,'Shout');
982	print FOO "hello\n";
983	$a = 4; $b = 6;
984	print FOO $a, " plus ", $b, " equals ", $a + $b, "\n";
985	print <FOO>;
986
987	=head2 UNTIE this
988	X<UNTIE>
989
990	You can define for all tie types an UNTIE method that will be called
991	at untie(). See L<The C<untie> Gotcha> below.
992
993	=head2 The C<untie> Gotcha
994	X<untie>
995
996	If you intend making use of the object returned from either tie() or
997	tied(), and if the tie's target class defines a destructor, there is a
998	subtle gotcha you I<must> guard against.
999
1000	As setup, consider this (admittedly rather contrived) example of a
1001	tie; all it does is use a file to keep a log of the values assigned to
1002	a scalar.
1003
1004	package Remember;
1005
1006	use strict;
1007	use warnings;
1008	use IO::File;
1009
1010	sub TIESCALAR {
1011	my $class = shift;
1012	my $filename = shift;
1013	my $handle = new IO::File "> $filename"
1014	or die "Cannot open $filename: $!\n";
1015
1016	print $handle "The Start\n";
1017	bless {FH => $handle, Value => 0}, $class;
1018	}
1019
1020	sub FETCH {
1021	my $self = shift;
1022	return $self->{Value};
1023	}
1024
1025	sub STORE {
1026	my $self = shift;
1027	my $value = shift;
1028	my $handle = $self->{FH};
1029	print $handle "$value\n";
1030	$self->{Value} = $value;
1031	}
1032
1033	sub DESTROY {
1034	my $self = shift;
1035	my $handle = $self->{FH};
1036	print $handle "The End\n";
1037	close $handle;
1038	}
1039
1040	1;
1041
1042	Here is an example that makes use of this tie:
1043
1044	use strict;
1045	use Remember;
1046
1047	my $fred;
1048	tie $fred, 'Remember', 'myfile.txt';
1049	$fred = 1;
1050	$fred = 4;
1051	$fred = 5;
1052	untie $fred;
1053	system "cat myfile.txt";
1054
1055	This is the output when it is executed:
1056
1057	The Start
1058	1
1059	4
1060	5
1061	The End
1062
1063	So far so good. Those of you who have been paying attention will have
1064	spotted that the tied object hasn't been used so far. So lets add an
1065	extra method to the Remember class to allow comments to be included in
1066	the file -- say, something like this:
1067
1068	sub comment {
1069	my $self = shift;
1070	my $text = shift;
1071	my $handle = $self->{FH};
1072	print $handle $text, "\n";
1073	}
1074
1075	And here is the previous example modified to use the C<comment> method
1076	(which requires the tied object):
1077
1078	use strict;
1079	use Remember;
1080
1081	my ($fred, $x);
1082	$x = tie $fred, 'Remember', 'myfile.txt';
1083	$fred = 1;
1084	$fred = 4;
1085	comment $x "changing...";
1086	$fred = 5;
1087	untie $fred;
1088	system "cat myfile.txt";
1089
1090	When this code is executed there is no output. Here's why:
1091
1092	When a variable is tied, it is associated with the object which is the
1093	return value of the TIESCALAR, TIEARRAY, or TIEHASH function. This
1094	object normally has only one reference, namely, the implicit reference
1095	from the tied variable. When untie() is called, that reference is
1096	destroyed. Then, as in the first example above, the object's
1097	destructor (DESTROY) is called, which is normal for objects that have
1098	no more valid references; and thus the file is closed.
1099
1100	In the second example, however, we have stored another reference to
1101	the tied object in $x. That means that when untie() gets called
1102	there will still be a valid reference to the object in existence, so
1103	the destructor is not called at that time, and thus the file is not
1104	closed. The reason there is no output is because the file buffers
1105	have not been flushed to disk.
1106
1107	Now that you know what the problem is, what can you do to avoid it?
1108	Prior to the introduction of the optional UNTIE method the only way
1109	was the good old C<-w> flag. Which will spot any instances where you call
1110	untie() and there are still valid references to the tied object. If
1111	the second script above this near the top C<use warnings 'untie'>
1112	or was run with the C<-w> flag, Perl prints this
1113	warning message:
1114
1115	untie attempted while 1 inner references still exist
1116
1117	To get the script to work properly and silence the warning make sure
1118	there are no valid references to the tied object I<before> untie() is
1119	called:
1120
1121	undef $x;
1122	untie $fred;
1123
1124	Now that UNTIE exists the class designer can decide which parts of the
1125	class functionality are really associated with C<untie> and which with
1126	the object being destroyed. What makes sense for a given class depends
1127	on whether the inner references are being kept so that non-tie-related
1128	methods can be called on the object. But in most cases it probably makes
1129	sense to move the functionality that would have been in DESTROY to the UNTIE
1130	method.
1131
1132	If the UNTIE method exists then the warning above does not occur. Instead the
1133	UNTIE method is passed the count of "extra" references and can issue its own
1134	warning if appropriate. e.g. to replicate the no UNTIE case this method can
1135	be used:
1136
1137	sub UNTIE
1138	{
1139	my ($obj,$count) = @_;
1140	carp "untie attempted while $count inner references still exist" if $count;
1141	}
1142
1143	=head1 SEE ALSO
1144
1145	See L<DB_File> or L<Config> for some interesting tie() implementations.
1146	A good starting point for many tie() implementations is with one of the
1147	modules L<Tie::Scalar>, L<Tie::Array>, L<Tie::Hash>, or L<Tie::Handle>.
1148
1149	=head1 BUGS
1150
1151	The bucket usage information provided by C<scalar(%hash)> is not
1152	available. What this means is that using %tied_hash in boolean
1153	context doesn't work right (currently this always tests false,
1154	regardless of whether the hash is empty or hash elements).
1155
1156	Localizing tied arrays or hashes does not work. After exiting the
1157	scope the arrays or the hashes are not restored.
1158
1159	Counting the number of entries in a hash via C<scalar(keys(%hash))>
1160	or C<scalar(values(%hash)>) is inefficient since it needs to iterate
1161	through all the entries with FIRSTKEY/NEXTKEY.
1162
1163	Tied hash/array slices cause multiple FETCH/STORE pairs, there are no
1164	tie methods for slice operations.
1165
1166	You cannot easily tie a multilevel data structure (such as a hash of
1167	hashes) to a dbm file. The first problem is that all but GDBM and
1168	Berkeley DB have size limitations, but beyond that, you also have problems
1169	with how references are to be represented on disk. One experimental
1170	module that does attempt to address this need is DBM::Deep. Check your
1171	nearest CPAN site as described in L<perlmodlib> for source code. Note
1172	that despite its name, DBM::Deep does not use dbm. Another earlier attempt
1173	at solving the problem is MLDBM, which is also available on the CPAN, but
1174	which has some fairly serious limitations.
1175
1176	Tied filehandles are still incomplete. sysopen(), truncate(),
1177	flock(), fcntl(), stat() and -X can't currently be trapped.
1178
1179	=head1 AUTHOR
1180
1181	Tom Christiansen
1182
1183	TIEHANDLE by Sven Verdoolaege <F<[email protected]>> and Doug MacEachern <F<[email protected]>>
1184
1185	UNTIE by Nick Ing-Simmons <F<[email protected]>>
1186
1187	SCALAR by Tassilo von Parseval <F<[email protected]>>
1188
1189	Tying Arrays by Casey West <F<[email protected]>>

Note: See TracBrowser for help on using the repository browser.

Download in other formats:

Original Format