forked from ClusterLabs/pacemaker
-
Notifications
You must be signed in to change notification settings - Fork 0
/
ChangeLog
1472 lines (1429 loc) · 104 KB
/
ChangeLog
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
* Wed Mar 28 2012 Andrew Beekhof <[email protected]> Pacemaker-1.1.7-1
- Update source tarball to revision: bc7ff2c
- Statistics:
Changesets: 513
Diff: 1171 files changed, 90472 insertions(+), 19368 deletions(-)
- Changes since Pacemaker-1.1.6.1
High: ais: Prepare for corosync versions using IPC from libqb
High: cib: Correctly shutdown in the presence of peers without relying on timers
High: cib: Don't halt disk writes if the previous digest is missing
High: cib: Determine when there are no peers to respond to our shutdown request and exit
High: cib: Ensure no additional messages are processed after we begin terminating
High: Cluster: Hook up the callbacks to the corosync quorum notifications
High: Core: basename() may modify its input, do not pass in a constant
High: Core: Bug cl#5016 - Prevent failures in recurring ops from being lost
High: Core: Bug rhbz#800054 - Correctly retrieve heartbeat uuids
High: Core: Correctly determine when an XML file should be decompressed
High: Core: Correctly track the length of a string without reading from uninitialzied memory (valgrind)
High: Core: Ensure signals are handled eventually in the absense of timer sources or IPC messages
High: Core: Prevent use-of-NULL in crm_update_peer()
High: Core: Strip text nodes from on disk xml files
High: Core: Support libqb for logging
High: corosync: Consistently set the correct uuid with get_node_uuid()
High: Corosync: Correctly disconnect from corosync variants
High: Corosync: Correctly extract the node id from membership udpates
High: corosync: Correctly infer lost members from the quorum API
High: Corosync: Default to using the nodeid as the node's uuid (instead of uname)
High: corosync: Ensure we catch nodes that leave the membership, even if the ringid doesn't change
High: corosync: Hook up CPG membership
High: corosync: Relax a development assert and gracefully handle the error condition
High: corosync: Remove deprecated member of the CFG API
High: corosync: Treat CS_ERR_QUEUE_FULL the same as CS_ERR_TRY_AGAIN
High: corosync: Unset the process list when nodes dissappear on us
High: crmd: Also purge fencing results when we enter S_NOT_DC
High: crmd: Bug cl#5015 - Remove the failed operation as well as the resulting fail-count and last-failure attributes
High: crmd: Correctly determine when a node can suicide with fencing
High: crmd: Election - perform the age comparison only once
High: crmd: Fast-track shutdown if we couldn't request it via attrd
High: crmd: Leave it up to the PE to decide which ops can/cannot be reload
High: crmd: Prevent use-after-free when calling delete_resource due to CRM_OP_REPROBE
High: crmd: Supply format arguments in the correct order
High: Fencing: Add missing format parameter
High: fencing: Add the fencing topology section to the 1.1 configuration schema
High: fencing: fence_legacy - Drop spurilous host argument from status query
High: Fencing: fence_legacy - Ensure port is available as an environment variable when calling monitor
High: Fencing: fence_pcmk - don't block if nothing is specified on stdin
High: fencing: Fix log format error
High: Fencing: Fix segfault caused by passing garbage to dlsym()
High: Fencing: Fix use-of-NULL in process_remote_stonith_query()
High: Fencing: Fix use-of-NULL when listing installed devices
High: Fencing: Implement support for advanced fencing topologies: eg. kdump || (network && disk) || power
High: fencing: More gracefully handle failed 'list' operations for devices that only support a single connection
High: Fencing: Prevent duplicate free when listing devices
High: Fencing: Prevent uninitialized pointers being passed to free
High: Fencing: Prevent use-after-free, we may need the query result for subsequent operations
High: fencing: Provide enough data to construct an entry in the node's fencing history
High: Fencing: Standardize on /one/ method for clients to request members be fenced
High: Fencing: Supress errors when listing all registered devices
High: mcp: corosync_cfg_state_track was removed from the corosync API, luckily we didnt use it for anything
High: mcp: Do not specify a WorkingDirectory in the systemd unit file - startup fails if its not available
High: mcp: Set the HA_quorum_type env variable consistently with our corosync plugin
High: mcp: Shut down if one of our child processes can/should not be respawned
High: PE: Bug cl#5000 - Ensure ordering is preserved when depending on partial sets
High: PE: Bug cl#5028 - Unmanaged services should block shutdown unless in maintainence mode
High: PE: Bug cl#5038 - Prevent restart of anonymous clones when clone-max decreases
High: PE: Bug cl#5007 - Fixes use of colocation constraints with multi-state resources
High: PE: Bug cl#5014 - Prevent asymmetrical order constraints from causing resource stops
High: PE: Bug cl#5000 - Implements ability to create rsc_order constraint sets such that A can start after B or C has started.
High: PE: Correctly migrate a resource that has just migrated
High: PE: Correct return from error path
High: PE: Detect reloads of previously migrated resources
High: PE: Ensure post-migration stop actions occur before node shutdown
High: PE: Log as loudly as possible when we cannot shut down a cluster node
High: PE: Reload of a resource no longer causes a restart of dependant resources
High: PE: Support limiting the number of concurrent live migrations
High: PE: Support referencing templates in constraints
High: PE: Support of referencing resource templates in resource sets
High: PE: Support to make tickets standby for relinquishing tickets gracefully
High: stonith: A "start" operation of a stonith resource does a "monitor" on the device beyond registering it
High: stonith: Bug rhbz#745526 - Ensure stonith_admin actually gets called by fence_pcmk
High: Stonith: Ensure all nodes receive and deliver notifications of the manual override
High: stonith: Fix the stonith timeout issue (cl#5009, bnc#727498)
High: Stonith: Implement a manual override for when nodes are known to be safely off
High: Tools: Bug cl#5003 - Prevent use-after-free in crm_simlate
High: Tools: crm_mon - Support to display tickets (based on Yuusuke Iida's work)
High: Tools: crm_simulate - Support to grant/revoke/standby/activate tickets from the new ticket state section
High: Tools: Implement crm_node functionality for native corosync
High: Fix a number of potential problems reported by coverity
* Wed Aug 31 2011 Andrew Beekhof <[email protected]> 1.1.6-1
- Update source tarball to revision: 676e5f25aa46 tip
- Statistics:
Changesets: 376
Diff: 1761 files changed, 36259 insertions(+), 140578 deletions(-)
- Changes since Pacemaker-1.1.5
+ High: ais: check for retryable errors when dispatching AIS messages
+ High: ais: Correctly disconnect from Corosync and Cman based clusters
+ High: ais: Followup to previous patch - Ensure we drain the corosync queue of messages when Glib tells us there is input
+ High: ais: Handle IPC error before checking for NULL data (bnc#702907)
+ High: cib: Check the validation version before adding the originator details of a CIB change
+ High: cib: Remove disconnected remote connections from mainloop
+ High: cman: Correctly override existing fenced operations
+ High: cman: Dequeue all the cman emitted events and not only the first one leaving the others in the event's queue.
+ High: cman: Don't call fenced_join and fenced_leave when notifying cman of a fencing event.
+ High: cman: We need to run the crmd as root for CMAN so that we can ACK fencing operations
+ High: Core: Cancelled and pending operations do not count as failed
+ High: Core: Ensure there is sufficient space for EOS when building short-form option strings
+ High: Core: Fix variable expansion in pkg-config files
+ High: Core: Partial revert of accidental commit in previous patch
+ High: Core: Use dlopen to load heartbeat libraries on-demand
+ High: crmd: Bug lf#2509 - Watch for config option changes from the CIB even if we're not the DC
+ High: crmd: Bug lf#2528 - Introduce a slight delay when creating a transition to allow attrd time to perform its updates
+ High: crmd: Bug lf#2559 - Fail actions that were scheduled for a failed/fenced node
+ High: crmd: Bug lf#2584 - Allow nodes to fence themselves if they're the last one standing
+ High: crmd: Bug lf#2632 - Correctly handle nodes that return faster than stonith
+ High: crmd: Cancel timers for actions that were pending on dead nodes
+ High: crmd: Catch fence operations that claim to succeed but did not really
+ High: crmd: Do not wait for actions that were pending on dead nodes
+ High: crmd: Ensure we do not attempt to perform action on failed nodes
+ High: crmd: Prevent use-of-NULL by g_hash_table_iter_next()
+ High: crmd: Recurring actions shouldn't cause the last non-recurring action to be forgotten
+ High: crmd: Store only the last and last failed operation in the CIB
+ High: mcp: dirname() modifies the input path - pass in a copy of the logfile path
+ High: mcp: Enable stack detection logic instead of forcing 'corosync'
+ High: mcp: Fix spelling mistake in systemd service script that prevents shutdown
+ High: mcp: Shut down if corosync becomes unavailable
+ High: mcp: systemd control file is now functional
+ High: PE: Before migrating an utilization-using resource to a node, take off the load which will no longer run there (lf#2599, bnc#695440)
+ High: PE: Before migrating an utilization-using resource to a node, take off the load which will no longer run there (regression tests) (lf#2599, bnc#695440)
+ High: PE: Bug lf#2574 - Prevent shuffling by choosing the correct clone instance to stop
+ High: PE: Bug lf#2575 - Use uname for migration variables, id is a UUID on heartbeat
+ High: PE: Bug lf#2581 - Avoid group restart when clone (re)starts on an unrelated node
+ High: PE: Bug lf#2613, lf#2619 - Group migration after failures and non-default utilization policies
+ High: PE: Bug suse#707150 - Prevent services being active if dependancies on clones are not satisfied
+ High: PE: Correctly recognise which recurring operations are currently active
+ High: PE: Demote from Master does not clear previous errors
+ High: PE: Ensure restarts due to definition changes cause the start action to be re-issued not probes
+ High: PE: Ensure role is preserved for unmanaged resources
+ High: PE: Ensure unmanaged resources have the correct role set so the correct monitor operation is chosen
+ High: PE: Fix memory leak for re-allocated resources reported by valgrind
+ High: PE: Implement cluster ticket and deadman
+ High: PE: Implement resource template
+ High: pengine: Correctly determine the state of multi-state resources with a partial operation history
+ High: PE: Only allocate master/slave resources once
+ High: PE: Partial revert of 'Minor code cleanup CS: cf6bca32376c On: 2011-08-15'
+ High: PE: Resolve memory leak reported by valgrind
+ High: PE: Restore the ability to save inputs to disk
+ High: Shell: implement -w,--wait option to wait for the transition to finish
+ High: Shell: repair template list command
+ High: Shell: set of commands to examine logs, reports, etc
+ High: Stonith: Consolidate pcmk_host_map into run_stonith_agent so that it is applied consistently
+ High: Stonith: Deprecate pcmk_arg_map for the saner pcmk_host_argument
+ High: Stonith: Fix use-of-NULL by g_hash_table_lookup
+ High: Stonith: Improved pcmk_host_map parsing
+ High: Stonith: Prevent use-of-NULL by g_hash_table_lookup
+ High: Stonith: Prevent use-of-NULL when no Linux-HA stonith agents are present
+ High: stonith: Add missing entries to stonith_error2string()
+ High: Stonith: Correctly finish sending agent options if the initial write is interrupted
+ High: stonith: Correctly handle synchronous calls
+ High: stonith: Coverity - Correctly construct result list for the query API call
+ High: stonith: Coverity - Remove badly constructed memory allocation from the query API call
+ High: stonith: Ensure completed operations are recorded as such in the history
+ High: Stonith: Ensure device parameters are passed to the daemon during registration
+ High: stonith: Fix use-of-NULL in stonith_api_device_list()
+ High: stonith: stonith_admin - Prevent use of uninitialized pointer by --history command
+ High: Tools: Bug lf#2528 - Make progress when attrd_updater is called repeatedly within the dampen interval but with the same value
+ High: Tools: crm_report - Correctly extract data from the local node
+ High: Tools: crm_report - Remove newlines when detecting the node list
+ High: Tools: crm_report - Repair the ability to extract data from the local machine
+ High: Tools: crm_report - Report on all detected backtraces
* Fri Feb 11 2011 Andrew Beekhof <[email protected]> 1.1.5-1
- Update source tarball to revision: baad6636a053
- Statistics:
Changesets: 184
Diff: 605 files changed, 46103 insertions(+), 26417 deletions(-)
- Changes since Pacemaker-1.1.4
+ High: Add the ability to delegate sub-sections of the cluster to non-root users via ACLs
Needs to be enabled at compile time, not enabled by default.
+ High: ais: Bug lf#2550 - Report failed processes immediately
+ High: Core: Prevent recently introduced use-after-free in replace_xml_child()
+ High: Core: Reinstate the logic that skips past non-XML_ELEMENT_NODE children
+ High: Core: Remove extra calls to xmlCleanupParser resulting in use-after-free
+ High: Core: Repair reference to child-of-child after removal of xml_child_iter_filter from get_message_xml()
+ High: crmd: Bug lf#2545 - Ensure notify variables are accurate for stop operations
+ High: crmd: Cancel recurring operations while we're still connected to the lrmd
+ High: crmd: Reschedule the PE_START action if its not already running when we try to use it
+ High: crmd: Update failcount for failed promote and demote operations
+ High: PE: Bug lf#2445 - Avoid relying on stickness for stable clone placement
+ High: PE: Bug lf#2445 - Do not override configured clone stickiness values
+ High: PE: Bug lf#2493 - Don't imply colocation requirements when applying ordering constraints with clones
+ High: PE: Bug lf#2495 - Prevent segfault by validating the contents of ordering sets
+ High: PE: Bug lf#2508 - Correctly reconstruct the status of anonymous cloned groups
+ High: PE: Bug lf#2518 - Avoid spamming the logs with errors for orphan resources
+ High: PE: Bug lf#2544 - Prevent unstable clone placement by factoring in the current node's score before all others
+ High: PE: Bug lf#2554 - target-role alone is not sufficient to promote resources
+ High: PE: Correct target_rc for probes of inactive resources (fix regression introduced by cs:ac3f03006e95)
+ High: PE: Ensure that fencing has completed for stop actions on stonith-dependent resources (lf#2551)
+ High: PE: Only update the node's promotion score if the resource is active there
+ High: PE: Only use the promotion score from the current clone instance
+ High: PE: Prevent use-of-NULL resulting from variable shadowing spotted by Coverity
+ High: PE: Prevent use-of-NULL when there is status for an undefined node
+ High: PE: Prevet use-after-free resulting from unintended recursion when chosing a node to promote master/slave resources
+ High: Shell: don't create empty optional sections (bnc#665131)
+ High: Stonith: Teach stonith_admin to automagically obtain the current node attributes for the target from the CIB
+ High: tools: Bug lf#2527 - Prevent use-of-NULL in crm_simulate
+ High: Tools: Prevent crm_resource commands from being lost due to the use of cib_scope_local
* Wed Oct 20 2010 Andrew Beekhof <[email protected]> 1.1.4-1
- Update source tarball to revision: 75406c3eb2c1 tip
- Statistics:
Changesets: 169
Diff: 772 files changed, 56172 insertions(+), 39309 deletions(-)
- Changes since Pacemaker-1.1.3
+ Italian translation of Clusters from Scratch
+ Significant performance enhancements to the Policy Engine and CIB
+ High: cib: Bug lf#2506 - Don't remove client's when notifications fail, they might just be too big
+ High: cib: Drop invalid/failed connections from the client hashtable
+ High: cib: Ensure all diffs sent to peers have sufficient ordering information
+ High: cib: Ensure non-change diffs can preserve the ordering on the other side
+ High: cib: Fix the feature set check
+ High: cib: Include version information on our synthesised diffs when nothing changed
+ High: cib: Optimize the way we detect group/set ordering changes - 15% speedup
+ High: cib: Prevent false detection of config updates with the new diff format
+ High: cib: Reduce unnecessary copying when comparing xml objects
+ High: cib: Repair the processing of updates sent from peer nodes
+ High: cib: Revert part of a recent commit that purged still valid connections
+ High: cib: The feature set version check is only valid if the current value is non-NULL
+ High: Core: Actually removing diff markers is necessary
+ High: Core: Bug lf#2506 - Drop the compression limit because Heartbeat's IPC code sucks
+ High: Core: Cache Relax-NG schemas - profiling indicates many cycles are wasted needlessly re-parsing them
+ High: Core: Correctly compare against crm_log_level in the logging macros
+ High: Core: Correctly extract the version details from a diff
+ High: Core: Correctly hook up the RNG schema cache
+ High: Core: Correctly use lazy_xml_sort() for v2 digests
+ High: Core: Don't compress large payload elements unless we're approaching message limits
+ High: Core: Don't insert empty ID tags when applying diffs
+ High: Core: Enable the improve v2 digests
+ High: Core: Ensure ordering is preserved when applying diffs
+ High: Core: Fix the CRM_CHECK macro
+ High: Core: Modify the v2 digest algorithm so that some fields are sorted
+ High: Core: Prevent use-after-free when creating a CIB update for a timed out action
+ High: Core: Prevent use-of-NULL when cleaning up RelaxNG data structures
+ High: Core: Provide significant performance improvements by implementing versioned diffs and digests
+ High: crmd: All pending operations should be recorded, even recurring ones with high start delays
+ High: crmd: Don't abort transitions when probes are completed on a node
+ High: crmd: Don't hide stop events that time out - allowing faster recovery in the presence of overloaded hosts
+ High: crmd: Ensure the CIB is always writable on the DC by removing a timing hole
+ High: crmd: Include the correct transition details for timed out operations
+ High: crmd: Prevent use of NULL by making copies of the operation's hash table
+ High: crmd: There's no need to check the cib version from the 'added' part of diff updates
+ High: crmd: Use the supplied timeout for stop actions
+ High: mcp: Ensure valgrind is able to log its output somewhere
+ High: mcp: Use 99/01 for the start/stop sequence to avoid problems with services (such as libvirtd) started by init - Patch from Vladislav Bogdanov
+ High: PE: Ensure fencing of the DC preceeds the STONITH_DONE operation
+ High: PE: Fix memory leak introduced as part of the conversion to GHashTables
+ High: PE: Fix memory leak when processing completed migration actions
+ High: PE: Fix typo leading to use-of-NULL in the new ordering code
+ High: PE: Free memory in recently introduced helper function
+ High: PE: lf#2478 - Implement improved handling and recovery of atomic resource migrations
+ High: PE: Obtain massive speedup by prepending to the list of ordering constraints (which can grow quite large)
+ High: PE: Optimize the logic for deciding which non-grouped anonymous clone instances to probe for
+ High: PE: Prevent clones from being stopped because resources colocated with them cannot be active
+ High: PE: Try to ensure atomic migration ops occur within a single transition
+ High: PE: Use hashtables instead of linked lists for performance sensitive datastructures
+ High: PE: Use the original digest algorithm for parameter lists
+ High: stonith: cleanup children on timeout in fence_legacy
+ High: Stonith: Fix two memory leaks
+ High: Tools: crm_shadow - Avoid replacing the entire configuration (including status)
* Tue Sep 21 2010 Andrew Beekhof <[email protected]> 1.1.3-1
- Update source tarball to revision: e3bb31c56244 tip
- Statistics:
Changesets: 352
Diff: 481 files changed, 14130 insertions(+), 11156 deletions(-)
- Changes since Pacemaker-1.1.2.1
+ High: ais: Bug lf#2401 - Improved processing when the peer crmd processes join/leave
+ High: ais: Correct the logic for conecting to plugin based clusters
+ High: ais: Do not supply a process list in mcp-mode
+ High: ais: Drop support for whitetank in the 1.1 release series
+ High: ais: Get an initial dump of the node membership when connecting to quorum-based clusters
+ High: ais: Guard against saturated cpg connections
+ High: ais: Handle CS_ERR_TRY_AGAIN in more cases
+ High: ais: Move the code for finding uid before the fork so that the child does no logging
+ High: ais: Never allow quorum plugins to affect connection to the pacemaker plugin
+ High: ais: Sign everyone up for peer process updates, not just the crmd
+ High: ais: The cluster type needs to be set before initializing classic openais connections
+ High: cib: Also free query result for xpath operations that return more than one hit
+ High: cib: Attempt to resolve memory corruption when forking a child to write the cib to disk
+ High: cib: Correctly free memory when writing out the cib to disk
+ High: cib: Fix the application of unversioned diffs
+ High: cib: Remove old developmental error logging
+ High: cib: Restructure the 'valid peer' check for deciding which instructions to ignore
+ High: cman: Correctly process membership/quorum changes from the pcmk plugin. Allow other message types through untouched
+ High: cman: Filter directed messages not intended for us
+ High: cman: Grab the initial membership when we connect
+ High: cman: Keep the list of peer processes up-to-date
+ High: cman: Make sure our common hooks are called after a cman membership update
+ High: cman: Make sure we can compile without cman present
+ High: cman: Populate sender details for cpg messages
+ High: cman: Update the ringid for cman based clusters
+ High: Core: Correctly unpack HA_Messages containing multiple entries with the same name
+ High: Core: crm_count_member() should only track nodes that have the full stack up
+ High: Core: New developmental logging system inspired by the kernel and a PoC from Lars Ellenberg
+ High: crmd: All nodes should see status updates, not just he DC
+ High: crmd: Allow non-DC nodes to clear failcounts too
+ High: crmd: Base DC election on process relative uptime
+ High: crmd: Bug lf#2439 - cancel_op() can also return HA_RSCBUSY
+ High: crmd: Bug lf#2439 - Handle asynchronous notification of resource deletion events
+ High: crmd: Bug lf#2458 - Ensure stop actions always have the relevant resource attributes
+ High: crmd: Disable age as a criteria for cman based clusters, its not reliable enough
+ High: crmd: Ensure we activate the DC timer if we detect an alternate DC
+ High: crmd: Factor the nanosecond component of process uptime in elections
+ High: crmd: Fix assertion failure when performing async resource failures
+ High: crmd: Fix handling of async resource deletion results
+ High: crmd: Include the action for crm graph operations
+ High: crmd: Make sure the membership cache is accurate after a sucessful fencing operation
+ High: crmd: Make sure we always poke the FSA after a transition to clear any TE_HALT actions
+ High: crmd: Offer crm-level membership once the peer starts the crmd process
+ High: crmd: Only need to request quorum update for plugin based clusters
+ High: crmd: Prevent assertion failure for stop actions resulting from cs: 3c0bc17c6daf
+ High: crmd: Prevent everyone from loosing DC elections by correctly initializing all relevant variables
+ High: crmd: Prevent segmentation fault
+ High: crmd: several fixes for async resource delete (thanks to beekhof)
+ High: crmd: Use the correct define/size for lrm resource IDs
+ High: Introduce two new cluster types 'cman' and 'corosync', replaces 'quorum_provider' concept
+ High: mcp: Add missing headers when built without heartbeat support
+ High: mcp: Correctly initialize the string containing the list of active daemons
+ High: mcp: Fix macro expansion in init script
+ High: mcp: Fix the expansion of the pid file in the init script
+ High: mcp: Handle CS_ERR_TRY_AGAIN when connecting to libcfg
+ High: mcp: Make sure we can compile the mcp without cman present
+ High: mcp: New master control process for (re)spawning pacemaker daemons
+ High: mcp: Read config early so we can re-initialize logging asap if daemonizing
+ High: mcp: Rename the mcp binary to pacemakerd and create a 'pacemaker' init script
+ High: mcp: Resend our process list after every CPG change
+ High: mcp: Tell chkconfig we need to shut down early on
+ High: PE: Avoid creating invalid ordering constraints for probes that are not needed
+ High: PE: Bug lf#1959 - Fail unmanaged resources should not prevent other services from shutting down
+ High: PE: Bug lf#2422 - Ordering dependencies on partially active groups not observed properly
+ High: PE: Bug lf#2424 - Use notify oepration definition if it exists in the configuration
+ High: PE: Bug lf#2433 - No services should be stopped until probes finish
+ High: PE: Bug lf#2453 - Enforce clone ordering in the absense of colocation constraints
+ High: PE: Bug lf#2476 - Repair on-fail=block for groups and primitive resources
+ High: PE: Correctly detect when there is a real failcount that expired and needs to be cleared
+ High: PE: Correctly handle pseudo action creation
+ High: PE: Correctly order clone startup after group/clone start
+ High: PE: Correct use-after-free introduced in the prior patch
+ High: PE: Do not demote resources because something that requires it can not run
+ High: PE: Fix colocation for interleaved clones
+ High: PE: Fix colocation with partially active groups
+ High: PE: Fix potential use-after-free defect from coverity
+ High: PE: Fix previous merge
+ High: PE: Fix use-after-free in order_actions() reported by valgrind
+ High: PE: Make the current data set a global variable so it does not need to be passed around everywhere
+ High: PE: Prevent endless loop when looking for operation definitions in the configuration
+ High: PE: Prevent segfault by ensuring the arguments to do_calculations() are initialized
+ High: PE: Rewrite the ordering constraint logic to be simplicity, clarity and maintainability
+ High: PE: Wait until stonith is available, do not fall back to shutdown for nodes requesting termination
+ High: Resolve coverity RESOURCE_LEAK defects
+ High: Shell: Complete the transition to using crm_attribute instead of crm_failcount and crm_standby
+ High: stonith: Advertise stonith-ng options in the metadata
+ High: stonith: Bug lf#2461 - Prevent segfault by not looking up operations if the hashtable has not been initialized yet
+ High: stonith: Bug lf#2473 - Add the timeout at the top level where the daemon is looking for it
+ High: Stonith: Bug lf#2473 - Ensure stonith operations complete within the timeout and are terminated if they run too long
+ High: stonith: Bug lf#2473 - Ensure timeouts are included for fencing operations
+ High: stonith: Bug lf#2473 - Gracefully handle remote operations that arrive late (after we have done notifications)
+ High: stonith: Correctly parse pcmk_host_list parameters that appear on a single line
+ High: stonith: Map poweron/poweroff back to on/off expected by the stonith tool from cluster-glue
+ High: stonith: pass the configuration to the stonith program via environment variables (bnc#620781)
+ High: Stonith: Use the timeout specified by the user
+ High: Support starting plugin-based Pacemaker clusters with the MCP as well
+ High: Tools: Bug lf#2456 - Fix assertion failure in crm_resource
+ High: tools: crm_node - Repair the ability to connect to openais based clusters
+ High: tools: crm_node - Use the correct short option for --cman
+ High: tools: crm_report - corosync.conf wont necessarily contain the text 'pacemaker' anymore
+ High: Tools: crm_simulate - Fix use-after-free in when terminating
+ High: tools: crm_simulate - Resolve coverity USE_AFTER_FREE defect
+ High: Tools: Drop the 'pingd' daemon and resource agent in favor of ocf:pacemaker:ping
+ High: Tools: Fix recently introduced use-of-NULL
+ High: Tools: Fix use-after-free defects from coverity
* Wed May 12 2010 Andrew Beekhof <[email protected]> 1.1.2-1
- Update source tarball to revision: c25c972a25cc tip
- Statistics:
Changesets: 339
Diff: 708 files changed, 37918 insertions(+), 10584 deletions(-)
- Changes since Pacemaker-1.1.1
+ High: ais: Do not count votes from offline nodes and calculate current votes before sending quorum data
+ High: ais: Ensure the list of active processes sent to clients is always up-to-date
+ High: ais: Look for the correct conf variable for turning on file logging
+ High: ais: Need to find a better and thread-safe way to set core_uses_pid. Disable for now.
+ High: ais: Use the threadsafe version of getpwnam
+ High: Core: Bump the feature set due to the new failcount expiry feature
+ High: Core: fix memory leaks exposed by valgrind
+ High: Core: Bug lf#2414 - Prevent use-after-free reported by valgrind when doing xpath based deletions
+ High: crmd: Bug lf#2414 - Prevent use-after-free of the PE connection after it dies
+ High: crmd: Bug lf#2414 - Prevent use-after-free of the stonith-ng connection
+ High: crmd: Bug lf#2401 - Improved detection of partially active peers
+ High: crmd: Bug lf#2379 - Ensure the cluster terminates when the PE is not available
+ High: crmd: Do not allow the target_rc to be misused by resource agents
+ High: crmd: Do not ignore action timeouts based on FSA state
+ High: crmd: Ensure we dont get stuck in S_PENDING if we loose an election to someone that never talks to us again
+ High: crmd: Fix memory leaks exposed by valgrind
+ High: crmd: Remove race condition that could lead to multiple instances of a clone being active on a machine
+ High: crmd: Send erase_status_tag() calls to the local CIB when the DC is fenced, since there is no DC to accept them
+ High: crmd: Use global fencing notifications to prevent secondary fencing operations of the DC
+ High: PE: Bug lf#2317 - Avoid needless restart of primitive depending on a clone
+ High: PE: Bug lf#2361 - Ensure clones observe mandatory ordering constraints if the LHS is unrunnable
+ High: PE: Bug lf#2383 - Combine failcounts for all instances of an anonymous clone on a host
+ High: PE: Bug lf#2384 - Fix intra-set colocation and ordering
+ High: PE: Bug lf#2403 - Enforce mandatory promotion (colocation) constraints
+ High: PE: Bug lf#2412 - Correctly find clone instances by their prefix
+ High: PE: Do not be so quick to pull the trigger on nodes that are coming up
+ High: PE: Fix memory leaks exposed by valgrind
+ High: PE: Rewrite native_merge_weights() to avoid Fix use-after-free
+ High: Shell: Bug bnc#590035 - always reload status if working with the cluster
+ High: Shell: Bug bnc#592762 - Default to using the status section from the live CIB
+ High: Shell: Bug lf#2315 - edit multiple meta_attributes sets in resource management
+ High: Shell: Bug lf#2221 - enable comments
+ High: Shell: Bug bnc#580492 - implement new cibstatus interface and commands
+ High: Shell: Bug bnc#585471 - new cibstatus import command
+ High: Shell: check timeouts also against the default-action-timeout property
+ High: Shell: new configure filter command
+ High: Tools: crm_mon - fix memory leaks exposed by valgrind
* Tue Feb 16 2010 Andrew Beekhof <[email protected]> - 1.1.1-1
- First public release of Pacemaker 1.1
- Package reference documentation in a doc subpackage
- Move cts into a subpackage so that it can be easily consumed by others
- Update source tarball to revision: 17d9cd4ee29f
+ New stonith daemon that supports global notifications
+ Service placement influenced by the physical resources
+ A new tool for simulating failures and the cluster’s reaction to them
+ Ability to serialize an otherwise unrelated a set of resource actions (eg. Xen migrations)
* Wed Feb 10 2010 Andrew Beekhof <[email protected]> - 1.0.7-4
- Rebuild for heartbeat 3.0.2-2
* Wed Feb 10 2010 Andrew Beekhof <[email protected]> - 1.0.7-3
- Rebuild for cluster-glue 1.0.3
* Tue Jan 19 2010 Andrew Beekhof <[email protected]> - 1.0.7-2
- Rebuild for corosync 1.2.0
* Mon Jan 18 2010 Andrew Beekhof <[email protected]> - 1.0.7-1
- Update source tarball to revision: 2eed906f43e9 (stable-1.0) tip
- Statistics:
Changesets: 193
Diff: 220 files changed, 15933 insertions(+), 8782 deletions(-)
- Changes since 1.0.5-4
+ High: PE: Bug 2213 - Ensure groups process location constraints so that clone-node-max works for cloned groups
+ High: PE: Bug lf#2153 - non-clones should not restart when clones stop/start on other nodes
+ High: PE: Bug lf#2209 - Clone ordering should be able to prevent startup of dependant clones
+ High: PE: Bug lf#2216 - Correctly identify the state of anonymous clones when deciding when to probe
+ High: PE: Bug lf#2225 - Operations that require fencing should wait for 'stonith_complete' not 'all_stopped'.
+ High: PE: Bug lf#2225 - Prevent clone peers from stopping while another is instance is (potentially) being fenced
+ High: PE: Correctly anti-colocate with a group
+ High: PE: Correctly unpack ordering constraints for resource sets to avoid graph loops
+ High: Tools: crm: load help from crm_cli.txt
+ High: Tools: crm: resource sets (bnc#550923)
+ High: Tools: crm: support for comments (LF 2221)
+ High: Tools: crm: support for description attribute in resources/operations (bnc#548690)
+ High: Tools: hb2openais: add EVMS2 CSM processing (and other changes) (bnc#548093)
+ High: Tools: hb2openais: do not allow empty rules, clones, or groups (LF 2215)
+ High: Tools: hb2openais: refuse to convert pure EVMS volumes
+ High: cib: Ensure the loop for login message terminates
+ High: cib: Finally fix reliability of receiving large messages over remote plaintext connections
+ High: cib: Fix remote notifications
+ High: cib: For remote connections, default to CRM_DAEMON_USER since thats the only one that the cib can validate the password for using PAM
+ High: cib: Remote plaintext - Retry sending parts of the message that did not fit the first time
+ High: crmd: Ensure batch-limit is correctly enforced
+ High: crmd: Ensure we have the latest status after a transition abort
+ High (bnc#547579,547582): Tools: crm: status section editing support
+ High: shell: Add allow-migrate as allowed meta-attribute (bnc#539968)
+ Medium: Build: Do not automatically add -L/lib, it could cause 64-bit arches to break
+ Medium: PE: Bug lf#2206 - rsc_order constraints always use score at the top level
+ Medium: PE: Only complain about target-role=master for non m/s resources
+ Medium: PE: Prevent non-multistate resources from being promoted through target-role
+ Medium: PE: Provide a default action for resource-set ordering
+ Medium: PE: Silently fix requires=fencing for stonith resources so that it can be set in op_defaults
+ Medium: Tools: Bug lf#2286 - Allow the shell to accept template parameters on the command line
+ Medium: Tools: Bug lf#2307 - Provide a way to determin the nodeid of past cluster members
+ Medium: Tools: crm: add update method to template apply (LF 2289)
+ Medium: Tools: crm: direct RA interface for ocf class resource agents (LF 2270)
+ Medium: Tools: crm: direct RA interface for stonith class resource agents (LF 2270)
+ Medium: Tools: crm: do not add score which does not exist
+ Medium: Tools: crm: do not consider warnings as errors (LF 2274)
+ Medium: Tools: crm: do not remove sets which contain id-ref attribute (LF 2304)
+ Medium: Tools: crm: drop empty attributes elements
+ Medium: Tools: crm: exclude locations when testing for pathological constraints (LF 2300)
+ Medium: Tools: crm: fix exit code on single shot commands
+ Medium: Tools: crm: fix node delete (LF 2305)
+ Medium: Tools: crm: implement -F (--force) option
+ Medium: Tools: crm: rename status to cibstatus (LF 2236)
+ Medium: Tools: crm: revisit configure commit
+ Medium: Tools: crm: stay in crm if user specified level only (LF 2286)
+ Medium: Tools: crm: verify changes on exit from the configure level
+ Medium: ais: Some clients such as gfs_controld want a cluster name, allow one to be specified in corosync.conf
+ Medium: cib: Clean up logic for receiving remote messages
+ Medium: cib: Create valid notification control messages
+ Medium: cib: Indicate where the remote connection came from
+ Medium: cib: Send password prompt to stderr so that stdout can be redirected
+ Medium: cts: Fix rsh handling when stdout is not required
+ Medium: doc: Fill in the section on removing a node from an AIS-based cluster
+ Medium: doc: Update the docs to reflect the 0.6/1.0 rolling upgrade problem
+ Medium: doc: Use Publican for docbook based documentation
+ Medium: fencing: stonithd: add metadata for stonithd instance attributes (and support in the shell)
+ Medium: fencing: stonithd: ignore case when comparing host names (LF 2292)
+ Medium: tools: Make crm_mon functional with remote connections
+ Medium: xml: Add stopped as a supported role for operations
+ Medium: xml: Bug bnc#552713 - Treat node unames as text fields not IDs
+ Medium: xml: Bug lf#2215 - Create an always-true expression for empty rules when upgrading from 0.6
* Thu Oct 29 2009 Andrew Beekhof <[email protected]> - 1.0.5-4
- Include the fixes from CoroSync integration testing
- Move the resource templates - they are not documentation
- Ensure documentation is placed in a standard location
- Exclude documentation that is included elsewhere in the package
- Update the tarball from upstream to version ee19d8e83c2a
+ High: cib: Correctly clean up when both plaintext and tls remote ports are requested
+ High: PE: Bug bnc#515172 - Provide better defaults for lt(e) and gt(e) comparisions
+ High: PE: Bug lf#2197 - Allow master instances placemaker to be influenced by colocation constraints
+ High: PE: Make sure promote/demote pseudo actions are created correctly
+ High: PE: Prevent target-role from promoting more than master-max instances
+ High: ais: Bug lf#2199 - Prevent expected-quorum-votes from being populated with garbage
+ High: ais: Prevent deadlock - dont try to release IPC message if the connection failed
+ High: cib: For validation errors, send back the full CIB so the client can display the errors
+ High: cib: Prevent use-after-free for remote plaintext connections
+ High: crmd: Bug lf#2201 - Prevent use-of-NULL when running heartbeat
* Wed Oct 13 2009 Andrew Beekhof <[email protected]> - 1.0.5-3
- Update the tarball from upstream to version 38cd629e5c3c
+ High: Core: Bug lf#2169 - Allow dtd/schema validation to be disabled
+ High: PE: Bug lf#2106 - Not all anonymous clone children are restarted after configuration change
+ High: PE: Bug lf#2170 - stop-all-resources option had no effect
+ High: PE: Bug lf#2171 - Prevent groups from starting if they depend on a complex resource which can not
+ High: PE: Disable resource management if stonith-enabled=true and no stonith resources are defined
+ High: PE: do not include master score if it would prevent allocation
+ High: ais: Avoid excessive load by checking for dead children every 1s (instead of 100ms)
+ High: ais: Bug rh#525589 - Prevent shutdown deadlocks when running on CoroSync
+ High: ais: Gracefully handle changes to the AIS nodeid
+ High: crmd: Bug bnc#527530 - Wait for the transition to complete before leaving S_TRANSITION_ENGINE
+ High: crmd: Prevent use-after-free with LOG_DEBUG_3
+ Medium: xml: Mask the "symmetrical" attribute on rsc_colocation constraints (bnc#540672)
+ Medium (bnc#520707): Tools: crm: new templates ocfs2 and clvm
+ Medium: Build: Invert the disable ais/heartbeat logic so that --without (ais|heartbeat) is available to rpmbuild
+ Medium: PE: Bug lf#2178 - Indicate unmanaged clones
+ Medium: PE: Bug lf#2180 - Include node information for all failed ops
+ Medium: PE: Bug lf#2189 - Incorrect error message when unpacking simple ordering constraint
+ Medium: PE: Correctly log resources that would like to start but can not
+ Medium: PE: Stop ptest from logging to syslog
+ Medium: ais: Include version details in plugin name
+ Medium: crmd: Requery the resource metadata after every start operation
* Fri Aug 21 2009 Tomas Mraz <[email protected]> - 1.0.5-2.1
- rebuilt with new openssl
* Wed Aug 19 2009 Andrew Beekhof <[email protected]> - 1.0.5-2
- Add versioned perl dependency as specified by
https://fedoraproject.org/wiki/Packaging/Perl#Packages_that_link_to_libperl
- No longer remove RPATH data, it prevents us finding libperl.so and no other
libraries were being hardcoded
- Compile in support for heartbeat
- Conditionally add heartbeat-devel and corosynclib-devel to the -devel requirements
depending on which stacks are supported
* Mon Aug 17 2009 Andrew Beekhof <[email protected]> - 1.0.5-1
- Add dependency on resource-agents
- Use the version of the configure macro that supplies --prefix, --libdir, etc
- Update the tarball from upstream to version 462f1569a437 (Pacemaker 1.0.5 final)
+ High: Tools: crm_resource - Advertise --move instead of --migrate
+ Medium: Extra: New node connectivity RA that uses system ping and attrd_updater
+ Medium: crmd: Note that dc-deadtime can be used to mask the brokeness of some switches
* Tue Aug 11 2009 Ville Skyttä <[email protected]> - 1.0.5-0.7.c9120a53a6ae.hg
- Use bzipped upstream tarball.
* Wed Jul 29 2009 Andrew Beekhof <[email protected]> - 1.0.5-0.6.c9120a53a6ae.hg
- Add back missing build auto* dependancies
- Minor cleanups to the install directive
* Tue Jul 28 2009 Andrew Beekhof <[email protected]> - 1.0.5-0.5.c9120a53a6ae.hg
- Add a leading zero to the revision when alphatag is used
* Tue Jul 28 2009 Andrew Beekhof <[email protected]> - 1.0.5-0.4.c9120a53a6ae.hg
- Incorporate the feedback from the cluster-glue review
- Realistically, the version is a 1.0.5 pre-release
- Use the global directive instead of define for variables
- Use the haclient/hacluster group/user instead of daemon
- Use the _configure macro
- Fix install dependancies
* Fri Jul 24 2009 Andrew Beekhof <[email protected]> - 1.0.4-3
- Initial Fedora checkin
- Include an AUTHORS and license file in each package
- Change the library package name to pacemaker-libs to be more
Fedora compliant
- Remove execute permissions from xml related files
- Reference the new cluster-glue devel package name
- Update the tarball from upstream to version c9120a53a6ae
+ High: PE: Only prevent migration if the clone dependency is stopping/starting on the target node
+ High: PE: Bug 2160 - Dont shuffle clones due to colocation
+ High: PE: New implementation of the resource migration (not stop/start) logic
+ Medium: Tools: crm_resource - Prevent use-of-NULL by requiring a resource name for the -A and -a options
+ Medium: PE: Prevent use-of-NULL in find_first_action()
* Tue Jul 14 2009 Andrew Beekhof <[email protected]> - 1.0.4-2
- Reference authors from the project AUTHORS file instead of listing in description
- Change Source0 to reference the Mercurial repo
- Cleaned up the summaries and descriptions
- Incorporate the results of Fedora package self-review
* Thu Jun 04 2009 Andrew Beekhof <[email protected]> - 1.0.4-1
- Update source tarball to revision: 1d87d3e0fc7f (stable-1.0)
- Statistics:
Changesets: 209
Diff: 266 files changed, 12010 insertions(+), 8276 deletions(-)
- Changes since Pacemaker-1.0.3
+ High (bnc#488291): ais: do not rely on byte endianness on ptr cast
+ High (bnc#507255): Tools: crm: delete rsc/op_defaults (these meta_attributes are killing me)
+ High (bnc#507255): Tools: crm: import properly rsc/op_defaults
+ High (LF 2114): Tools: crm: add support for operation instance attributes
+ High: ais: Bug lf#2126 - Messages replies cannot be routed to transient clients
+ High: ais: Fix compilation for the latest Corosync API (v1719)
+ High: attrd: Do not perform all updates as complete refreshes
+ High: cib: Fix huge memory leak affecting heartbeat-based clusters
+ High: Core: Allow xpath queries to match attributes
+ High: Core: Generate the help text directly from a tool options struct
+ High: Core: Handle differences in 0.6 messaging format
+ High: crmd: Bug lf#2120 - All transient node attribute updates need to go via attrd
+ High: crmd: Correctly calculate how long an FSA action took to avoid spamming the logs with errors
+ High: crmd: Fix another large memory leak affecting Heartbeat based clusters
+ High: lha: Restore compatability with older versions
+ High: PE: Bug bnc#495687 - Filesystem is not notified of successful STONITH under some conditions
+ High: PE: Make running a cluster with STONITH enabled but no STONITH resources an error and provide details on resolutions
+ High: PE: Prevent use-ofNULL when using resource ordering sets
+ High: PE: Provide inter-notification ordering guarantees
+ High: PE: Rewrite the notification code to be understanable and extendable
+ High: Tools: attrd - Prevent race condition resulting in the cluster forgetting the node wishes to shut down
+ High: Tools: crm: regression tests
+ High: Tools: crm_mon - Fix smtp notifications
+ High: Tools: crm_resource - Repair the ability to query meta attributes
+ Low Build: Bug lf#2105 - Debian package should contain pacemaker doc and crm templates
+ Medium (bnc#507255): Tools: crm: handle empty rsc/op_defaults properly
+ Medium (bnc#507255): Tools: crm: use the right obj_type when creating objects from xml nodes
+ Medium (LF 2107): Tools: crm: revisit exit codes in configure
+ Medium: cib: Do not bother validating updates that only affect the status section
+ Medium: Core: Include supported stacks in version information
+ Medium: crmd: Record in the CIB, the cluster infrastructure being used
+ Medium: cts: Do not combine crm_standby arguments - the wrapper can not process them
+ Medium: cts: Fix the CIBAusdit class
+ Medium: Extra: Refresh showscores script from Dominik
+ Medium: PE: Build a statically linked version of ptest
+ Medium: PE: Correctly log the actions for resources that are being recovered
+ Medium: PE: Correctly log the occurance of promotion events
+ Medium: PE: Implememt node health based on a patch from Mark Hamzy
+ Medium: Tools: Add examples to help text outputs
+ Medium: Tools: crm: catch syntax errors for configure load
+ Medium: Tools: crm: implement erasing nodes in configure erase
+ Medium: Tools: crm: work with parents only when managing xml objects
+ Medium: Tools: crm_mon - Add option to run custom notification program on resource operations (Patch by Dominik Klein)
+ Medium: Tools: crm_resource - Allow --cleanup to function on complex resources and cluster-wide
+ Medium: Tools: haresource2cib.py - Patch from horms to fix conversion error
+ Medium: Tools: Include stack information in crm_mon output
+ Medium: Tools: Two new options (--stack,--constraints) to crm_resource for querying how a resource is configured
* Wed Apr 08 2009 Andrew Beekhof <[email protected]> - 1.0.3-1
- Update source tarball to revision: b133b3f19797 (stable-1.0) tip
- Statistics:
Changesets: 383
Diff: 329 files changed, 15471 insertions(+), 15119 deletions(-)
- Changes since Pacemaker-1.0.2
+ Added tag SLE11-HAE-GMC for changeset 9196be9830c2
+ High: ais plugin: Fix quorum calculation (bnc#487003)
+ High: ais: Another memory fix leak in error path
+ High: ais: Bug bnc#482847, bnc#482905 - Force a clean exit of OpenAIS once Pacemaker has finished unloading
+ High: ais: Bug bnc#486858 - Fix update_member() to prevent spamming clients with membership events containing no changes
+ High: ais: Centralize all quorum calculations in the ais plugin and allow expected votes to be configured int he cib
+ High: ais: Correctly handle a return value of zero from openais_dispatch_recv()
+ High: ais: Disable logging to a file
+ High: ais: Fix memory leak in error path
+ High: ais: IPC messages are only in scope until a response is sent
+ High: All signal handlers used with CL_SIGNAL() need to be as minimal as possible
+ High: cib: Bug bnc#482885 - Simplify CIB disk-writes to prevent data loss. Required a change to the backup filename format
+ High: cib: crmd: Revert part of 9782ab035003. Complex shutdown routines need G_main_add_SignalHandler to avoid race coditions
+ High: crm: Avoid infinite loop during crm configure edit (bnc#480327)
+ High: crmd: Avoid a race condition by waiting for the attrd update to trigger a transition automatically
+ High: crmd: Bug bnc#480977 - Prevent extra, partial, shutdown when a node restarts too quickly
+ High: crmd: Bug bnc#480977 - Prevent extra, partial, shutdown when a node restarts too quickly (verified)
+ High: crmd: Bug bnc#489063 - Ensure the DC is always unset after we 'loose' an election
+ High: crmd: Bug BSC#479543 - Correctly find the migration source for timed out migrate_from actions
+ High: crmd: Call crm_peer_init() before we start the FSA - prevents a race condition when used with Heartbeat
+ High: crmd: Erasing the status section should not be forced to the local node
+ High: crmd: Fix memory leak in cib notication processing code
+ High: crmd: Fix memory leak in transition graph processing
+ High: crmd: Fix memory leaks found by valgrind
+ High: crmd: More memory leaks fixes found by valgrind
+ High: fencing: stonithd: is_heartbeat_cluster is a no-no if there is no heartbeat support
+ High: PE: Bug bnc#466788 - Exclude nodes that can not run resources
+ High: PE: Bug bnc#466788 - Make colocation based on node attributes work
+ High: PE: Bug BNC#478687 - Do not crash when clone-max is 0
+ High: PE: Bug bnc#488721 - Fix id-ref expansion for clones, the doc-root for clone children is not the cib root
+ High: PE: Bug bnc#490418 - Correctly determine node state for nodes wishing to be terminated
+ High: PE: Bug LF#2087 - Correctly parse the state of anonymous clones that have multiple instances on a given node
+ High: PE: Bug lf#2089 - Meta attributes are not inherited by clone children
+ High: PE: Bug lf#2091 - Correctly restart modified resources that were found active by a probe
+ High: PE: Bug lf#2094 - Fix probe ordering for cloned groups
+ High: PE: Bug LF:2075 - Fix large pingd memory leaks
+ High: PE: Correctly attach orphaned clone children to their parent
+ High: PE: Correctly handle terminate node attributes that are set to the output from time()
+ High: PE: Ensure orphaned clone members are hooked up to the parent when clone-max=0
+ High: PE: Fix memory leak in LogActions
+ High: PE: Fix the determination of whether a group is active
+ High: PE: Look up the correct promotion preference for anonymous masters
+ High: PE: Simplify handling of start failures by changing the default migration-threshold to INFINITY
+ High: PE: The ordered option for clones no longer causes extra start/stop operations
+ High: RA: Bug bnc#490641 - Shut down dlm_controld with -TERM instead of -KILL
+ High: RA: pingd: Set default ping interval to 1 instead of 0 seconds
+ High: Resources: pingd - Correctly tell the ping daemon to shut down
+ High: Tools: Bug bnc#483365 - Ensure the command from cluster_test includes a value for --log-facility
+ High: Tools: cli: fix and improve delete command
+ High: Tools: crm: add and implement templates
+ High: Tools: crm: add support for command aliases and some common commands (i.e. cd,exit)
+ High: Tools: crm: create top configuration nodes if they are missing
+ High: Tools: crm: fix parsing attributes for rules (broken by the previous changeset)
+ High: Tools: crm: new ra set of commands
+ High: Tools: crm: resource agents information management
+ High: Tools: crm: rsc/op_defaults
+ High: Tools: crm: support for no value attribute in nvpairs
+ High: Tools: crm: the new configure monitor command
+ High: Tools: crm: the new configure node command
+ High: Tools: crm_mon - Prevent use-of-NULL when summarizing an orphan
+ High: Tools: hb2openais: create clvmd clone for respawn evmsd in ha.cf
+ High: Tools: hb2openais: fix a serious recursion bug in xml node processing
+ High: Tools: hb2openais: fix ocfs2 processing
+ High: Tools: pingd - prevent double free of getaddrinfo() output in error path
+ High: Tools: The default re-ping interval for pingd should be 1s not 1ms
+ Medium (bnc#479049): Tools: crm: add validation of resource type for the configure primitive command
+ Medium (bnc#479050): Tools: crm: add help for RA parameters in tab completion
+ Medium (bnc#479050): Tools: crm: add tab completion for primitive params/meta/op
+ Medium (bnc#479050): Tools: crm: reimplement cluster properties completion
+ Medium (bnc#486968): Tools: crm: listnodes function requires no parameters (do not mix completion with other stuff)
+ Medium: ais: Remove the ugly hack for dampening AIS membership changes
+ Medium: cib: Fix memory leaks by using mainloop_add_signal
+ Medium: cib: Move more logging to the debug level (was info)
+ Medium: cib: Overhaul the processing of synchronous replies
+ Medium: Core: Add library functions for instructing the cluster to terminate nodes
+ Medium: crmd: Add new expected-quorum-votes option
+ Medium: crmd: Allow up to 5 retires when an attrd update fails
+ Medium: crmd: Automatically detect and use new values for crm_config options
+ Medium: crmd: Bug bnc#490426 - Escalated shutdowns stall when there are pending resource operations
+ Medium: crmd: Clean up and optimize the DC election algorithm
+ Medium: crmd: Fix memory leak in shutdown
+ Medium: crmd: Fix memory leaks spotted by Valgrind
+ Medium: crmd: Ingore join messages from hosts other than our DC
+ Medium: crmd: Limit the scope of resource updates to the status section
+ Medium: crmd: Prevent the crmd from being respawned if its told to shut down when it did not ask to be
+ Medium: crmd: Re-check the election status after membership events
+ Medium: crmd: Send resource updates via the local CIB during elections
+ Medium: PE: Bug bnc#491441 - crm_mon does not display operations returning 'uninstalled' correctly
+ Medium: PE: Bug lf#2101 - For location constraints, role=Slave is equivalent to role=Started
+ Medium: PE: Clean up the API - removed ->children() and renamed ->find_child() to fine_rsc()
+ Medium: PE: Compress the display of healthy anonymous clones
+ Medium: PE: Correctly log the actions for resources that are being recovered
+ Medium: PE: Determin a promotion score for complex resources
+ Medium: PE: Ensure clones always have a value for globally-unique
+ Medium: PE: Prevent orphan clones from being allocated
+ Medium: RA: controld: Return proper exit code for stop op.
+ Medium: Tools: Bug bnc#482558 - Fix logging test in cluster_test
+ Medium: Tools: Bug bnc#482828 - Fix quoting in cluster_test logging setup
+ Medium: Tools: Bug bnc#482840 - Include directory path to CTSlab.py
+ Medium: Tools: crm: add more user input checks
+ Medium: Tools: crm: do not check resource status of we are working with a shadow
+ Medium: Tools: crm: fix id-refs and allow reference to top objects (i.e. primitive)
+ Medium: Tools: crm: ignore comments in the CIB
+ Medium: Tools: crm: multiple column output would not work with small lists
+ Medium: Tools: crm: refuse to delete running resources
+ Medium: Tools: crm: rudimentary if-else for templates
+ Medium: Tools: crm: Start/stop clones via target-role.
+ Medium: Tools: crm_mon - Compress the node status for healthy and offline nodes
+ Medium: Tools: crm_shadow - Return 0/cib_ok when --create-empty succeeds
+ Medium: Tools: crm_shadow - Support -e, the short form of --create-empty
+ Medium: Tools: Make attrd quieter
+ Medium: Tools: pingd - Avoid using various clplumbing functions as they seem to leak
+ Medium: Tools: Reduce pingd logging
* Mon Feb 16 2009 Andrew Beekhof <[email protected]> - 1.0.2-1
- Update source tarball to revision: d232d19daeb9 (stable-1.0) tip
- Statistics:
Changesets: 441
Diff: 639 files changed, 20871 insertions(+), 21594 deletions(-)
- Changes since Pacemaker-1.0.1
+ High (bnc#450815): Tools: crm cli: do not generate id for the operations tag
+ High: ais: Add support for the new AIS IPC layer
+ High: ais: Always set header.error to the correct default: SA_AIS_OK
+ High: ais: Bug BNC#456243 - Ensure the membership cache always contains an entry for the local node
+ High: ais: Bug BNC:456208 - Prevent deadlocks by not logging in the child process before exec()
+ High: ais: By default, disable supprt for the WIP openais IPC patch
+ High: ais: Detect and handle situations where ais and the crm disagree on the node name
+ High: ais: Ensure crm_peer_seq is updated after a membership update
+ High: ais: Make sure all IPC header fields are set to sane defaults
+ High: ais: Repair and streamline service load now that whitetank startup functions correctly
+ High: build: create and install doc files
+ High: cib: Allow clients without mainloop to connect to the cib
+ High: cib: CID:18 - Fix use-of-NULL in cib_perform_op
+ High: cib: CID:18 - Repair errors introduced in b5a18704477b - Fix use-of-NULL in cib_perform_op
+ High: cib: Ensure diffs contain the correct values of admin_epoch
+ High: cib: Fix four moderately sized memory leaks detected by Valgrind
+ High: Core: CID:10 - Prevent indexing into an array of schemas with a negative value
+ High: Core: CID:13 - Fix memory leak in log_data_element
+ High: Core: CID:15 - Fix memory leak in crm_get_peer
+ High: Core: CID:6 - Fix use-of-NULL in copy_ha_msg_input
+ High: Core: Fix crash in the membership code preventing node shutdown
+ High: Core: Fix more memory leaks foudn by valgrind
+ High: Core: Prevent unterminated strings after decompression
+ High: crmd: Bug BNC:467995 - Delay marking STONITH operations complete until STONITH tells us so
+ High: crmd: Bug LF:1962 - Do not NACK peers because they are not (yet) in our membership. Just ignore them.
+ High: crmd: Bug LF:2010 - Ensure fencing cib updates create the node_state entry if needed to preent re-fencing during cluster startup
+ High: crmd: Correctly handle reconnections to attrd
+ High: crmd: Ensure updates for lost migrate operations indicate which node it tried to migrating to
+ High: crmd: If there are no nodes to finalize, start an election.
+ High: crmd: If there are no nodes to welcome, start an election.
+ High: crmd: Prevent node attribute loss by detecting attrd disconnections immediately
+ High: crmd: Prevent node re-probe loops by ensuring manditory actions always complete
+ High: PE: Bug 2005 - Fix startup ordering of cloned stonith groups
+ High: PE: Bug 2006 - Correctly reprobe cloned groups
+ High: PE: Bug BNC:465484 - Fix the no-quorum-policy=suicide option
+ High: PE: Bug LF:1996 - Correctly process disabled monitor operations
+ High: PE: CID:19 - Fix use-of-NULL in determine_online_status
+ High: PE: Clones now default to globally-unique=false
+ High: PE: Correctly calculate the number of available nodes for the clone to use
+ High: PE: Only shoot online nodes with no-quorum-policy=suicide
+ High: PE: Prevent on-fail settings being ignored after a resource is successfully stopped
+ High: PE: Prevent use-of-NULL for failed migrate actions in process_rsc_state()
+ High: PE: Remove an optimization for the terminate node attribute that caused the cluster to block indefinitly
+ High: PE: Repar the ability to colocate based on node attributes other than uname
+ High: PE: Start the correct monitor operation for unmanaged masters
+ High: stonith: CID:3 - Fix another case of exceptionally poor error handling by the original stonith developers
+ High: stonith: CID:5 - Checking for NULL and then dereferencing it anyway is an interesting approach to error handling
+ High: stonithd: Sending IPC to the cluster is a privileged operation
+ High: stonithd: wrong checks for shmid (0 is a valid id)
+ High: Tools: attrd - Correctly determine when an attribute has stopped changing and should be committed to the CIB
+ High: Tools: Bug 2003 - pingd does not correctly detect failures when the interface is down
+ High: Tools: Bug 2003 - pingd does not correctly handle node-down events on multi-NIC systems
+ High: Tools: Bug 2021 - pingd does not detect sequence wrapping correctly, incorrectly reports nodes offline
+ High: Tools: Bug BNC:468066 - Do not use the result of uname() when its no longer in scope
+ High: Tools: Bug BNC:473265 - crm_resource -L dumps core
+ High: Tools: Bug LF:2001 - Transient node attributes should be set via attrd
+ High: Tools: Bug LF:2036 - crm_resource cannot set/get parameters for cloned resources
+ High: Tools: Bug LF:2046 - Node attribute updates are lost because attrd can take too long to start
+ High: Tools: Cause the correct clone instance to be failed with crm_resource -F
+ High: Tools: cluster_test - Allow the user to select a stack and fix CTS invocation
+ High: Tools: crm cli: allow rename only if the resource is stopped
+ High: Tools: crm cli: catch system errors on file operations
+ High: Tools: crm cli: completion for ids in configure
+ High: Tools: crm cli: drop '-rsc' from attributes for order constraint
+ High: Tools: crm cli: exit with an appropriate exit code
+ High: Tools: crm cli: fix wrong order of action and resource in order constraint
+ High: Tools: crm cli: fox wrong exit code
+ High: Tools: crm cli: improve handling of cib attributes
+ High: Tools: crm cli: new command: configure rename
+ High: Tools: crm cli: new command: configure upgrade
+ High: Tools: crm cli: new command: node delete
+ High: Tools: crm cli: prevent key errors on missing cib attributes
+ High: Tools: crm cli: print long help for help topics
+ High: Tools: crm cli: return on syntax error when parsing score
+ High: Tools: crm cli: rsc_location can be without nvpairs
+ High: Tools: crm cli: short node preference location constraint
+ High: Tools: crm cli: sometimes, on errors, level would change on single shot use
+ High: Tools: crm cli: syntax: drop a bunch of commas (remains of help tables conversion)
+ High: Tools: crm cli: verify user input for sanity
+ High: Tools: crm: find expressions within rules (do not always skip xml nodes due to used id)
+ High: Tools: crm_master should not define a set id now that attrd is used. Defining one can break lookups
+ High: Tools: crm_mon Use the OID assigned to the project by IANA for SNMP traps
+ Medium (bnc#445622): Tools: crm cli: improve the node show command and drop node status
+ Medium (LF 2009): stonithd: improve timeouts for remote fencing
+ Medium: ais: Allow dead peers to be removed from membership calculations
+ Medium: ais: Pass node deletion events on to clients
+ Medium: ais: Sanitize ipc usage
+ Medium: ais: Supply the node uname in addtion to the id
+ Medium: Build: Clean up configure to ensure NON_FATAL_CFLAGS is consistent with CFLAGS (ie. includes -g)
+ Medium: Build: Install cluster_test
+ Medium: Build: Use more restrictive CFLAGS and fix the resulting errors
+ Medium: cib: CID:20 - Fix potential use-after-free in cib_native_signon
+ Medium: Core: Bug BNC:474727 - Set a maximum time to wait for IPC messages
+ Medium: Core: CID:12 - Fix memory leak in decode_transition_magic error path
+ Medium: Core: CID:14 - Fix memory leak in calculate_xml_digest error path
+ Medium: Core: CID:16 - Fix memory leak in date_to_string error path
+ Medium: Core: Try to track down the cause of XML parsing errors
+ Medium: crmd: Bug BNC:472473 - Do not wait excessive amounts of time for lost actions
+ Medium: crmd: Bug BNC:472473 - Reduce the transition timeout to action_timeout+network_delay
+ Medium: crmd: Do not fast-track the processing of LRM refreshes when there are pending actions.
+ Medium: crmd: do_dc_join_filter_offer - Check the 'join' message is for the current instance before deciding to NACK peers
+ Medium: crmd: Find option values without having to do a config upgrade
+ Medium: crmd: Implement shutdown using a transient node attribute
+ Medium: crmd: Update the crmd options to use dashes instead of underscores
+ Medium: cts: Add 'cluster reattach' to the suite of automated regression tests
+ Medium: cts: cluster_test - Make some usability enhancements
+ Medium: CTS: cluster_test - suggest a valid port number
+ Medium: CTS: Fix python import order
+ Medium: cts: Implement an automated SplitBrain test
+ Medium: CTS: Remove references to deleted classes
+ Medium: Extra: Resources - Use HA_VARRUN instead of HA_RSCTMP for state files as Heartbeat removes HA_RSCTMP at startup
+ Medium: HB: Bug 1933 - Fake crmd_client_status_callback() calls because HB does not provide them for already running processes
+ Medium: PE: CID:17 - Fix memory leak in find_actions_by_task error path
+ Medium: PE: CID:7,8 - Prevent hypothetical use-of-NULL in LogActions
+ Medium: PE: Defer logging the actions performed on a resource until we have processed ordering constraints
+ Medium: PE: Remove the symmetrical attribute of colocation constraints
+ Medium: Resources: pingd - fix the meta defaults
+ Medium: Resources: Stateful - Add missing meta defaults
+ Medium: stonithd: exit if we the pid file cannot be locked
+ Medium: Tools: Allow attrd clients to specify the ID the attribute should be created with
+ Medium: Tools: attrd - Allow attribute updates to be performed from a hosts peer
+ Medium: Tools: Bug LF:1994 - Clean up crm_verify return codes
+ Medium: Tools: Change the pingd defaults to ping hosts once every second (instead of 5 times every 10 seconds)
+ Medium: Tools: cibmin - Detect resource operations with a view to providing email/snmp/cim notification
+ Medium: Tools: crm cli: add back symmetrical for order constraints
+ Medium: Tools: crm cli: generate role in location when converting from xml
+ Medium: Tools: crm cli: handle shlex exceptions
+ Medium: Tools: crm cli: keep order of help topics
+ Medium: Tools: crm cli: refine completion for ids in configure
+ Medium: Tools: crm cli: replace inf with INFINITY
+ Medium: Tools: crm cli: streamline cib load and parsing
+ Medium: Tools: crm cli: supply provider only for ocf class primitives
+ Medium: Tools: crm_mon - Add support for sending mail notifications of resource events
+ Medium: Tools: crm_mon - Include the DC version in status summary
+ Medium: Tools: crm_mon - Sanitize startup and option processing
+ Medium: Tools: crm_mon - switch to event-driven updates and add support for sending snmp traps
+ Medium: Tools: crm_shadow - Replace the --locate option with the saner --edit
+ Medium: Tools: hb2openais: do not remove Evmsd resources, but replace them with clvmd
+ Medium: Tools: hb2openais: replace crmadmin with crm_mon
+ Medium: Tools: hb2openais: replace the lsb class with ocf for o2cb
+ Medium: Tools: hb2openais: reuse code
+ Medium: Tools: LF:2029 - Display an error if crm_resource is used to reset the operation history of non-primitive resources
+ Medium: Tools: Make pingd resilient to attrd failures
+ Medium: Tools: pingd - fix the command line switches
+ Medium: Tools: Rename ccm_tool to crm_node
* Tue Nov 18 2008 Andrew Beekhof <[email protected]> - 1.0.1-1
- Update source tarball to revision: 6fc5ce8302ab (stable-1.0) tip
- Statistics:
Changesets: 170
Diff: 816 files changed, 7633 insertions(+), 6286 deletions(-)
- Changes since Pacemaker-1.0.1
+ High: ais: Allow the crmd to get callbacks whenever a node state changes
+ High: ais: Create an option for starting the mgmtd daemon automatically
+ High: ais: Ensure HA_RSCTMP exists for use by resource agents
+ High: ais: Hook up the openais.conf config logging options
+ High: ais: Zero out the PID of disconnecting clients
+ High: cib: Ensure global updates cause a disk write when appropriate
+ High: Core: Add an extra snaity check to getXpathResults() to prevent segfaults
+ High: Core: Do not redefine __FUNCTION__ unnecessarily
+ High: Core: Repair the ability to have comments in the configuration
+ High: crmd: Bug:1975 - crmd should wait indefinitely for stonith operations to complete
+ High: crmd: Ensure PE processing does not occur for all error cases in do_pe_invoke_callback
+ High: crmd: Requests to the CIB should cause any prior PE calculations to be ignored
+ High: heartbeat: Wait for membership 'up' events before removing stale node status data
+ High: PE: Bug LF:1988 - Ensure recurring operations always have the correct target-rc set
+ High: PE: Bug LF:1988 - For unmanaged resources we need to skip the usual can_run_resources() checks
+ High: PE: Ensure the terminate node attribute is handled correctly
+ High: PE: Fix optional colocation
+ High: PE: Improve up the detection of 'new' nodes joining the cluster
+ High: PE: Prevent assert failures in master_color() by ensuring unmanaged masters are always reallocated to their current location
+ High: Tools: crm cli: parser: return False on syntax error and None for comments
+ High: Tools: crm cli: unify template and edit commands
+ High: Tools: crm_shadow - Show more line number information after validation failures
+ High: Tools: hb2openais: add option to upgrade the CIB to v3.0
+ High: Tools: hb2openais: add U option to getopts and update usage
+ High: Tools: hb2openais: backup improved and multiple fixes
+ High: Tools: hb2openais: fix class/provider reversal
+ High: Tools: hb2openais: fix testing
+ High: Tools: hb2openais: move the CIB update to the end
+ High: Tools: hb2openais: update logging and set logfile appropriately
+ High: Tools: LF:1969 - Attrd never sets any properties in the cib
+ High: Tools: Make attrd functional on OpenAIS
+ Medium: ais: Hook up the options for specifying the expected number of nodes and total quorum votes
+ Medium: ais: Look for pacemaker options inside the service block with 'name: pacemaker' instead of creating an addtional configuration block
+ Medium: ais: Provide better feedback when nodes change nodeids (in openais.conf)
+ Medium: cib: Always store cib contents on disk with num_updates=0
+ Medium: cib: Ensure remote access ports are cleaned up on shutdown
+ Medium: crmd: Detect deleted resource operations automatically
+ Medium: crmd: Erase a nodes resource operations and transient attributes after a successful STONITH
+ Medium: crmd: Find a more appropriate place to update quorum and refresh attrd attributes
+ Medium: crmd: Fix the handling of unexpected PE exits to ensure the current CIB is stored
+ Medium: crmd: Fix the recording of pending operations in the CIB
+ Medium: crmd: Initiate an attrd refresh _after_ the status section has been fully repopulated
+ Medium: crmd: Only the DC should update quorum in an openais cluster
+ Medium: Ensure meta attributes are used consistantly
+ Medium: PE: Allow group and clone level resource attributes
+ Medium: PE: Bug N:437719 - Ensure scores from colocated resources count when allocating groups
+ Medium: PE: Prevent lsb scripts from being used in globally unique clones
+ Medium: PE: Make a best-effort guess at a migration threshold for people with 0.6 configs
+ Medium: Resources: controld - ensure we are part of a clone with globally_unique=false
+ Medium: Tools: attrd - Automatically refresh all attributes after a CIB replace operation