[CT][MS][CI] CombineMomentumWeight and FusedWeightScaleApplyMomentum error

name	about	labels
Bug Report	Use this template for reporting a bug	kind/bug

Environment

Hardware Environment(Ascend/GPU/CPU):

Uncomment only one /device <> line, hit enter to put that in a new line, and remove leading whitespaces from that line:

/device gpu

Software Environment:
-- MindSpore version (source or binary):
-- Python version (e.g., Python 3.7.5):
-- OS platform and distribution (e.g., Linux Ubuntu 16.04):
-- GCC/Compiler version (if compiled from source):

Related testcase

test_ir_fusion_combine_momentum_weight
test_ir_fusion_fused_scale_momentum_decay

Steps to reproduce the issue

Describe the current behavior

def test_ir_fusion_fused_scale_momentum_decay():
        clear_files()
        context.set_context(save_graphs=True)
    
        epoch_size = 1
        batch_size = 1
        num_classes = 3
    
        input_np = np.random.uniform(0.0, 1.0,
                size=[batch_size, 3, 2, 2]).astype(np.float16)
        label_np = np.ones([batch_size, num_classes]).astype(np.float32)
        net = Net(3, num_classes)
        loss = SoftmaxCrossEntropyWithLogits(sparse=False)
        opt = Momentum(learning_rate=0.01, momentum=0.9,
                params=filter(lambda x: x.requires_grad, net.get_parameters()),
                weight_decay=1.5, loss_scale=1.5)
        lsm = FixedLossScaleManager(loss_scale=1.5, drop_overflow_update=False)
        net = amp.build_train_network(net, opt, loss,
                level="O3", loss_scale_manager=lsm)
        net.set_train()
        for epoch in range(0, epoch_size):
            net(Tensor(input_np), Tensor(label_np))
    
        result = find_files('hwopt*momentum_scale_fusion*ir',
                'FusedWeightScaleApplyMomentum')
>       assert result == '2'
E       AssertionError: assert '0' == '2'
E         - 0
E         + 2

def test_ir_fusion_combine_momentum_weight():
        clear_files()
        context.set_context(save_graphs=True)
    
        epoch_size = 1
        batch_size = 1
        num_classes = 3
    
        input_np = np.random.uniform(0.0, 1.0,
                size=[batch_size, 3, 2, 2]).astype(np.float16)
        label_np = np.ones([batch_size, num_classes]).astype(np.float32)
        net = Net2(3, num_classes)
        loss = SoftmaxCrossEntropyWithLogits(sparse=False)
        conv_params = list(filter(lambda x: 'conv' in x.name,
            net.trainable_params()))
        no_conv_params = list(filter(lambda x: 'conv' not in x.name,
            net.trainable_params()))
        group_params = [{'params': conv_params, 'weight_decay': 0.3},
                        {'params': no_conv_params, 'lr': 0.04},
                        {'order_params': net.trainable_params()}]
        opt = Momentum(group_params, learning_rate=0.03, momentum=0.9,
                loss_scale=1.3, weight_decay=0.7)
        lsm = FixedLossScaleManager(loss_scale=1.3, drop_overflow_update=False)
        net = amp.build_train_network(net, opt, loss, level="O3",
                loss_scale_manager=lsm)
        net.set_train()
        for epoch in range(0, epoch_size):
            net(Tensor(input_np), Tensor(label_np))
    
        result = find_files('hwopt*combine_momentum*ir',
                'CombineMomentumWeight')
>       assert result == '2'
E       AssertionError: assert '0' == '2'

Describe the expected behavior

pass

Related log / screenshot

Special notes for this issue

Appearance & Root Cause
问题：IR图中未出现预期融合算子
原因：由于ME前端的修改，进入到后端的IR图与原设定的apply_momentum_weight_scale_fusion pass不匹配，Cast(input)-->Depend(Cast(input))，不能正常融合。

Fix Solution
解决方法：修改适配apply_momentum_weight_scale_fusion pass，使其能够处理Depend(Cast(input))的情况。
关联PR：!16210:update apply_momentum_weight_scale_fusion pass

pytest -s
test_ir_fusion_combine_momentum_weight
test_ir_fusion_fused_scale_momentum_decay

result pass

GVP MindSpore / mindspore

内容风险标识

Environment

Related testcase

Steps to reproduce the issue

Describe the current behavior

Describe the expected behavior

Related log / screenshot

Special notes for this issue

评论 (2)

GVPMindSpore / mindspore

内容风险标识

[CT][MS][CI] CombineMomentumWeight and FusedWeightScaleApplyMomentum error

Environment

Related testcase

Steps to reproduce the issue

Describe the current behavior

Describe the expected behavior

Related log / screenshot

Special notes for this issue

评论 (2)

搜索帮助

GVP MindSpore / mindspore