我测试了一下模型的推理速度,和论文中Table 8.给出的速度有较大出入。在4090显卡上,对256*256的张量,平均推理速度为1.30s。我的测试代码如下:
`import glob
import os
import torch
import argparse
import time
from models.EVSSM import EVSSM
from torchvision.transforms import functional as F
from torch.utils.data import Dataset, DataLoader
from PIL import Image as Image
from tqdm import tqdm
def main():
model = EVSSM()
model.eval().cuda()
# state_dict = torch.load(args.test_model)['params']
# model.load_state_dict(state_dict,strict = True)
# 1. 预热(不计时)
input_tensor = torch.randn(1, 3, 256, 256).cuda()
for _ in range(10):
with torch.no_grad():
_ = model(input_tensor)
# 2. 正式测量(使用CUDA事件精确计时)
st_time = time.time()
with torch.no_grad():
for _ in range(100): # 多次测量取平均
output = model(input_tensor)
ed_time = time.time()
avg_time = (ed_time - st_time) / 100
print(f"平均推理时间: {avg_time:.4f} 秒")
print(f"FPS: {1/avg_time:.2f}")
if name == 'main':
main()`
我测试了一下模型的推理速度,和论文中Table 8.给出的速度有较大出入。在4090显卡上,对256*256的张量,平均推理速度为1.30s。我的测试代码如下:
`import glob
import os
import torch
import argparse
import time
from models.EVSSM import EVSSM
from torchvision.transforms import functional as F
from torch.utils.data import Dataset, DataLoader
from PIL import Image as Image
from tqdm import tqdm
def main():
if name == 'main':
main()`